TOP ACTIVATIONS
MAX = 5.747

the countryside , sn or ke lling over coral reefs ,
Thank you for signing up . For more from The Nation
" public good ," but now that 's an oxy mor
. Get our daily newsletter Upgrade your inbox and
i - eq 1 ) { $ font 1 = New
' not quite even handed ' 2016 primary Sanders : '
first to hear about Nation Travels destinations , and explore the
1 / 2 "] [ EC O " A 46 "]
' not quite even handed ' 2016 primary Sanders : '
VI . Sign Up : Stay on top of
1 m Krist en Rou pen ian , whose
minions . IGN : Sure . Before we move
out The Shield in Live + Same Day numbers .
. Get our daily newsletter Upgrade your inbox and
this . For the sake of comple teness , here is
IM for the sake of putting something in the grid
men or women mocking Val iz ade h . <|endoftext|> NEW
' not quite even handed ' 2016 primary Sanders : '
a Guard ) -> Option < Shared < ' a ,
all minions . IGN : Sure . Before we

MIN ACTIVATIONS
MIN = -6.967

his band Gr ong Gr ong had just blown me away
than most . - July 6 , 2010 R iff Tra
all . ( UPDATE : Sherman suffered a rupt
to state , is of no relevance . Here we see
, Cardinals 1 AL READ Y COMP ET ED : vs
15 , 2010 Que er magnet ? Reviewer :
original copy . should be clone able , and
of a triple revolution of which technological revolution - with the
allow . P R IM A P AR S
the air ? Indeed f out g assing ( t )
_ re pos . contents ] WHERE id IN ( SELECT
100 % ON ALL PERSON EL UPDATE : 05 45 z
P R IM A S EC UN D P
in 3 D with To make this possible , in the
. - July 6 , 2010 R iff Tra x
b xc 6 { ( 6 s )} 26 . Rd
D igg reddit Facebook St
D igg reddit Facebook St
fully clothed and Le Ch iff re ( Peter Lor re
bad news pl s . Good updates ? No more bad

INTERVAL 2.569 - 5.747
CONTAINS 0.796%

was later dismissed by the Judge due to lack of evidence
Path Type Leaf ) { $ b mp = new -
thing . IGN : The armour option
orted by L inda C . Deg ut is
I believe is the first national survey on the pending proposal

INTERVAL -0.610 - 2.569
CONTAINS 72.951%

left , other than what was reported by these mainstream outlets
. And it is obvious that the player did something bad
more than Paul McCartney has pulled in , Mr
I can use it as lap board , and I can
I was young and vulnerable , and didn 't have anywhere

INTERVAL -3.788 - -0.610
CONTAINS 26.188%

can find a static version of it here .
than Fear of the Heaven s and
.) So Mark Ham ill , best known for
ly . The agency also would be given the
RC 4 key stream X OR ed with plain text

INTERVAL -6.967 - -3.788
CONTAINS 0.065%

els bear , To grunt and sweat under a
." And we went on to discuss pattern speed ( about
and the upcoming " Hi Hi P uffy Am i Y
bad news pl s . Good updates ? No more bad
December 5 , 2009 Que ers are Here ....
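
The four INTERVAL ranges above appear to be equal-width bins spanning MIN to MAX, and their CONTAINS percentages sum to 100%. The sketch below reproduces the bin edges from the two extremes. The function name `interval_edges` and the choice of four bins are assumptions made for illustration; the dashboard's own binning code is not shown in this dump.

```python
def interval_edges(lo, hi, n_bins=4):
    """Return the n_bins + 1 edges of equal-width bins covering [lo, hi]."""
    width = (hi - lo) / n_bins
    return [lo + i * width for i in range(n_bins + 1)]


# MIN and MAX as reported in the dump.
edges = interval_edges(-6.967, 5.747)

# Interval boundaries as reported in the dump, lowest to highest.
reported_edges = [-6.967, -3.788, -0.610, 2.569, 5.747]

# CONTAINS percentages as reported, lowest interval first.
reported_share = [0.065, 26.188, 72.951, 0.796]

for computed, reported in zip(edges, reported_edges):
    # The dump rounds to three decimals, so compare with a tolerance.
    print(f"computed {computed:+.4f}  reported {reported:+.3f}")

print(f"total share: {sum(reported_share):.3f}%")
```

Read this way, roughly 99% of sampled activations fall in the two middle bins, and the TOP and MIN ACTIVATIONS lists are drawn from the two sparse outer bins.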

TOKEN DETAILS

Token the
Token ID262
Feature activation-4.98e-01

Pos logit contributions
 WIN+0.237
annabin+0.226
 backward+0.221
itably+0.217
oyd+0.215
Neg logit contributions
irement-0.221
akura-0.218
ashi-0.211
 Turing-0.209
Mother-0.209
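
In each per-token block the label and its value are fused on one line: the token string runs straight into its signed logit contribution (for example ` WIN+0.237`), and `Feature activation` runs straight into a number in scientific notation. A minimal parsing sketch, assuming every contribution line ends in a signed decimal and every activation line uses the `d.dde±dd` form seen here; the helper names `parse_logit_line` and `parse_activation` are hypothetical.

```python
import re

# Token text (possibly empty or with a leading space) followed by a signed decimal.
LOGIT_LINE = re.compile(r"^(.*?)([+-]\d+\.\d+)$")

# "Feature activation" fused with a signed number in scientific notation.
ACTIVATION_LINE = re.compile(r"^Feature activation([+-]\d+\.\d+e[+-]\d+)$")


def parse_logit_line(line):
    """Split a fused 'token+0.123' line into (token, contribution)."""
    match = LOGIT_LINE.match(line)
    if match is None:
        raise ValueError(f"not a logit contribution line: {line!r}")
    return match.group(1), float(match.group(2))


def parse_activation(line):
    """Extract the float from a fused 'Feature activation-4.98e-01' line."""
    match = ACTIVATION_LINE.match(line)
    if match is None:
        raise ValueError(f"not a feature activation line: {line!r}")
    return float(match.group(1))


print(parse_logit_line(" WIN+0.237"))
print(parse_logit_line("075-3.755"))
print(parse_activation("Feature activation-4.98e-01"))
```

The non-greedy first group keeps digits that belong to the token itself (as in `075`) on the token side, and a line holding only a number yields an empty token string instead of failing.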
Token countryside
Token ID25708
Feature activation-3.05e-02

Pos logit contributions
 WIN+0.368
annabin+0.355
 backward+0.346
itably+0.340
oyd+0.338
Neg logit contributions
irement-0.285
akura-0.282
ashi-0.271
 Turing-0.269
Mother-0.268
Token,
Token ID11
Feature activation+5.84e-01

Pos logit contributions
 WIN+0.019
annabin+0.018
 backward+0.017
itably+0.017
oyd+0.017
Neg logit contributions
irement-0.021
akura-0.021
ashi-0.020
 Turing-0.020
Mother-0.020
Token sn
Token ID3013
Feature activation-2.23e-01

Pos logit contributions
irement+0.395
akura+0.393
ashi+0.378
 Turing+0.376
Mother+0.375
Neg logit contributions
 WIN-0.370
annabin-0.357
 backward-0.343
itably-0.339
oyd-0.336
Tokenor
Token ID273
Feature activation+1.26e+00

Pos logit contributions
 WIN+0.126
annabin+0.120
 backward+0.116
itably+0.114
oyd+0.113
Neg logit contributions
irement-0.166
akura-0.165
ashi-0.159
 Turing-0.159
Mother-0.159
Tokenke
Token ID365
Feature activation+5.75e+00

Pos logit contributions
irement+0.864
akura+0.856
ashi+0.830
Mother+0.814
mother+0.814
Neg logit contributions
 WIN-0.786
annabin-0.765
 backward-0.734
itably-0.712
oyd-0.712
Tokenlling
Token ID2680
Feature activation+6.72e-01

Pos logit contributions
akura+3.348
ashi+3.243
mother+3.207
irement+3.202
Mother+3.056
Neg logit contributions
annabin-4.028
 WIN-3.988
itably-3.825
 backward-3.811
075-3.755
Token over
Token ID625
Feature activation+1.41e+00

Pos logit contributions
irement+0.460
akura+0.456
ashi+0.442
 Turing+0.438
Mother+0.436
Neg logit contributions
 WIN-0.420
annabin-0.406
 backward-0.387
itably-0.384
oyd-0.382
Token coral
Token ID29537
Feature activation-3.80e-01

Pos logit contributions
akura+0.937
irement+0.936
ashi+0.902
 Turing+0.893
Mother+0.889
Neg logit contributions
 WIN-0.905
annabin-0.876
 backward-0.839
itably-0.830
oyd-0.824
Token reefs
Token ID43190
Feature activation-1.77e-01

Pos logit contributions
 WIN+0.254
annabin+0.245
 backward+0.236
itably+0.233
oyd+0.231
Neg logit contributions
irement-0.244
akura-0.243
ashi-0.234
 Turing-0.232
Mother-0.232
Token,
Token ID11
Feature activation-1.66e-02

Pos logit contributions
 WIN+0.109
annabin+0.104
 backward+0.101
itably+0.099
oyd+0.098
Neg logit contributions
irement-0.123
akura-0.122
ashi-0.118
 Turing-0.117
Mother-0.117
TokenThank
Token ID10449
Feature activation+1.50e+00

Pos logit contributions
irement+0.261
akura+0.255
Mother+0.251
ashi+0.248
mother+0.247
Neg logit contributions
 WIN-0.300
annabin-0.290
 backward-0.281
oyd-0.277
itably-0.274
Token you
Token ID345
Feature activation+3.68e+00

Pos logit contributions
irement+0.636
akura+0.606
mother+0.582
ashi+0.579
Mother+0.572
Neg logit contributions
 WIN-1.330
annabin-1.304
itably-1.271
oyd-1.256
 backward-1.246
Token for
Token ID329
Feature activation+2.89e+00

Pos logit contributions
irement+2.065
akura+2.041
ashi+2.004
mother+1.990
 Turing+1.974
Neg logit contributions
 WIN-2.656
annabin-2.586
itably-2.496
oyd-2.493
ospace-2.438
Token signing
Token ID8415
Feature activation+2.62e+00

Pos logit contributions
irement+1.530
akura+1.528
 Turing+1.475
ashi+1.465
mother+1.436
Neg logit contributions
 WIN-2.185
annabin-2.153
itably-2.084
oyd-2.073
 backward-2.033
Token up
Token ID510
Feature activation+4.77e+00

Pos logit contributions
irement+1.541
 Turing+1.503
akura+1.491
mother+1.489
ashi+1.480
Neg logit contributions
 WIN-1.822
annabin-1.793
oyd-1.759
itably-1.706
ospace-1.682
Token.
Token ID13
Feature activation+5.72e+00

Pos logit contributions
irement+2.513
mother+2.481
 Turing+2.468
Mother+2.461
akura+2.449
Neg logit contributions
annabin-3.497
 WIN-3.496
 backward-3.359
oyd-3.349
itably-3.333
Token For
Token ID1114
Feature activation+4.13e+00

Pos logit contributions
 Turing+2.804
irement+2.728
Mother+2.717
mother+2.673
 Consortium+2.658
Neg logit contributions
annabin-4.500
oyd-4.396
 WIN-4.355
itably-4.340
 backward-4.316
Token more
Token ID517
Feature activation+2.22e+00

Pos logit contributions
irement+2.502
mother+2.454
 Turing+2.450
akura+2.414
ashi+2.394
Neg logit contributions
annabin-2.730
 WIN-2.714
itably-2.611
oyd-2.538
OHN-2.525
Token from
Token ID422
Feature activation+2.70e+00

Pos logit contributions
irement+1.436
akura+1.416
mother+1.402
Mother+1.382
ashi+1.376
Neg logit contributions
 WIN-1.405
annabin-1.403
oyd-1.342
itably-1.324
ospace-1.312
Token The
Token ID383
Feature activation+1.53e+00

Pos logit contributions
irement+1.528
Mother+1.517
mother+1.515
akura+1.497
 Turing+1.491
Neg logit contributions
 WIN-1.900
annabin-1.896
oyd-1.845
 backward-1.837
ospace-1.813
Token Nation
Token ID8741
Feature activation+3.15e+00

Pos logit contributions
irement+0.882
Mother+0.862
mother+0.854
akura+0.852
 Turing+0.837
Neg logit contributions
 WIN-1.101
annabin-1.080
oyd-1.035
 backward-1.035
itably-1.035
Token "
Token ID366
Feature activation-7.40e-01

Pos logit contributions
 WIN+0.002
annabin+0.002
itably+0.002
 backward+0.001
oyd+0.001
Neg logit contributions
irement-0.002
akura-0.002
 Turing-0.002
ashi-0.002
Mother-0.002
Tokenpublic
Token ID11377
Feature activation-7.76e-01

Pos logit contributions
 WIN+0.466
annabin+0.445
 backward+0.428
itably+0.422
oyd+0.419
Neg logit contributions
irement-0.506
akura-0.504
Mother-0.488
ashi-0.487
mother-0.486
Token good
Token ID922
Feature activation-2.06e+00

Pos logit contributions
 WIN+0.458
annabin+0.440
 backward+0.418
itably+0.415
oyd+0.408
Neg logit contributions
irement-0.563
akura-0.556
 Turing-0.541
ashi-0.539
Mother-0.537
Token,"
Token ID553
Feature activation+1.09e+00

Pos logit contributions
 WIN+1.023
annabin+0.968
 backward+0.923
itably+0.907
oyd+0.895
Neg logit contributions
irement-1.687
akura-1.674
ashi-1.627
 Turing-1.619
Mother-1.615
Token but
Token ID475
Feature activation+4.45e+00

Pos logit contributions
irement+0.692
akura+0.689
 Turing+0.670
ashi+0.665
Mother+0.660
Neg logit contributions
 WIN-0.725
annabin-0.707
itably-0.672
oyd-0.665
 backward-0.663
Token now
Token ID783
Feature activation+5.72e+00

Pos logit contributions
irement+2.544
akura+2.530
 Turing+2.486
ashi+2.446
Mother+2.432
Neg logit contributions
 WIN-3.122
annabin-3.103
itably-2.953
oyd-2.894
075-2.853
Token that
Token ID326
Feature activation+1.21e+00

Pos logit contributions
irement+3.487
akura+3.467
 Turing+3.361
ashi+3.353
Mother+3.324
Neg logit contributions
 WIN-3.780
annabin-3.733
itably-3.496
075-3.432
oyd-3.428
Token's
Token ID338
Feature activation+3.90e-01

Pos logit contributions
irement+0.778
akura+0.772
 Turing+0.752
ashi+0.747
Mother+0.739
Neg logit contributions
 WIN-0.797
annabin-0.773
itably-0.732
 backward-0.730
oyd-0.723
Token an
Token ID281
Feature activation+1.63e-01

Pos logit contributions
irement+0.257
akura+0.255
ashi+0.246
 Turing+0.245
Mother+0.243
Neg logit contributions
 WIN-0.253
annabin-0.246
itably-0.233
oyd-0.231
 backward-0.231
Token oxy
Token ID10661
Feature activation+5.51e-01

Pos logit contributions
irement+0.116
akura+0.115
ashi+0.112
 Turing+0.111
Mother+0.110
Neg logit contributions
 WIN-0.097
annabin-0.095
itably-0.090
 backward-0.089
oyd-0.088
Tokenmor
Token ID4491
Feature activation+2.01e-01

Pos logit contributions
irement+0.271
akura+0.270
ashi+0.257
mother+0.254
 Turing+0.253
Neg logit contributions
 WIN-0.451
annabin-0.436
 backward-0.425
itably-0.419
oyd-0.417
Token.
Token ID13
Feature activation-1.75e-02

Pos logit contributions
irement+1.711
akura+1.677
ashi+1.629
 Turing+1.618
Mother+1.613
Neg logit contributions
 WIN-1.710
annabin-1.671
itably-1.585
oyd-1.580
 backward-1.568
Token
Token ID198
Feature activation-6.45e-01

Pos logit contributions
 WIN+0.012
annabin+0.011
 backward+0.011
itably+0.011
oyd+0.011
Neg logit contributions
irement-0.011
akura-0.011
ashi-0.011
 Turing-0.011
Mother-0.011
Token
Token ID198
Feature activation-9.43e-01

Pos logit contributions
 WIN+0.436
annabin+0.426
 backward+0.411
itably+0.407
oyd+0.406
Neg logit contributions
irement-0.403
akura-0.397
Mother-0.388
mother-0.387
 Turing-0.386
TokenGet
Token ID3855
Feature activation+1.38e-01

Pos logit contributions
 WIN+0.599
annabin+0.574
 backward+0.557
itably+0.546
oyd+0.541
Neg logit contributions
irement-0.638
akura-0.634
ashi-0.612
 Turing-0.607
Mother-0.605
Token our
Token ID674
Feature activation+2.25e+00

Pos logit contributions
irement+0.084
akura+0.082
ashi+0.080
Mother+0.080
 Turing+0.079
Neg logit contributions
 WIN-0.096
annabin-0.093
 backward-0.090
itably-0.089
oyd-0.089
Token daily
Token ID4445
Feature activation+5.64e+00

Pos logit contributions
irement+1.816
akura+1.793
ashi+1.763
Mother+1.753
mother+1.747
Neg logit contributions
 WIN-1.071
annabin-1.053
oyd-1.009
itably-0.976
 backward-0.975
Token newsletter
Token ID13129
Feature activation+8.72e-01

Pos logit contributions
irement+3.352
akura+3.254
 Turing+3.249
Mother+3.172
 Intercept+3.139
Neg logit contributions
annabin-3.834
oyd-3.823
 WIN-3.760
ospace-3.689
itably-3.607
Token Upgrade
Token ID24236
Feature activation+1.94e+00

Pos logit contributions
irement+0.462
akura+0.460
ashi+0.442
Mother+0.433
 Turing+0.430
Neg logit contributions
 WIN-0.687
annabin-0.656
 backward-0.645
itably-0.630
oyd-0.622
Token your
Token ID534
Feature activation+1.00e+00

Pos logit contributions
irement+0.965
akura+0.932
Mother+0.901
ashi+0.900
 Turing+0.898
Neg logit contributions
 WIN-1.563
annabin-1.524
oyd-1.483
 backward-1.474
itably-1.440
Token inbox
Token ID13734
Feature activation-4.74e-01

Pos logit contributions
irement+0.776
akura+0.764
ashi+0.743
 Turing+0.743
Mother+0.740
Neg logit contributions
 WIN-0.538
annabin-0.515
 backward-0.492
itably-0.485
oyd-0.485
Token and
Token ID290
Feature activation-6.39e-03

Pos logit contributions
 WIN+0.282
annabin+0.269
 backward+0.260
itably+0.255
oyd+0.253
Neg logit contributions
irement-0.340
akura-0.338
ashi-0.327
 Turing-0.325
Mother-0.324
Tokeni
Token ID72
Feature activation+9.14e-01

Pos logit contributions
irement+0.108
akura+0.107
ashi+0.104
Mother+0.103
mother+0.102
Neg logit contributions
 WIN-0.100
annabin-0.097
 backward-0.094
itably-0.092
oyd-0.090
Token -
Token ID532
Feature activation+1.88e+00

Pos logit contributions
irement+0.530
akura+0.521
ashi+0.503
Mother+0.499
 Turing+0.498
Neg logit contributions
 WIN-0.666
annabin-0.646
 backward-0.625
itably-0.622
oyd-0.614
Tokeneq
Token ID27363
Feature activation+1.36e+00

Pos logit contributions
irement+1.313
akura+1.295
ashi+1.264
mother+1.251
Mother+1.245
Neg logit contributions
 WIN-1.163
annabin-1.112
 backward-1.068
itably-1.046
oyd-1.033
Token 1
Token ID352
Feature activation-4.56e-01

Pos logit contributions
irement+0.843
akura+0.831
ashi+0.807
 Turing+0.805
Mother+0.802
Neg logit contributions
 WIN-0.919
annabin-0.904
itably-0.864
 backward-0.863
oyd-0.847
Token)
Token ID8
Feature activation+1.65e+00

Pos logit contributions
 WIN+0.237
annabin+0.225
 backward+0.217
itably+0.212
oyd+0.209
Neg logit contributions
irement-0.361
akura-0.359
ashi-0.349
 Turing-0.346
Mother-0.346
Token {
Token ID1391
Feature activation+5.57e+00

Pos logit contributions
irement+1.103
akura+1.097
ashi+1.057
 Turing+1.048
Mother+1.047
Neg logit contributions
 WIN-1.049
annabin-1.023
itably-0.980
 backward-0.976
oyd-0.957
Token $
Token ID720
Feature activation+3.60e+00

Pos logit contributions
akura+3.268
irement+3.251
Mother+3.140
ashi+3.138
 Turing+3.136
Neg logit contributions
annabin-3.801
 WIN-3.708
itably-3.625
-3.510
oyd-3.504
Tokenfont
Token ID10331
Feature activation+2.24e+00

Pos logit contributions
akura+2.318
irement+2.313
ashi+2.253
Mother+2.234
mother+2.233
Neg logit contributions
 WIN-2.363
annabin-2.294
 backward-2.220
itably-2.169
 GOD-2.123
Token1
Token ID16
Feature activation+1.93e+00

Pos logit contributions
irement+1.510
akura+1.506
Mother+1.455
ashi+1.451
mother+1.440
Neg logit contributions
 WIN-1.419
annabin-1.367
 backward-1.318
itably-1.298
oyd-1.284
Token =
Token ID796
Feature activation+1.76e+00

Pos logit contributions
irement+1.267
akura+1.260
ashi+1.215
Mother+1.210
mother+1.201
Neg logit contributions
 WIN-1.258
annabin-1.212
 backward-1.173
itably-1.154
oyd-1.140
Token New
Token ID968
Feature activation+2.79e+00

Pos logit contributions
irement+1.115
akura+1.112
ashi+1.067
 Turing+1.066
Mother+1.064
Neg logit contributions
 WIN-1.171
annabin-1.143
 backward-1.099
itably-1.096
oyd-1.082
Token '
Token ID705
Feature activation+1.71e+00

Pos logit contributions
irement+0.657
akura+0.647
ashi+0.620
 Turing+0.616
Mother+0.610
Neg logit contributions
 WIN-1.123
annabin-1.090
 backward-1.064
itably-1.052
oyd-1.044
Tokennot
Token ID1662
Feature activation+2.79e+00

Pos logit contributions
irement+1.182
akura+1.161
mother+1.153
Mother+1.152
ashi+1.135
Neg logit contributions
 WIN-1.060
annabin-1.016
 backward-0.981
oyd-0.960
itably-0.958
Token quite
Token ID2407
Feature activation+2.32e+00

Pos logit contributions
irement+1.743
 Turing+1.660
ashi+1.641
akura+1.594
mother+1.554
Neg logit contributions
 WIN-1.924
annabin-1.869
ospace-1.836
itably-1.817
oyd-1.804
Token even
Token ID772
Feature activation+1.63e+00

Pos logit contributions
irement+1.363
akura+1.331
ashi+1.301
 Turing+1.298
Mother+1.286
Neg logit contributions
 WIN-1.667
annabin-1.600
oyd-1.551
itably-1.526
 backward-1.518
Token handed
Token ID10158
Feature activation+1.85e+00

Pos logit contributions
irement+0.936
akura+0.875
 Turing+0.872
ashi+0.854
mother+0.854
Neg logit contributions
 WIN-1.209
annabin-1.180
oyd-1.165
itably-1.141
 backward-1.127
Token'
Token ID6
Feature activation+5.56e+00

Pos logit contributions
irement+1.143
akura+1.119
ashi+1.078
 Turing+1.077
mother+1.074
Neg logit contributions
 WIN-1.279
annabin-1.246
 backward-1.200
itably-1.182
oyd-1.180
Token 2016
Token ID1584
Feature activation+2.62e+00

Pos logit contributions
irement+3.260
 Turing+3.229
mother+3.116
Mother+3.023
akura+2.979
Neg logit contributions
annabin-3.926
 WIN-3.800
ospace-3.697
 backward-3.687
flat-3.608
Token primary
Token ID4165
Feature activation+4.08e+00

Pos logit contributions
irement+1.824
akura+1.806
ashi+1.746
 Turing+1.742
Mother+1.734
Neg logit contributions
 WIN-1.614
annabin-1.547
 backward-1.496
itably-1.468
oyd-1.458
Token Sanders
Token ID5831
Feature activation+2.08e+00

Pos logit contributions
irement+2.343
 Turing+2.262
akura+2.245
Mother+2.203
ashi+2.181
Neg logit contributions
 WIN-2.869
annabin-2.829
itably-2.763
 backward-2.747
ospace-2.725
Token:
Token ID25
Feature activation+1.28e+00

Pos logit contributions
irement+1.723
akura+1.643
Mother+1.630
 Turing+1.626
mother+1.619
Neg logit contributions
 WIN-1.013
annabin-0.974
itably-0.954
oyd-0.922
 backward-0.921
Token '
Token ID705
Feature activation+9.56e-01

Pos logit contributions
irement+0.641
akura+0.633
ashi+0.610
 Turing+0.610
Mother+0.602
Neg logit contributions
 WIN-1.027
annabin-0.995
 backward-0.969
itably-0.964
oyd-0.955
Token first
Token ID717
Feature activation+7.97e-01

Pos logit contributions
 WIN+0.529
annabin+0.506
 backward+0.492
itably+0.483
oyd+0.478
Neg logit contributions
irement-0.535
akura-0.531
ashi-0.512
 Turing-0.509
Mother-0.507
Token to
Token ID284
Feature activation+2.65e+00

Pos logit contributions
irement+0.514
akura+0.505
ashi+0.493
mother+0.485
 Turing+0.483
Neg logit contributions
 WIN-0.523
annabin-0.508
 backward-0.479
oyd-0.479
itably-0.477
Token hear
Token ID3285
Feature activation+3.61e+00

Pos logit contributions
irement+1.702
ashi+1.679
Mother+1.621
mother+1.620
akura+1.610
Neg logit contributions
annabin-1.689
 WIN-1.636
 backward-1.561
 GOD-1.535
oyd-1.533
Token about
Token ID546
Feature activation+2.75e+00

Pos logit contributions
irement+1.838
 Turing+1.793
Mother+1.792
mother+1.782
akura+1.763
Neg logit contributions
 WIN-2.744
annabin-2.687
oyd-2.558
ospace-2.549
itably-2.544
Token Nation
Token ID8741
Feature activation+2.76e+00

Pos logit contributions
irement+1.474
Mother+1.453
akura+1.422
mother+1.412
 Turing+1.390
Neg logit contributions
 WIN-2.055
annabin-2.043
oyd-1.936
itably-1.934
 backward-1.917
Token Travels
Token ID48899
Feature activation+5.51e+00

Pos logit contributions
Mother+1.406
mother+1.366
psc+1.328
ashi+1.301
irement+1.300
Neg logit contributions
 WIN-2.185
itably-2.093
annabin-2.082
oyd-2.032
 backward-2.018
Token destinations
Token ID23982
Feature activation+2.28e+00

Pos logit contributions
akura+2.617
irement+2.596
Mother+2.594
 Turing+2.572
 Consortium+2.569
Neg logit contributions
annabin-4.344
itably-4.216
 WIN-4.215
flat-4.122
075-4.058
Token,
Token ID11
Feature activation+7.54e-01

Pos logit contributions
irement+1.281
akura+1.251
mother+1.229
Mother+1.225
ashi+1.218
Neg logit contributions
 WIN-1.675
annabin-1.635
itably-1.571
oyd-1.564
 backward-1.555
Token and
Token ID290
Feature activation+1.64e+00

Pos logit contributions
irement+0.470
akura+0.464
 Turing+0.446
ashi+0.442
mother+0.441
Neg logit contributions
 WIN-0.505
annabin-0.494
oyd-0.490
itably-0.475
ospace-0.474
Token explore
Token ID7301
Feature activation+1.16e+00

Pos logit contributions
irement+0.555
akura+0.542
ashi+0.518
 Turing+0.506
mother+0.501
Neg logit contributions
 WIN-1.578
annabin-1.550
itably-1.503
oyd-1.499
 backward-1.495
Token the
Token ID262
Feature activation-8.88e-02

Pos logit contributions
akura+0.694
Mother+0.693
irement+0.689
 Turing+0.684
mother+0.681
Neg logit contributions
 WIN-0.791
annabin-0.764
oyd-0.740
itably-0.738
ospace-0.731
Token1
Token ID16
Feature activation-1.46e-02

Pos logit contributions
akura+1.006
irement+0.994
Mother+0.972
 Turing+0.964
ashi+0.960
Neg logit contributions
 WIN-1.117
annabin-1.099
 backward-1.067
itably-1.053
ospace-1.049
Token/
Token ID14
Feature activation+3.20e-01

Pos logit contributions
 WIN+0.009
annabin+0.009
 backward+0.009
itably+0.009
oyd+0.009
Neg logit contributions
irement-0.010
akura-0.010
ashi-0.009
 Turing-0.009
Mother-0.009
Token2
Token ID17
Feature activation+7.35e-01

Pos logit contributions
irement+0.202
akura+0.201
ashi+0.193
Mother+0.192
 Turing+0.192
Neg logit contributions
 WIN-0.217
annabin-0.209
 backward-0.203
itably-0.199
oyd-0.198
Token"]
Token ID8973
Feature activation+2.43e+00

Pos logit contributions
irement+0.381
akura+0.379
ashi+0.362
 Turing+0.360
Mother+0.358
Neg logit contributions
 WIN-0.576
annabin-0.564
 backward-0.548
itably-0.541
oyd-0.537
Token [
Token ID685
Feature activation+6.49e-01

Pos logit contributions
irement+1.256
akura+1.230
 Turing+1.226
Mother+1.196
mother+1.192
Neg logit contributions
annabin-1.896
 WIN-1.829
itably-1.820
oyd-1.791
ospace-1.781
TokenEC
Token ID2943
Feature activation+5.44e+00

Pos logit contributions
irement+0.311
akura+0.310
ashi+0.295
 Turing+0.292
Mother+0.292
Neg logit contributions
 WIN-0.538
annabin-0.523
 backward-0.511
itably-0.502
oyd-0.500
TokenO
Token ID46
Feature activation+1.89e+00

Pos logit contributions
mother+3.317
irement+3.307
Mother+3.287
ashi+3.191
akura+3.181
Neg logit contributions
annabin-3.534
 WIN-3.514
 backward-3.453
itably-3.429
oyd-3.289
Token "
Token ID366
Feature activation+3.20e+00

Pos logit contributions
 Turing+1.009
irement+1.000
mother+0.975
Mother+0.974
 Tsuk+0.967
Neg logit contributions
annabin-1.428
 WIN-1.356
ospace-1.353
oyd-1.322
 backward-1.311
TokenA
Token ID32
Feature activation+7.13e-01

Pos logit contributions
akura+1.994
Mother+1.990
irement+1.973
mother+1.957
ashi+1.927
Neg logit contributions
annabin-2.061
 WIN-1.994
 backward-1.943
oyd-1.936
itably-1.927
Token46
Token ID3510
Feature activation+2.92e-01

Pos logit contributions
irement+0.434
akura+0.432
ashi+0.417
Mother+0.415
 Turing+0.411
Neg logit contributions
 WIN-0.492
annabin-0.478
 backward-0.468
itably-0.461
oyd-0.454
Token"]
Token ID8973
Feature activation+1.63e+00

Pos logit contributions
irement+0.140
akura+0.139
ashi+0.134
 Turing+0.133
Mother+0.133
Neg logit contributions
 WIN-0.238
annabin-0.235
 backward-0.227
itably-0.225
oyd-0.225
Token '
Token ID705
Feature activation+1.66e+00

Pos logit contributions
irement+0.705
akura+0.700
ashi+0.668
 Turing+0.661
Mother+0.660
Neg logit contributions
 WIN-1.054
annabin-1.017
 backward-0.992
itably-0.977
oyd-0.970
Tokennot
Token ID1662
Feature activation+2.80e+00

Pos logit contributions
irement+1.144
akura+1.127
Mother+1.115
mother+1.114
ashi+1.098
Neg logit contributions
 WIN-1.027
annabin-0.985
 backward-0.949
itably-0.929
oyd-0.929
Token quite
Token ID2407
Feature activation+2.47e+00

Pos logit contributions
irement+1.790
 Turing+1.698
ashi+1.692
akura+1.638
mother+1.602
Neg logit contributions
 WIN-1.884
annabin-1.839
ospace-1.806
itably-1.775
oyd-1.768
Token even
Token ID772
Feature activation+1.50e+00

Pos logit contributions
irement+1.433
akura+1.386
ashi+1.369
 Turing+1.361
Mother+1.348
Neg logit contributions
 WIN-1.797
annabin-1.721
oyd-1.678
itably-1.645
ospace-1.634
Token handed
Token ID10158
Feature activation+1.34e+00

Pos logit contributions
irement+0.866
akura+0.809
 Turing+0.803
ashi+0.792
mother+0.788
Neg logit contributions
 WIN-1.116
annabin-1.088
oyd-1.074
itably-1.053
 backward-1.036
Token'
Token ID6
Feature activation+5.42e+00

Pos logit contributions
irement+0.845
akura+0.832
ashi+0.802
 Turing+0.799
Mother+0.794
Neg logit contributions
 WIN-0.916
annabin-0.885
 backward-0.857
itably-0.843
oyd-0.840
Token 2016
Token ID1584
Feature activation+2.24e+00

Pos logit contributions
irement+3.097
 Turing+3.083
mother+2.945
Mother+2.865
akura+2.849
Neg logit contributions
annabin-3.899
 WIN-3.805
ospace-3.687
 backward-3.663
oyd-3.605
Token primary
Token ID4165
Feature activation+3.90e+00

Pos logit contributions
irement+1.524
akura+1.508
ashi+1.456
 Turing+1.454
Mother+1.447
Neg logit contributions
 WIN-1.411
annabin-1.353
 backward-1.309
itably-1.285
oyd-1.277
Token Sanders
Token ID5831
Feature activation+1.08e+00

Pos logit contributions
irement+2.306
akura+2.224
 Turing+2.220
Mother+2.157
ashi+2.157
Neg logit contributions
 WIN-2.721
annabin-2.646
 backward-2.593
itably-2.581
oyd-2.544
Token:
Token ID25
Feature activation+7.15e-01

Pos logit contributions
irement+0.914
akura+0.887
 Turing+0.869
Mother+0.868
ashi+0.865
Neg logit contributions
 WIN-0.507
annabin-0.481
 backward-0.459
itably-0.459
oyd-0.453
Token '
Token ID705
Feature activation+5.56e-01

Pos logit contributions
irement+0.405
akura+0.401
ashi+0.385
 Turing+0.382
Mother+0.380
Neg logit contributions
 WIN-0.533
annabin-0.513
 backward-0.500
itably-0.493
oyd-0.489
Token VI
Token ID13889
Feature activation-1.45e+00

Pos logit contributions
 WIN+1.133
annabin+1.078
 backward+1.055
itably+1.028
oyd+1.019
Neg logit contributions
irement-1.053
akura-1.043
ashi-1.001
Mother-0.991
 Turing-0.991
Token.
Token ID13
Feature activation+3.20e-01

Pos logit contributions
 WIN+0.914
annabin+0.872
 backward+0.849
itably+0.826
oyd+0.823
Neg logit contributions
irement-0.988
akura-0.981
ashi-0.946
 Turing-0.938
Mother-0.937
Token
Token ID198
Feature activation+2.52e-01

Pos logit contributions
irement+0.204
akura+0.202
 Turing+0.196
ashi+0.195
Mother+0.195
Neg logit contributions
 WIN-0.214
annabin-0.208
 backward-0.201
itably-0.199
oyd-0.197
Token
Token ID198
Feature activation+4.69e-01

Pos logit contributions
irement+0.152
akura+0.151
Mother+0.150
mother+0.148
 Turing+0.148
Neg logit contributions
 WIN-0.173
annabin-0.172
 backward-0.165
itably-0.164
oyd-0.163
TokenSign
Token ID11712
Feature activation-1.28e+00

Pos logit contributions
irement+0.310
akura+0.308
Mother+0.301
ashi+0.299
mother+0.297
Neg logit contributions
 WIN-0.303
annabin-0.293
 backward-0.284
itably-0.278
oyd-0.277
Token Up
Token ID3205
Feature activation+5.42e+00

Pos logit contributions
 WIN+0.752
annabin+0.719
 backward+0.694
itably+0.681
oyd+0.676
Neg logit contributions
irement-0.927
akura-0.922
ashi-0.894
 Turing-0.887
Mother-0.885
Token:
Token ID25
Feature activation+1.63e+00

Pos logit contributions
akura+4.092
irement+4.090
ashi+4.000
 Turing+3.959
Mother+3.922
Neg logit contributions
annabin-2.762
 WIN-2.754
itably-2.703
oyd-2.681
 backward-2.681
Token Stay
Token ID16160
Feature activation+2.84e+00

Pos logit contributions
irement+0.854
akura+0.825
Mother+0.805
mother+0.802
 Turing+0.791
Neg logit contributions
annabin-1.251
 WIN-1.242
 backward-1.201
itably-1.195
oyd-1.193
Token on
Token ID319
Feature activation+1.76e+00

Pos logit contributions
akura+1.961
ashi+1.886
irement+1.873
 Turing+1.833
mother+1.831
Neg logit contributions
 WIN-1.743
annabin-1.655
itably-1.619
oyd-1.609
 backward-1.597
Token top
Token ID1353
Feature activation+2.72e+00

Pos logit contributions
irement+1.170
akura+1.166
 Turing+1.139
Mother+1.132
ashi+1.131
Neg logit contributions
 WIN-1.094
annabin-1.064
oyd-1.033
itably-1.026
 backward-1.022
Token of
Token ID286
Feature activation+1.27e+00

Pos logit contributions
irement+1.570
mother+1.552
akura+1.543
 Turing+1.520
Mother+1.495
Neg logit contributions
 WIN-1.938
annabin-1.873
itably-1.836
oyd-1.834
 backward-1.794
Token1
Token ID16
Feature activation-1.87e+00

Pos logit contributions
 WIN+0.933
annabin+0.899
 backward+0.874
itably+0.858
oyd+0.850
Neg logit contributions
irement-0.872
akura-0.866
ashi-0.835
Mother-0.831
 Turing-0.831
Tokenm
Token ID76
Feature activation-2.75e-01

Pos logit contributions
 WIN+1.078
annabin+1.028
 backward+0.994
itably+0.974
oyd+0.963
Neg logit contributions
irement-1.372
akura-1.363
ashi-1.320
 Turing-1.312
Mother-1.308
Token
Token ID198
Feature activation+8.48e-01

Pos logit contributions
 WIN+0.180
annabin+0.174
 backward+0.168
itably+0.166
oyd+0.164
Neg logit contributions
akura-0.179
irement-0.179
ashi-0.173
 Turing-0.171
Mother-0.170
Token
Token ID198
Feature activation+1.85e+00

Pos logit contributions
irement+0.504
Mother+0.504
akura+0.500
mother+0.498
 Turing+0.490
Neg logit contributions
 WIN-0.588
annabin-0.581
 backward-0.560
oyd-0.557
itably-0.556
TokenKrist
Token ID40756
Feature activation-1.37e-01

Pos logit contributions
irement+1.194
akura+1.184
Mother+1.180
mother+1.161
ashi+1.156
Neg logit contributions
 WIN-1.212
annabin-1.177
 backward-1.140
oyd-1.115
itably-1.115
Tokenen
Token ID268
Feature activation+5.32e+00

Pos logit contributions
 WIN+0.104
annabin+0.100
 backward+0.098
itably+0.096
oyd+0.096
Neg logit contributions
irement-0.076
akura-0.075
ashi-0.072
Mother-0.070
 Turing-0.070
Token Rou
Token ID13876
Feature activation+2.77e+00

Pos logit contributions
irement+2.509
akura+2.464
 Turing+2.381
ashi+2.345
mother+2.313
Neg logit contributions
 WIN-4.336
annabin-4.266
 backward-4.101
itably-4.098
flat-4.058
Tokenpen
Token ID3617
Feature activation+2.67e+00

Pos logit contributions
irement+2.142
akura+2.124
mother+2.079
immer+2.037
psc+2.011
Neg logit contributions
 WIN-1.451
 backward-1.345
annabin-1.339
-1.311
itably-1.309
Tokenian
Token ID666
Feature activation+3.00e-01

Pos logit contributions
irement+1.736
akura+1.707
mother+1.692
ashi+1.649
Mother+1.641
Neg logit contributions
 WIN-1.723
annabin-1.670
 GOD-1.604
itably-1.598
 backward-1.590
Token,
Token ID11
Feature activation+1.59e+00

Pos logit contributions
irement+0.198
akura+0.197
ashi+0.190
 Turing+0.189
Mother+0.189
Neg logit contributions
 WIN-0.195
annabin-0.188
 backward-0.182
itably-0.179
oyd-0.177
Token whose
Token ID3025
Feature activation-8.78e-01

Pos logit contributions
irement+0.947
akura+0.943
ashi+0.909
 Turing+0.905
Mother+0.903
Neg logit contributions
 WIN-1.122
annabin-1.091
 backward-1.054
itably-1.044
oyd-1.041
Token minions
Token ID22811
Feature activation-1.49e+00

Pos logit contributions
 WIN+0.955
annabin+0.908
 backward+0.887
itably+0.866
oyd+0.858
Neg logit contributions
irement-1.027
akura-1.021
ashi-0.983
 Turing-0.978
Mother-0.973
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
 WIN+0.928
annabin+0.885
 backward+0.861
itably+0.840
oyd+0.832
Neg logit contributions
irement-1.030
akura-1.023
ashi-0.988
 Turing-0.981
Mother-0.978
Token
Token ID198
Feature activation-2.82e-01

Pos logit contributions
 WIN+0.232
annabin+0.227
 backward+0.220
itably+0.218
oyd+0.217
Neg logit contributions
irement-0.204
akura-0.203
 Turing-0.196
ashi-0.196
Mother-0.195
Token
Token ID198
Feature activation-3.82e-01

Pos logit contributions
 WIN+0.193
annabin+0.189
 backward+0.182
itably+0.180
oyd+0.180
Neg logit contributions
irement-0.174
akura-0.172
Mother-0.168
mother-0.167
ashi-0.167
TokenIGN
Token ID16284
Feature activation+5.11e+00

Pos logit contributions
 WIN+0.241
annabin+0.231
 backward+0.225
itably+0.220
oyd+0.218
Neg logit contributions
irement-0.260
akura-0.258
ashi-0.250
Mother-0.249
mother-0.247
Token:
Token ID25
Feature activation+5.31e+00

Pos logit contributions
Mother+3.638
mother+3.614
irement+3.580
 Turing+3.562
ashi+3.526
Neg logit contributions
annabin-2.799
 WIN-2.742
oyd-2.678
 backward-2.620
ospace-2.614
Token Sure
Token ID10889
Feature activation+1.76e+00

Pos logit contributions
 Turing+2.891
Mother+2.785
akura+2.768
irement+2.763
mother+2.760
Neg logit contributions
annabin-3.885
itably-3.750
 WIN-3.744
oyd-3.724
ospace-3.703
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
irement+1.150
akura+1.147
ashi+1.112
Mother+1.102
 Turing+1.100
Neg logit contributions
 WIN-1.139
annabin-1.108
 backward-1.065
itably-1.054
oyd-1.043
Token Before
Token ID7413
Feature activation+3.29e+00

Pos logit contributions
akura+0.789
irement+0.789
 Turing+0.776
Mother+0.765
ashi+0.764
Neg logit contributions
 WIN-0.963
annabin-0.956
 backward-0.921
itably-0.916
oyd-0.912
Token we
Token ID356
Feature activation+1.43e+00

Pos logit contributions
irement+2.018
akura+1.991
ashi+1.948
 Turing+1.921
mother+1.907
Neg logit contributions
 WIN-2.220
annabin-2.213
itably-2.108
 backward-2.077
oyd-2.076
Token move
Token ID1445
Feature activation+3.63e-01

Pos logit contributions
irement+0.904
akura+0.902
ashi+0.875
 Turing+0.869
mother+0.850
Neg logit contributions
 WIN-0.943
annabin-0.919
itably-0.880
oyd-0.880
 backward-0.871
Token out
Token ID503
Feature activation-5.36e-02

Pos logit contributions
 WIN+0.189
annabin+0.186
 backward+0.175
itably+0.175
oyd+0.174
Neg logit contributions
irement-0.190
akura-0.190
 Turing-0.184
ashi-0.183
Mother-0.182
Token The
Token ID383
Feature activation-1.34e+00

Pos logit contributions
 WIN+0.035
annabin+0.034
 backward+0.033
oyd+0.032
itably+0.032
Neg logit contributions
irement-0.035
akura-0.034
ashi-0.033
 Turing-0.033
Mother-0.033
Token Shield
Token ID10022
Feature activation-2.40e+00

Pos logit contributions
 WIN+0.954
annabin+0.911
 backward+0.890
itably+0.875
oyd+0.865
Neg logit contributions
irement-0.813
akura-0.808
ashi-0.774
 Turing-0.764
Mother-0.762
Token in
Token ID287
Feature activation-1.38e+00

Pos logit contributions
 WIN+1.616
annabin+1.539
 backward+1.508
itably+1.471
oyd+1.457
Neg logit contributions
irement-1.536
akura-1.525
ashi-1.466
 Turing-1.457
Mother-1.450
Token Live
Token ID7547
Feature activation-6.60e-01

Pos logit contributions
 WIN+0.865
annabin+0.828
 backward+0.802
itably+0.788
oyd+0.780
Neg logit contributions
irement-0.949
akura-0.943
ashi-0.911
 Turing-0.905
Mother-0.903
Token+
Token ID10
Feature activation+5.30e+00

Pos logit contributions
 WIN+0.471
annabin+0.458
 backward+0.442
itably+0.439
oyd+0.433
Neg logit contributions
irement-0.393
akura-0.389
ashi-0.375
 Turing-0.374
Mother-0.371
TokenSame
Token ID30556
Feature activation+4.58e+00

Pos logit contributions
irement+2.707
Mother+2.664
ashi+2.639
akura+2.618
mother+2.585
Neg logit contributions
 WIN-4.065
annabin-4.015
itably-3.958
ospace-3.830
 backward-3.813
Token Day
Token ID3596
Feature activation+1.87e+00

Pos logit contributions
irement+3.481
akura+3.433
mother+3.385
Mother+3.353
ashi+3.320
Neg logit contributions
annabin-2.408
 WIN-2.404
 backward-2.263
oyd-2.259
ospace-2.250
Token numbers
Token ID3146
Feature activation-7.89e-01

Pos logit contributions
irement+1.214
akura+1.208
ashi+1.184
Mother+1.173
mother+1.171
Neg logit contributions
 WIN-1.178
annabin-1.175
itably-1.136
 backward-1.114
oyd-1.107
Token.
Token ID13
Feature activation-1.10e+00

Pos logit contributions
 WIN+0.522
annabin+0.503
 backward+0.486
itably+0.480
oyd+0.476
Neg logit contributions
irement-0.511
akura-0.506
ashi-0.490
 Turing-0.485
Mother-0.484
Token
Token ID198
Feature activation+4.09e-01

Pos logit contributions
 WIN+0.726
annabin+0.697
 backward+0.677
itably+0.666
oyd+0.660
Neg logit contributions
irement-0.710
akura-0.705
ashi-0.680
 Turing-0.675
Mother-0.673
Token.
Token ID13
Feature activation+4.60e-01

Pos logit contributions
irement+1.479
akura+1.471
ashi+1.417
 Turing+1.411
Mother+1.406
Neg logit contributions
 WIN-1.509
annabin-1.442
oyd-1.384
 backward-1.375
itably-1.373
Token
Token ID198
Feature activation-1.25e+00

Pos logit contributions
irement+0.295
akura+0.293
 Turing+0.283
ashi+0.283
Mother+0.282
Neg logit contributions
 WIN-0.306
annabin-0.296
 backward-0.287
itably-0.283
oyd-0.281
Token
Token ID198
Feature activation-1.54e+00

Pos logit contributions
 WIN+0.830
annabin+0.803
 backward+0.776
itably+0.766
oyd+0.761
Neg logit contributions
irement-0.810
akura-0.802
ashi-0.777
Mother-0.776
 Turing-0.774
TokenGet
Token ID3855
Feature activation+6.84e-01

Pos logit contributions
 WIN+1.028
annabin+0.984
 backward+0.957
itably+0.941
oyd+0.932
Neg logit contributions
irement-0.988
akura-0.980
ashi-0.944
 Turing-0.937
Mother-0.924
Token our
Token ID674
Feature activation+1.99e+00

Pos logit contributions
irement+0.412
akura+0.406
ashi+0.393
Mother+0.393
mother+0.389
Neg logit contributions
 WIN-0.483
annabin-0.467
 backward-0.449
oyd-0.445
itably-0.444
Token daily
Token ID4445
Feature activation+5.28e+00

Pos logit contributions
irement+1.610
akura+1.581
ashi+1.550
Mother+1.546
mother+1.544
Neg logit contributions
 WIN-0.954
annabin-0.937
oyd-0.906
 backward-0.875
itably-0.867
Token newsletter
Token ID13129
Feature activation+5.44e-01

Pos logit contributions
irement+3.258
 Turing+3.140
akura+3.137
Mother+3.051
mother+3.026
Neg logit contributions
oyd-3.492
annabin-3.474
 WIN-3.429
ospace-3.373
itably-3.296
Token Upgrade
Token ID24236
Feature activation+2.24e+00

Pos logit contributions
irement+0.283
akura+0.281
ashi+0.269
Mother+0.265
 Turing+0.264
Neg logit contributions
 WIN-0.432
annabin-0.416
 backward-0.407
itably-0.399
oyd-0.395
Token your
Token ID534
Feature activation+9.97e-01

Pos logit contributions
irement+1.089
akura+1.047
ashi+1.014
 Turing+1.013
Mother+1.011
Neg logit contributions
 WIN-1.814
annabin-1.773
oyd-1.735
 backward-1.715
-1.675
Token inbox
Token ID13734
Feature activation-3.14e-01

Pos logit contributions
irement+0.766
akura+0.753
 Turing+0.734
ashi+0.734
Mother+0.728
Neg logit contributions
 WIN-0.543
annabin-0.520
 backward-0.497
oyd-0.491
itably-0.488
Token and
Token ID290
Feature activation+1.37e-01

Pos logit contributions
 WIN+0.187
annabin+0.178
 backward+0.173
itably+0.169
oyd+0.168
Neg logit contributions
irement-0.225
akura-0.224
ashi-0.216
 Turing-0.215
Mother-0.214
Token this
Token ID428
Feature activation+6.44e-01

Pos logit contributions
 WIN+1.047
annabin+1.003
 backward+0.976
itably+0.958
oyd+0.949
Neg logit contributions
irement-1.003
akura-0.995
ashi-0.958
 Turing-0.950
Mother-0.949
Token.
Token ID13
Feature activation+6.89e-01

Pos logit contributions
irement+0.513
akura+0.511
ashi+0.496
 Turing+0.494
Mother+0.493
Neg logit contributions
 WIN-0.328
annabin-0.314
 backward-0.300
itably-0.295
oyd-0.293
Token For
Token ID1114
Feature activation+6.11e-01

Pos logit contributions
akura+0.416
irement+0.413
 Turing+0.399
ashi+0.398
Mother+0.395
Neg logit contributions
 WIN-0.482
annabin-0.471
 backward-0.453
itably-0.452
oyd-0.448
Token the
Token ID262
Feature activation-2.59e-01

Pos logit contributions
akura+0.418
irement+0.417
 Turing+0.406
ashi+0.405
Mother+0.398
Neg logit contributions
 WIN-0.376
annabin-0.367
itably-0.348
 backward-0.344
oyd-0.343
Token sake
Token ID11060
Feature activation+2.10e+00

Pos logit contributions
 WIN+0.187
annabin+0.182
itably+0.175
 backward+0.174
oyd+0.173
Neg logit contributions
irement-0.151
akura-0.150
 Turing-0.145
ashi-0.144
Mother-0.142
Token of
Token ID286
Feature activation+5.25e+00

Pos logit contributions
irement+1.298
akura+1.280
ashi+1.240
mother+1.235
 Turing+1.232
Neg logit contributions
 WIN-1.422
annabin-1.390
itably-1.330
oyd-1.322
 backward-1.300
Token comple
Token ID1224
Feature activation+2.06e+00

Pos logit contributions
 Turing+3.656
irement+3.644
akura+3.605
ashi+3.521
Mother+3.481
Neg logit contributions
annabin-2.989
 WIN-2.943
oyd-2.788
itably-2.746
075-2.697
Tokenteness
Token ID43205
Feature activation+3.00e+00

Pos logit contributions
irement+0.882
akura+0.849
ashi+0.820
 Turing+0.799
mother+0.789
Neg logit contributions
 WIN-1.839
annabin-1.781
 backward-1.749
oyd-1.709
itably-1.708
Token,
Token ID11
Feature activation+2.66e+00

Pos logit contributions
irement+1.847
akura+1.825
ashi+1.774
 Turing+1.760
Mother+1.736
Neg logit contributions
 WIN-2.047
annabin-2.002
itably-1.895
 backward-1.889
oyd-1.885
Token here
Token ID994
Feature activation+3.55e+00

Pos logit contributions
akura+1.465
irement+1.447
 Turing+1.401
ashi+1.389
Mother+1.369
Neg logit contributions
 WIN-1.988
annabin-1.961
itably-1.867
 backward-1.855
oyd-1.855
Token is
Token ID318
Feature activation+8.83e-01

Pos logit contributions
akura+2.217
irement+2.206
ashi+2.135
Mother+2.109
mother+2.098
Neg logit contributions
 WIN-2.405
annabin-2.340
 backward-2.251
itably-2.233
oyd-2.215
Token IM
Token ID8959
Feature activation-3.65e-01

Pos logit contributions
irement+0.129
akura+0.128
ashi+0.125
 Turing+0.124
Mother+0.123
Neg logit contributions
 WIN-0.120
annabin-0.116
 backward-0.111
itably-0.110
oyd-0.109
Token
Token ID1906
Feature activation+9.94e-01

Pos logit contributions
 WIN+0.247
annabin+0.238
 backward+0.230
itably+0.227
oyd+0.225
Neg logit contributions
irement-0.232
akura-0.230
ashi-0.222
 Turing-0.220
Mother-0.220
Token for
Token ID329
Feature activation-8.77e-01

Pos logit contributions
irement+0.653
akura+0.649
ashi+0.628
Mother+0.625
 Turing+0.624
Neg logit contributions
 WIN-0.645
annabin-0.626
 backward-0.601
itably-0.597
oyd-0.589
Token the
Token ID262
Feature activation-2.24e-01

Pos logit contributions
 WIN+0.556
annabin+0.537
 backward+0.515
itably+0.511
oyd+0.505
Neg logit contributions
irement-0.592
akura-0.588
ashi-0.568
 Turing-0.566
Mother-0.562
Token sake
Token ID11060
Feature activation+4.24e-01

Pos logit contributions
 WIN+0.143
annabin+0.138
 backward+0.132
itably+0.132
oyd+0.130
Neg logit contributions
irement-0.150
akura-0.149
 Turing-0.144
ashi-0.144
Mother-0.143
Token of
Token ID286
Feature activation+5.21e+00

Pos logit contributions
irement+0.275
akura+0.274
ashi+0.265
 Turing+0.263
mother+0.263
Neg logit contributions
 WIN-0.277
annabin-0.269
itably-0.257
oyd-0.256
 backward-0.256
Token putting
Token ID5137
Feature activation+5.73e-01

Pos logit contributions
 Turing+3.610
irement+3.604
akura+3.534
ashi+3.473
Mother+3.398
Neg logit contributions
annabin-3.024
 WIN-2.935
oyd-2.823
itably-2.798
075-2.722
Token something
Token ID1223
Feature activation+1.60e+00

Pos logit contributions
irement+0.377
akura+0.373
 Turing+0.365
ashi+0.363
mother+0.357
Neg logit contributions
 WIN-0.368
annabin-0.360
 backward-0.340
itably-0.339
oyd-0.338
Token in
Token ID287
Feature activation-1.03e-02

Pos logit contributions
irement+1.076
akura+1.075
 Turing+1.061
ashi+1.041
mother+1.035
Neg logit contributions
 WIN-0.994
annabin-0.977
itably-0.919
oyd-0.918
 backward-0.903
Token the
Token ID262
Feature activation+6.84e-01

Pos logit contributions
 WIN+0.007
annabin+0.006
 backward+0.006
itably+0.006
oyd+0.006
Neg logit contributions
irement-0.007
akura-0.007
ashi-0.006
 Turing-0.006
Mother-0.006
Token grid
Token ID10706
Feature activation+1.20e+00

Pos logit contributions
irement+0.457
akura+0.453
 Turing+0.442
ashi+0.441
Mother+0.435
Neg logit contributions
 WIN-0.433
annabin-0.422
itably-0.401
 backward-0.401
oyd-0.398
Token men
Token ID1450
Feature activation-3.33e-01

Pos logit contributions
irement+0.198
akura+0.196
ashi+0.191
 Turing+0.191
Mother+0.190
Neg logit contributions
 WIN-0.186
annabin-0.180
 backward-0.172
oyd-0.170
itably-0.170
Token or
Token ID393
Feature activation+2.55e-01

Pos logit contributions
 WIN+0.219
annabin+0.210
 backward+0.203
itably+0.200
oyd+0.200
Neg logit contributions
irement-0.217
akura-0.216
ashi-0.209
 Turing-0.207
Mother-0.206
Token women
Token ID1466
Feature activation-1.21e+00

Pos logit contributions
irement+0.164
akura+0.162
ashi+0.157
 Turing+0.157
Mother+0.157
Neg logit contributions
 WIN-0.169
annabin-0.165
 backward-0.157
itably-0.156
oyd-0.155
Token mocking
Token ID31202
Feature activation+6.87e-01

Pos logit contributions
 WIN+0.800
annabin+0.768
 backward+0.745
itably+0.732
oyd+0.726
Neg logit contributions
irement-0.782
akura-0.776
ashi-0.749
 Turing-0.743
Mother-0.740
Token Val
Token ID3254
Feature activation+4.06e-01

Pos logit contributions
irement+0.443
akura+0.438
ashi+0.428
 Turing+0.425
Mother+0.424
Neg logit contributions
 WIN-0.454
annabin-0.440
 backward-0.420
itably-0.416
oyd-0.414
Tokeniz
Token ID528
Feature activation+5.15e+00

Pos logit contributions
irement+0.251
akura+0.249
ashi+0.240
 Turing+0.238
Mother+0.236
Neg logit contributions
 WIN-0.282
annabin-0.272
 backward-0.264
itably-0.259
oyd-0.257
Tokenade
Token ID671
Feature activation+1.94e+00

Pos logit contributions
akura+3.276
ashi+3.149
irement+3.130
 Turing+3.048
immer+2.973
Neg logit contributions
 WIN-3.443
annabin-3.375
 backward-3.194
itably-3.193
oyd-3.176
Tokenh
Token ID71
Feature activation-6.24e-01

Pos logit contributions
akura+1.553
irement+1.549
ashi+1.531
Mother+1.503
 Turing+1.502
Neg logit contributions
 WIN-0.967
annabin-0.917
itably-0.875
 backward-0.864
oyd-0.858
Token.
Token ID13
Feature activation-1.12e+00

Pos logit contributions
 WIN+0.401
annabin+0.385
 backward+0.371
itably+0.367
oyd+0.364
Neg logit contributions
irement-0.416
akura-0.413
ashi-0.403
 Turing-0.398
Mother-0.396
Token<|endoftext|>
Token ID50256
Feature activation-4.04e-01

Pos logit contributions
 WIN+0.737
annabin+0.709
 backward+0.688
itably+0.677
oyd+0.670
Neg logit contributions
irement-0.731
akura-0.726
ashi-0.701
 Turing-0.697
Mother-0.695
TokenNEW
Token ID13965
Feature activation-1.38e+00

Pos logit contributions
 WIN+0.266
annabin+0.255
 backward+0.248
itably+0.243
oyd+0.241
Neg logit contributions
irement-0.265
akura-0.263
ashi-0.254
Mother-0.253
 Turing-0.251
Token '
Token ID705
Feature activation+1.67e+00

Pos logit contributions
akura+0.465
irement+0.464
ashi+0.437
Mother+0.432
mother+0.431
Neg logit contributions
 WIN-0.780
annabin-0.748
 backward-0.733
itably-0.716
oyd-0.712
Tokennot
Token ID1662
Feature activation+2.84e+00

Pos logit contributions
irement+1.145
akura+1.126
mother+1.117
Mother+1.117
 Turing+1.099
Neg logit contributions
 WIN-1.042
annabin-0.998
 backward-0.963
oyd-0.942
itably-0.940
Token quite
Token ID2407
Feature activation+2.46e+00

Pos logit contributions
irement+1.841
ashi+1.736
 Turing+1.732
akura+1.676
mother+1.639
Neg logit contributions
 WIN-1.905
annabin-1.852
ospace-1.817
oyd-1.786
itably-1.785
Token even
Token ID772
Feature activation+1.55e+00

Pos logit contributions
irement+1.368
akura+1.321
ashi+1.309
 Turing+1.297
Mother+1.289
Neg logit contributions
 WIN-1.844
annabin-1.773
oyd-1.737
ospace-1.695
 backward-1.688
Token handed
Token ID10158
Feature activation+1.46e+00

Pos logit contributions
irement+0.895
akura+0.840
 Turing+0.827
ashi+0.816
mother+0.815
Neg logit contributions
 WIN-1.149
annabin-1.119
oyd-1.102
itably-1.083
 backward-1.067
Token'
Token ID6
Feature activation+5.14e+00

Pos logit contributions
irement+0.921
akura+0.905
ashi+0.872
 Turing+0.868
Mother+0.866
Neg logit contributions
 WIN-1.000
annabin-0.969
 backward-0.936
itably-0.921
oyd-0.918
Token 2016
Token ID1584
Feature activation+1.73e+00

Pos logit contributions
irement+3.152
 Turing+3.121
mother+3.011
Mother+2.947
akura+2.907
Neg logit contributions
annabin-3.454
 WIN-3.381
ospace-3.281
 backward-3.272
flat-3.196
Token primary
Token ID4165
Feature activation+3.90e+00

Pos logit contributions
irement+1.128
akura+1.116
ashi+1.075
 Turing+1.073
Mother+1.068
Neg logit contributions
 WIN-1.141
annabin-1.097
 backward-1.062
itably-1.044
oyd-1.037
Token Sanders
Token ID5831
Feature activation+1.14e+00

Pos logit contributions
irement+2.520
 Turing+2.427
akura+2.419
Mother+2.373
ashi+2.356
Neg logit contributions
 WIN-2.507
annabin-2.433
 backward-2.383
itably-2.367
ospace-2.348
Token:
Token ID25
Feature activation+6.18e-01

Pos logit contributions
irement+0.962
akura+0.928
Mother+0.914
 Turing+0.910
ashi+0.907
Neg logit contributions
 WIN-0.537
annabin-0.510
itably-0.489
 backward-0.486
oyd-0.480
Token '
Token ID705
Feature activation+7.26e-01

Pos logit contributions
irement+0.343
akura+0.340
ashi+0.326
 Turing+0.323
Mother+0.322
Neg logit contributions
 WIN-0.468
annabin-0.451
 backward-0.440
itably-0.434
oyd-0.430
Tokena
Token ID64
Feature activation-8.68e-01

Pos logit contributions
irement+0.387
akura+0.386
ashi+0.374
 Turing+0.372
Mother+0.370
Neg logit contributions
 WIN-0.303
annabin-0.288
 backward-0.277
itably-0.274
oyd-0.269
Token Guard
Token ID4932
Feature activation+8.59e-02

Pos logit contributions
 WIN+0.591
annabin+0.564
 backward+0.552
itably+0.538
oyd+0.535
Neg logit contributions
irement-0.550
akura-0.547
ashi-0.525
Mother-0.520
 Turing-0.519
Token )
Token ID1267
Feature activation-5.88e-01

Pos logit contributions
irement+0.070
akura+0.069
ashi+0.067
 Turing+0.067
Mother+0.067
Neg logit contributions
 WIN-0.043
annabin-0.041
 backward-0.039
itably-0.038
oyd-0.038
Token ->
Token ID4613
Feature activation+2.48e+00

Pos logit contributions
 WIN+0.292
annabin+0.275
 backward+0.266
itably+0.257
oyd+0.255
Neg logit contributions
irement-0.480
akura-0.478
ashi-0.464
 Turing-0.461
Mother-0.460
Token Option
Token ID16018
Feature activation+3.97e+00

Pos logit contributions
irement+1.450
akura+1.426
 Turing+1.399
ashi+1.396
Mother+1.376
Neg logit contributions
 WIN-1.758
annabin-1.731
itably-1.651
 backward-1.649
oyd-1.634
Token <
Token ID1279
Feature activation+5.12e+00

Pos logit contributions
irement+1.532
akura+1.427
Mother+1.410
 Turing+1.383
ashi+1.355
Neg logit contributions
 WIN-3.598
annabin-3.578
 backward-3.442
-3.427
itably-3.407
Token Shared
Token ID39403
Feature activation+2.11e+00

Pos logit contributions
irement+2.800
akura+2.720
 Turing+2.711
ashi+2.697
Mother+2.684
Neg logit contributions
annabin-3.711
 WIN-3.710
itably-3.557
 backward-3.485
075-3.471
Token <
Token ID1279
Feature activation+3.68e+00

Pos logit contributions
irement+1.220
akura+1.198
ashi+1.156
Mother+1.151
 Turing+1.150
Neg logit contributions
 WIN-1.545
annabin-1.490
 backward-1.445
itably-1.428
oyd-1.415
Token '
Token ID705
Feature activation+1.86e+00

Pos logit contributions
irement+1.997
akura+1.961
 Turing+1.920
ashi+1.907
Mother+1.873
Neg logit contributions
 WIN-2.756
annabin-2.700
itably-2.602
 backward-2.594
oyd-2.561
Tokena
Token ID64
Feature activation+8.65e-01

Pos logit contributions
akura+1.266
irement+1.260
ashi+1.234
mother+1.224
Mother+1.217
Neg logit contributions
 WIN-1.146
annabin-1.100
itably-1.068
 backward-1.051
oyd-1.041
Token,
Token ID837
Feature activation+2.09e+00

Pos logit contributions
akura+0.721
irement+0.717
ashi+0.698
 Turing+0.691
Mother+0.691
Neg logit contributions
 WIN-0.419
annabin-0.388
 backward-0.380
itably-0.362
oyd-0.361
Token all
Token ID477
Feature activation-1.51e+00

Pos logit contributions
 WIN+0.971
annabin+0.927
 backward+0.902
itably+0.884
oyd+0.875
Neg logit contributions
irement-1.025
akura-1.019
ashi-0.981
 Turing-0.975
Mother-0.972
Token minions
Token ID22811
Feature activation-1.49e+00

Pos logit contributions
 WIN+0.955
annabin+0.908
 backward+0.887
itably+0.866
oyd+0.858
Neg logit contributions
irement-1.027
akura-1.021
ashi-0.983
 Turing-0.978
Mother-0.973
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
 WIN+0.928
annabin+0.885
 backward+0.861
itably+0.840
oyd+0.832
Neg logit contributions
irement-1.030
akura-1.023
ashi-0.988
 Turing-0.981
Mother-0.978
Token
Token ID198
Feature activation-2.82e-01

Pos logit contributions
 WIN+0.232
annabin+0.227
 backward+0.220
itably+0.218
oyd+0.217
Neg logit contributions
irement-0.204
akura-0.203
 Turing-0.196
ashi-0.196
Mother-0.195
Token
Token ID198
Feature activation-3.82e-01

Pos logit contributions
 WIN+0.193
annabin+0.189
 backward+0.182
itably+0.180
oyd+0.180
Neg logit contributions
irement-0.174
akura-0.172
Mother-0.168
mother-0.167
ashi-0.167
TokenIGN
Token ID16284
Feature activation+5.11e+00

Pos logit contributions
 WIN+0.241
annabin+0.231
 backward+0.225
itably+0.220
oyd+0.218
Neg logit contributions
irement-0.260
akura-0.258
ashi-0.250
Mother-0.249
mother-0.247
Token:
Token ID25
Feature activation+5.31e+00

Pos logit contributions
Mother+3.638
mother+3.614
irement+3.580
 Turing+3.562
ashi+3.526
Neg logit contributions
annabin-2.799
 WIN-2.742
oyd-2.678
 backward-2.620
ospace-2.614
Token Sure
Token ID10889
Feature activation+1.76e+00

Pos logit contributions
 Turing+2.891
Mother+2.785
akura+2.768
irement+2.763
mother+2.760
Neg logit contributions
annabin-3.885
itably-3.750
 WIN-3.744
oyd-3.724
ospace-3.703
Token.
Token ID13
Feature activation+1.36e+00

Pos logit contributions
irement+1.150
akura+1.147
ashi+1.112
Mother+1.102
 Turing+1.100
Neg logit contributions
 WIN-1.139
annabin-1.108
 backward-1.065
itably-1.054
oyd-1.043
Token Before
Token ID7413
Feature activation+3.29e+00

Pos logit contributions
akura+0.789
irement+0.789
 Turing+0.776
Mother+0.765
ashi+0.764
Neg logit contributions
 WIN-0.963
annabin-0.956
 backward-0.921
itably-0.916
oyd-0.912
Token we
Token ID356
Feature activation+1.43e+00

Pos logit contributions
irement+2.018
akura+1.991
ashi+1.948
 Turing+1.921
mother+1.907
Neg logit contributions
 WIN-2.220
annabin-2.213
itably-2.108
 backward-2.077
oyd-2.076
Token his
Token ID465
Feature activation-5.81e-01

Pos logit contributions
 WIN+0.954
annabin+0.911
 backward+0.891
itably+0.873
oyd+0.867
Neg logit contributions
irement-0.824
akura-0.817
ashi-0.784
Mother-0.776
 Turing-0.775
Token band
Token ID4097
Feature activation-7.15e-01

Pos logit contributions
 WIN+0.454
annabin+0.439
 backward+0.427
itably+0.422
oyd+0.419
Neg logit contributions
irement-0.308
akura-0.305
ashi-0.292
 Turing-0.290
Mother-0.289
Token Gr
Token ID1902
Feature activation-5.34e-01

Pos logit contributions
 WIN+0.454
annabin+0.436
 backward+0.422
itably+0.415
oyd+0.411
Neg logit contributions
irement-0.483
akura-0.480
ashi-0.464
 Turing-0.461
Mother-0.460
Tokenong
Token ID506
Feature activation-2.61e+00

Pos logit contributions
 WIN+0.377
annabin+0.362
 backward+0.353
itably+0.346
oyd+0.343
Neg logit contributions
irement-0.325
akura-0.322
ashi-0.310
 Turing-0.307
Mother-0.306
Token Gr
Token ID1902
Feature activation-1.56e+00

Pos logit contributions
 WIN+1.733
annabin+1.653
 backward+1.612
itably+1.578
oyd+1.574
Neg logit contributions
irement-1.700
akura-1.680
 Turing-1.616
ashi-1.610
Mother-1.599
Tokenong
Token ID506
Feature activation-6.97e+00

Pos logit contributions
 WIN+0.868
annabin+0.825
 backward+0.795
itably+0.781
oyd+0.773
Neg logit contributions
irement-1.179
akura-1.170
ashi-1.133
 Turing-1.130
Mother-1.126
Token had
Token ID550
Feature activation-3.54e-01

Pos logit contributions
 WIN+4.410
annabin+4.137
 backward+4.058
itably+3.953
 GOD+3.951
Neg logit contributions
irement-4.648
akura-4.566
 Turing-4.418
Mother-4.341
ashi-4.312
Token just
Token ID655
Feature activation-1.27e+00

Pos logit contributions
 WIN+0.220
annabin+0.212
 backward+0.205
itably+0.202
oyd+0.200
Neg logit contributions
irement-0.242
akura-0.241
ashi-0.234
 Turing-0.232
Mother-0.230
Token blown
Token ID16318
Feature activation-1.32e+00

Pos logit contributions
 WIN+0.735
annabin+0.705
 backward+0.679
itably+0.667
oyd+0.662
Neg logit contributions
irement-0.930
akura-0.924
ashi-0.896
 Turing-0.890
Mother-0.885
Token me
Token ID502
Feature activation+8.57e-01

Pos logit contributions
 WIN+0.807
annabin+0.772
 backward+0.747
itably+0.734
oyd+0.726
Neg logit contributions
irement-0.921
akura-0.915
ashi-0.885
 Turing-0.879
Mother-0.876
Token away
Token ID1497
Feature activation-9.42e-02

Pos logit contributions
irement+0.631
akura+0.631
ashi+0.612
Mother+0.607
 Turing+0.606
Neg logit contributions
 WIN-0.482
annabin-0.467
itably-0.444
 backward-0.438
oyd-0.436
Token than
Token ID621
Feature activation-1.44e+00

Pos logit contributions
 WIN+0.791
annabin+0.759
 backward+0.737
itably+0.724
oyd+0.718
Neg logit contributions
irement-0.760
akura-0.754
ashi-0.727
 Turing-0.721
Mother-0.719
Token most
Token ID749
Feature activation-1.51e+00

Pos logit contributions
 WIN+0.943
annabin+0.910
 backward+0.875
itably+0.865
oyd+0.857
Neg logit contributions
irement-0.943
akura-0.937
ashi-0.906
 Turing-0.900
Mother-0.896
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
 WIN+0.949
annabin+0.907
 backward+0.880
itably+0.863
oyd+0.855
Neg logit contributions
irement-1.039
akura-1.032
ashi-0.997
 Turing-0.989
Mother-0.987
Token -
Token ID532
Feature activation-2.83e+00

Pos logit contributions
 WIN+0.477
annabin+0.463
 backward+0.448
itably+0.443
oyd+0.438
Neg logit contributions
irement-0.412
akura-0.410
ashi-0.395
 Turing-0.394
Mother-0.393
Token July
Token ID2901
Feature activation-5.09e+00

Pos logit contributions
 WIN+1.239
annabin+1.215
 backward+1.148
itably+1.121
oyd+1.109
Neg logit contributions
akura-2.436
irement-2.434
ashi-2.386
 Turing-2.362
mother-2.350
Token 6
Token ID718
Feature activation-6.18e+00

Pos logit contributions
 WIN+3.234
annabin+2.993
 backward+2.937
itably+2.871
oyd+2.861
Neg logit contributions
irement-3.489
akura-3.425
ashi-3.300
Mother-3.274
mother-3.249
Token,
Token ID11
Feature activation-4.65e+00

Pos logit contributions
 WIN+3.601
annabin+3.327
 backward+3.297
itably+3.172
075+3.170
Neg logit contributions
akura-4.530
irement-4.508
ashi-4.313
 Turing-4.260
Mother-4.189
Token 2010
Token ID3050
Feature activation-5.16e+00

Pos logit contributions
 WIN+3.114
annabin+2.905
 backward+2.900
itably+2.795
oyd+2.772
Neg logit contributions
akura-3.027
irement-3.017
ashi-2.911
Mother-2.811
 Turing-2.802
TokenR
Token ID49
Feature activation-1.59e+00

Pos logit contributions
 WIN+2.943
annabin+2.801
 backward+2.674
oyd+2.625
itably+2.600
Neg logit contributions
akura-3.805
irement-3.799
 Turing-3.642
ashi-3.624
mother-3.522
Tokeniff
Token ID733
Feature activation-2.60e+00

Pos logit contributions
 WIN+1.066
annabin+1.015
 backward+0.988
itably+0.971
 GOD+0.952
Neg logit contributions
irement-1.033
akura-1.015
ashi-0.993
Mother-0.971
 Turing-0.969
TokenTra
Token ID15721
Feature activation-1.81e+00

Pos logit contributions
 WIN+2.075
annabin+2.003
 backward+1.957
itably+1.926
oyd+1.913
Neg logit contributions
irement-1.336
akura-1.325
ashi-1.262
 Turing-1.250
Mother-1.241
Token all
Token ID477
Feature activation-1.41e-01

Pos logit contributions
 WIN+0.028
annabin+0.027
 backward+0.026
itably+0.026
oyd+0.025
Neg logit contributions
irement-0.039
akura-0.038
ashi-0.037
 Turing-0.037
Mother-0.037
Token.
Token ID13
Feature activation+1.66e+00

Pos logit contributions
 WIN+0.090
annabin+0.086
 backward+0.083
itably+0.082
oyd+0.081
Neg logit contributions
irement-0.095
akura-0.094
ashi-0.091
 Turing-0.090
Mother-0.090
Token
Token ID198
Feature activation+1.01e+00

Pos logit contributions
irement+1.051
akura+1.044
 Turing+1.011
ashi+1.009
Mother+1.004
Neg logit contributions
 WIN-1.106
annabin-1.079
 backward-1.045
itably-1.033
oyd-1.023
Token
Token ID198
Feature activation+1.98e-01

Pos logit contributions
irement+0.580
akura+0.570
mother+0.566
Mother+0.564
 Turing+0.561
Neg logit contributions
 WIN-0.717
annabin-0.713
oyd-0.682
itably-0.681
 backward-0.681
Token(
Token ID7
Feature activation-1.72e+00

Pos logit contributions
irement+0.132
akura+0.131
ashi+0.126
Mother+0.126
 Turing+0.125
Neg logit contributions
 WIN-0.127
annabin-0.122
 backward-0.118
itably-0.116
oyd-0.115
Token UPDATE
Token ID35717
Feature activation-6.13e+00

Pos logit contributions
 WIN+1.133
annabin+1.085
 backward+1.054
itably+1.035
oyd+1.025
Neg logit contributions
irement-1.127
akura-1.118
ashi-1.078
 Turing-1.071
Mother-1.065
Token:
Token ID25
Feature activation-1.56e+00

Pos logit contributions
 WIN+2.838
 backward+2.533
annabin+2.442
 GOD+2.405
oyd+2.291
Neg logit contributions
akura-5.264
irement-5.203
ashi-5.048
 Turing-5.007
Mother-4.988
Token Sherman
Token ID23198
Feature activation-3.08e-01

Pos logit contributions
 WIN+1.053
annabin+1.008
 backward+0.981
itably+0.962
oyd+0.954
Neg logit contributions
irement-0.999
akura-0.991
ashi-0.954
 Turing-0.946
Mother-0.944
Token suffered
Token ID6989
Feature activation-2.93e+00

Pos logit contributions
 WIN+0.202
annabin+0.195
 backward+0.188
itably+0.186
oyd+0.184
Neg logit contributions
irement-0.202
akura-0.200
ashi-0.194
 Turing-0.192
Mother-0.191
Token a
Token ID257
Feature activation-1.26e+00

Pos logit contributions
 WIN+1.817
 backward+1.705
annabin+1.702
itably+1.620
oyd+1.614
Neg logit contributions
irement-2.030
akura-2.022
Mother-1.940
ashi-1.938
 Turing-1.930
Token rupt
Token ID31580
Feature activation-3.01e-01

Pos logit contributions
 WIN+0.791
annabin+0.757
 backward+0.732
itably+0.721
oyd+0.714
Neg logit contributions
irement-0.868
akura-0.861
ashi-0.832
 Turing-0.826
Mother-0.823
Token to
Token ID284
Feature activation-4.18e-01

Pos logit contributions
 WIN+1.313
annabin+1.245
 backward+1.222
itably+1.187
oyd+1.173
Neg logit contributions
irement-1.563
akura-1.550
ashi-1.496
 Turing-1.490
Mother-1.485
Token state
Token ID1181
Feature activation-4.56e-02

Pos logit contributions
 WIN+0.267
annabin+0.257
 backward+0.247
itably+0.244
oyd+0.242
Neg logit contributions
irement-0.279
akura-0.278
ashi-0.269
 Turing-0.267
Mother-0.266
Token,
Token ID11
Feature activation-1.35e+00

Pos logit contributions
 WIN+0.028
annabin+0.027
 backward+0.026
itably+0.026
oyd+0.025
Neg logit contributions
irement-0.031
akura-0.031
ashi-0.031
 Turing-0.030
Mother-0.030
Token is
Token ID318
Feature activation-2.04e+00

Pos logit contributions
 WIN+0.874
annabin+0.837
 backward+0.812
itably+0.798
oyd+0.791
Neg logit contributions
irement-0.899
akura-0.893
ashi-0.862
 Turing-0.855
Mother-0.853
Token of
Token ID286
Feature activation-1.93e+00

Pos logit contributions
 WIN+1.267
annabin+1.196
 backward+1.186
itably+1.143
oyd+1.130
Neg logit contributions
irement-1.412
akura-1.401
ashi-1.349
 Turing-1.338
Mother-1.337
Token no
Token ID645
Feature activation-5.76e+00

Pos logit contributions
 WIN+1.192
annabin+1.132
 backward+1.114
itably+1.080
oyd+1.069
Neg logit contributions
irement-1.339
akura-1.333
ashi-1.282
Mother-1.271
 Turing-1.271
Token relevance
Token ID23082
Feature activation-7.53e-01

Pos logit contributions
 WIN+3.320
 backward+3.163
annabin+3.081
itably+2.949
 GOD+2.925
Neg logit contributions
akura-4.172
irement-4.141
ashi-4.009
Mother-3.955
mother-3.944
Token.
Token ID13
Feature activation+8.38e-01

Pos logit contributions
 WIN+0.456
annabin+0.436
 backward+0.421
itably+0.414
oyd+0.410
Neg logit contributions
irement-0.532
akura-0.528
ashi-0.511
 Turing-0.507
Mother-0.506
Token Here
Token ID3423
Feature activation+1.01e+00

Pos logit contributions
irement+0.510
akura+0.509
 Turing+0.496
Mother+0.493
ashi+0.492
Neg logit contributions
 WIN-0.574
annabin-0.565
 backward-0.541
itably-0.538
oyd-0.537
Token we
Token ID356
Feature activation+4.27e-01

Pos logit contributions
akura+0.663
irement+0.661
ashi+0.641
 Turing+0.633
Mother+0.633
Neg logit contributions
 WIN-0.660
annabin-0.638
 backward-0.609
itably-0.606
oyd-0.598
Token see
Token ID766
Feature activation-4.26e-01

Pos logit contributions
irement+0.281
akura+0.280
ashi+0.271
 Turing+0.268
Mother+0.266
Neg logit contributions
 WIN-0.277
annabin-0.268
 backward-0.257
itably-0.255
oyd-0.253
Token,
Token ID11
Feature activation+1.00e+00

Pos logit contributions
irement+0.172
akura+0.169
ashi+0.165
Mother+0.165
 Turing+0.165
Neg logit contributions
 WIN-0.186
annabin-0.183
itably-0.174
 backward-0.174
oyd-0.173
Token Cardinals
Token ID18071
Feature activation+1.06e+00

Pos logit contributions
irement+0.618
akura+0.607
 Turing+0.594
ashi+0.594
Mother+0.592
Neg logit contributions
 WIN-0.678
annabin-0.665
 backward-0.638
itably-0.636
ospace-0.626
Token 1
Token ID352
Feature activation+2.05e+00

Pos logit contributions
irement+0.595
akura+0.588
ashi+0.566
 Turing+0.562
Mother+0.560
Neg logit contributions
 WIN-0.786
annabin-0.764
 backward-0.740
itably-0.732
oyd-0.727
Token AL
Token ID8355
Feature activation-2.30e+00

Pos logit contributions
irement+1.577
akura+1.554
 Turing+1.520
ashi+1.517
Mother+1.508
Neg logit contributions
annabin-1.064
 WIN-1.045
itably-0.988
oyd-0.981
 backward-0.978
TokenREAD
Token ID15675
Feature activation-2.95e+00

Pos logit contributions
 WIN+1.349
annabin+1.324
 backward+1.275
itably+1.244
oyd+1.237
Neg logit contributions
irement-1.641
akura-1.618
Mother-1.586
 Turing-1.575
ashi-1.574
TokenY
Token ID56
Feature activation-5.69e+00

Pos logit contributions
 WIN+2.225
annabin+2.145
 backward+2.091
itably+2.059
oyd+2.043
Neg logit contributions
irement-1.652
akura-1.637
ashi-1.569
 Turing-1.555
Mother-1.550
Token COMP
Token ID24301
Feature activation-3.41e+00

Pos logit contributions
 WIN+3.104
 backward+2.714
annabin+2.675
 GOD+2.630
itably+2.564
Neg logit contributions
akura-4.504
irement-4.491
ashi-4.322
mother-4.284
Mother-4.276
TokenET
Token ID2767
Feature activation-1.60e+00

Pos logit contributions
 WIN+2.781
annabin+2.685
 backward+2.623
itably+2.587
oyd+2.567
Neg logit contributions
irement-1.692
akura-1.678
ashi-1.599
 Turing-1.582
Mother-1.574
TokenED
Token ID1961
Feature activation-1.74e+00

Pos logit contributions
 WIN+1.044
annabin+1.021
 backward+0.989
oyd+0.969
itably+0.959
Neg logit contributions
irement-1.048
akura-1.021
Mother-1.002
 Turing-0.998
ashi-0.997
Token:
Token ID25
Feature activation-4.17e-01

Pos logit contributions
 WIN+0.703
annabin+0.625
 backward+0.611
itably+0.583
oyd+0.569
Neg logit contributions
akura-1.603
irement-1.595
ashi-1.555
 Turing-1.541
Mother-1.539
Token vs
Token ID3691
Feature activation+1.47e+00

Pos logit contributions
 WIN+0.294
annabin+0.283
 backward+0.275
itably+0.271
oyd+0.269
Neg logit contributions
irement-0.253
akura-0.251
ashi-0.241
 Turing-0.239
Mother-0.238
Token 15
Token ID1315
Feature activation-4.44e+00

Pos logit contributions
 WIN+1.976
annabin+1.886
 backward+1.824
itably+1.788
oyd+1.771
Neg logit contributions
irement-2.405
akura-2.388
ashi-2.311
 Turing-2.296
Mother-2.290
Token,
Token ID11
Feature activation-3.53e+00

Pos logit contributions
 WIN+2.722
annabin+2.570
 backward+2.510
itably+2.446
oyd+2.417
Neg logit contributions
irement-3.110
akura-3.102
ashi-2.979
 Turing-2.952
Mother-2.917
Token 2010
Token ID3050
Feature activation-3.65e+00

Pos logit contributions
 WIN+2.409
annabin+2.280
 backward+2.250
itably+2.185
oyd+2.167
Neg logit contributions
irement-2.236
akura-2.236
ashi-2.151
Mother-2.095
 Turing-2.092
TokenQue
Token ID15681
Feature activation-3.87e+00

Pos logit contributions
 WIN+2.363
annabin+2.263
 backward+2.197
itably+2.157
oyd+2.138
Neg logit contributions
irement-2.426
akura-2.408
ashi-2.324
 Turing-2.307
Mother-2.298
Tokener
Token ID263
Feature activation-1.69e+00

Pos logit contributions
 WIN+3.277
annabin+3.178
 backward+3.104
itably+3.062
oyd+3.042
Neg logit contributions
irement-1.791
akura-1.780
ashi-1.683
 Turing-1.676
Mother-1.666
Token magnet
Token ID19972
Feature activation-5.35e+00

Pos logit contributions
 WIN+0.956
annabin+0.916
 backward+0.870
itably+0.867
oyd+0.865
Neg logit contributions
irement-1.262
akura-1.233
Mother-1.211
mother-1.210
 Turing-1.208
Token?
Token ID30
Feature activation-4.02e+00

Pos logit contributions
 WIN+2.913
annabin+2.721
 backward+2.695
itably+2.673
oyd+2.603
Neg logit contributions
akura-4.000
irement-3.936
ashi-3.873
 Turing-3.805
Mother-3.791
Token
Token ID198
Feature activation-4.26e+00

Pos logit contributions
 WIN+2.343
annabin+2.151
 backward+2.140
oyd+2.078
 GOD+2.078
Neg logit contributions
akura-2.891
irement-2.884
ashi-2.803
Mother-2.774
mother-2.744
Token
Token ID198
Feature activation-2.40e+00

Pos logit contributions
 WIN+2.639
annabin+2.496
 backward+2.437
itably+2.375
oyd+2.348
Neg logit contributions
irement-2.957
akura-2.944
ashi-2.831
 Turing-2.801
Mother-2.787
TokenReviewer
Token ID35407
Feature activation+2.65e+00

Pos logit contributions
 WIN+2.107
annabin+2.046
 backward+1.997
itably+1.978
oyd+1.967
Neg logit contributions
irement-1.040
akura-1.028
ashi-0.967
 Turing-0.963
Mother-0.949
Token:
Token ID25
Feature activation+3.31e+00

Pos logit contributions
irement+2.022
mother+2.004
Mother+1.991
ashi+1.969
akura+1.961
Neg logit contributions
 WIN-1.376
annabin-1.374
oyd-1.282
 backward-1.261
ospace-1.258
Token original
Token ID2656
Feature activation+9.12e-01

Pos logit contributions
irement+0.549
akura+0.543
 Turing+0.522
ashi+0.520
Mother+0.514
Neg logit contributions
 WIN-0.873
annabin-0.849
itably-0.820
 backward-0.820
oyd-0.813
Token copy
Token ID4866
Feature activation-2.08e+00

Pos logit contributions
irement+0.478
akura+0.473
ashi+0.453
 Turing+0.452
Mother+0.447
Neg logit contributions
 WIN-0.717
annabin-0.695
 backward-0.673
itably-0.668
oyd-0.664
Token.
Token ID13
Feature activation+7.62e-02

Pos logit contributions
 WIN+1.291
annabin+1.230
 backward+1.205
itably+1.169
oyd+1.151
Neg logit contributions
irement-1.440
akura-1.433
ashi-1.378
 Turing-1.365
Mother-1.362
Token
Token ID198
Feature activation+2.00e-01

Pos logit contributions
irement+0.047
akura+0.046
 Turing+0.045
ashi+0.045
Mother+0.044
Neg logit contributions
 WIN-0.053
annabin-0.051
 backward-0.049
itably-0.049
oyd-0.049
Token
Token ID198
Feature activation-7.07e-01

Pos logit contributions
irement+0.123
akura+0.121
Mother+0.118
 Turing+0.118
ashi+0.118
Neg logit contributions
 WIN-0.137
annabin-0.134
 backward-0.129
itably-0.128
oyd-0.128
Tokenshould
Token ID21754
Feature activation-5.32e+00

Pos logit contributions
 WIN+0.429
annabin+0.410
 backward+0.397
itably+0.389
oyd+0.385
Neg logit contributions
irement-0.499
akura-0.495
ashi-0.479
 Turing-0.476
Mother-0.475
Token be
Token ID307
Feature activation-2.00e+00

Pos logit contributions
 WIN+2.787
annabin+2.623
 backward+2.610
oyd+2.422
itably+2.400
Neg logit contributions
akura-4.075
irement-4.015
ashi-3.945
Mother-3.926
mother-3.912
Token clone
Token ID17271
Feature activation-1.71e+00

Pos logit contributions
 WIN+1.799
annabin+1.733
 backward+1.721
itably+1.676
oyd+1.655
Neg logit contributions
irement-0.822
akura-0.817
ashi-0.782
Mother-0.772
mother-0.766
Tokenable
Token ID540
Feature activation+3.61e-01

Pos logit contributions
 WIN+0.770
annabin+0.720
 backward+0.681
oyd+0.667
itably+0.663
Neg logit contributions
irement-1.478
akura-1.468
ashi-1.433
 Turing-1.425
Mother-1.422
Token,
Token ID11
Feature activation-7.79e-01

Pos logit contributions
irement+0.222
akura+0.218
 Turing+0.214
Mother+0.213
mother+0.212
Neg logit contributions
 WIN-0.246
annabin-0.241
 backward-0.230
itably-0.230
oyd-0.229
Token and
Token ID290
Feature activation-5.16e-01

Pos logit contributions
 WIN+0.483
annabin+0.467
 backward+0.445
itably+0.444
oyd+0.441
Neg logit contributions
irement-0.535
akura-0.531
 Turing-0.515
ashi-0.514
Mother-0.511
Token of
Token ID286
Feature activation-1.32e+00

Pos logit contributions
 WIN+0.209
annabin+0.201
 backward+0.193
itably+0.191
oyd+0.189
Neg logit contributions
irement-0.218
akura-0.216
ashi-0.209
 Turing-0.208
Mother-0.207
Token a
Token ID257
Feature activation-6.72e-01

Pos logit contributions
 WIN+0.843
annabin+0.806
 backward+0.784
itably+0.767
oyd+0.760
Neg logit contributions
irement-0.894
akura-0.888
ashi-0.857
 Turing-0.849
Mother-0.848
Token triple
Token ID15055
Feature activation-1.22e+00

Pos logit contributions
 WIN+0.417
annabin+0.400
 backward+0.384
itably+0.381
oyd+0.377
Neg logit contributions
irement-0.463
akura-0.459
ashi-0.444
 Turing-0.444
Mother-0.440
Token revolution
Token ID5854
Feature activation+4.05e-01

Pos logit contributions
 WIN+0.732
annabin+0.698
 backward+0.677
itably+0.663
oyd+0.656
Neg logit contributions
irement-0.868
akura-0.862
ashi-0.834
 Turing-0.828
Mother-0.826
Token of
Token ID286
Feature activation-4.71e-01

Pos logit contributions
irement+0.277
akura+0.275
ashi+0.267
 Turing+0.265
Mother+0.263
Neg logit contributions
 WIN-0.252
annabin-0.243
itably-0.232
 backward-0.231
oyd-0.230
Token which
Token ID543
Feature activation-5.31e+00

Pos logit contributions
 WIN+0.281
annabin+0.269
 backward+0.259
itably+0.256
oyd+0.254
Neg logit contributions
irement-0.336
akura-0.333
ashi-0.322
 Turing-0.321
Mother-0.320
Token technological
Token ID14614
Feature activation-8.77e-02

Pos logit contributions
 WIN+3.327
annabin+3.127
 backward+3.115
itably+2.961
oyd+2.952
Neg logit contributions
irement-3.648
akura-3.629
ashi-3.490
mother-3.438
Mother-3.437
Token revolution
Token ID5854
Feature activation+9.38e-01

Pos logit contributions
 WIN+0.049
annabin+0.047
itably+0.044
oyd+0.044
 backward+0.044
Neg logit contributions
irement-0.066
akura-0.065
 Turing-0.064
ashi-0.064
Mother-0.063
Token -
Token ID532
Feature activation+8.79e-01

Pos logit contributions
irement+0.597
akura+0.591
 Turing+0.571
ashi+0.570
Mother+0.566
Neg logit contributions
 WIN-0.627
annabin-0.604
 backward-0.581
itably-0.580
oyd-0.574
Token with
Token ID351
Feature activation+9.50e-01

Pos logit contributions
irement+0.602
akura+0.599
 Turing+0.586
ashi+0.579
Mother+0.577
Neg logit contributions
 WIN-0.544
annabin-0.522
itably-0.500
 backward-0.499
oyd-0.498
Token the
Token ID262
Feature activation+1.65e-01

Pos logit contributions
irement+0.647
akura+0.641
 Turing+0.629
ashi+0.622
Mother+0.616
Neg logit contributions
 WIN-0.591
annabin-0.571
itably-0.544
 backward-0.543
oyd-0.542
Token allow
Token ID1249
Feature activation+2.04e-01

Pos logit contributions
irement+0.165
akura+0.163
ashi+0.159
 Turing+0.158
mother+0.155
Neg logit contributions
 WIN-0.160
annabin-0.156
 backward-0.148
itably-0.147
oyd-0.147
Token.
Token ID13
Feature activation+5.84e-02

Pos logit contributions
irement+0.135
akura+0.134
ashi+0.129
 Turing+0.129
Mother+0.128
Neg logit contributions
 WIN-0.132
annabin-0.127
 backward-0.122
itably-0.120
oyd-0.120
Token
Token ID198
Feature activation+3.93e-01

Pos logit contributions
irement+0.036
akura+0.036
 Turing+0.035
Mother+0.035
ashi+0.035
Neg logit contributions
 WIN-0.040
annabin-0.039
 backward-0.037
itably-0.037
oyd-0.037
Token
Token ID198
Feature activation-4.24e-01

Pos logit contributions
irement+0.237
akura+0.233
Mother+0.229
 Turing+0.229
mother+0.229
Neg logit contributions
 WIN-0.272
annabin-0.269
 backward-0.257
oyd-0.256
itably-0.256
TokenP
Token ID47
Feature activation-5.14e-02

Pos logit contributions
 WIN+0.281
annabin+0.270
 backward+0.262
itably+0.257
oyd+0.255
Neg logit contributions
irement-0.274
akura-0.272
Mother-0.266
ashi-0.263
mother-0.262
Token R
Token ID371
Feature activation-5.31e+00

Pos logit contributions
 WIN+0.036
annabin+0.034
 backward+0.034
itably+0.033
oyd+0.033
Neg logit contributions
irement-0.032
akura-0.031
ashi-0.030
 Turing-0.030
Mother-0.030
TokenIM
Token ID3955
Feature activation+2.86e-01

Pos logit contributions
 WIN+3.381
annabin+3.173
 backward+3.105
oyd+3.013
 GOD+3.006
Neg logit contributions
irement-3.544
akura-3.526
Mother-3.393
mother-3.370
 Turing-3.359
TokenA
Token ID32
Feature activation-1.71e-01

Pos logit contributions
irement+0.192
akura+0.190
ashi+0.183
 Turing+0.182
Mother+0.182
Neg logit contributions
 WIN-0.183
annabin-0.177
 backward-0.170
itably-0.169
oyd-0.168
Token P
Token ID350
Feature activation-2.25e+00

Pos logit contributions
 WIN+0.105
annabin+0.102
 backward+0.098
itably+0.096
oyd+0.096
Neg logit contributions
irement-0.119
akura-0.117
ashi-0.114
 Turing-0.113
Mother-0.113
Token AR
Token ID5923
Feature activation-2.62e+00

Pos logit contributions
 WIN+1.369
annabin+1.300
 backward+1.262
itably+1.237
oyd+1.223
Neg logit contributions
irement-1.585
akura-1.575
ashi-1.521
 Turing-1.512
Mother-1.509
TokenS
Token ID50
Feature activation-6.53e-01

Pos logit contributions
 WIN+1.649
annabin+1.559
 backward+1.516
itably+1.488
oyd+1.470
Neg logit contributions
irement-1.797
akura-1.787
ashi-1.718
 Turing-1.707
Mother-1.701
Token the
Token ID262
Feature activation+4.96e-02

Pos logit contributions
irement+0.328
akura+0.326
ashi+0.316
 Turing+0.313
Mother+0.312
Neg logit contributions
 WIN-0.282
annabin-0.268
 backward-0.261
itably-0.255
oyd-0.252
Token air
Token ID1633
Feature activation+8.38e-01

Pos logit contributions
irement+0.032
akura+0.032
ashi+0.031
 Turing+0.031
Mother+0.031
Neg logit contributions
 WIN-0.033
annabin-0.031
 backward-0.030
itably-0.030
oyd-0.030
Token?
Token ID30
Feature activation-1.39e+00

Pos logit contributions
irement+0.593
akura+0.588
ashi+0.570
 Turing+0.567
Mother+0.564
Neg logit contributions
 WIN-0.502
annabin-0.481
 backward-0.459
itably-0.457
oyd-0.453
Token Indeed
Token ID9676
Feature activation-4.91e-01

Pos logit contributions
 WIN+0.922
annabin+0.882
 backward+0.858
itably+0.842
oyd+0.834
Neg logit contributions
irement-0.907
akura-0.899
ashi-0.867
 Turing-0.860
Mother-0.858
Token f
Token ID277
Feature activation-2.30e+00

Pos logit contributions
 WIN+0.320
annabin+0.310
 backward+0.298
itably+0.294
oyd+0.292
Neg logit contributions
irement-0.322
akura-0.320
ashi-0.309
 Turing-0.308
Mother-0.305
Token out
Token ID503
Feature activation-5.29e+00

Pos logit contributions
 WIN+1.533
annabin+1.465
 backward+1.433
itably+1.399
oyd+1.393
Neg logit contributions
irement-1.481
akura-1.467
ashi-1.417
 Turing-1.411
Mother-1.399
Tokeng
Token ID70
Feature activation-3.84e+00

Pos logit contributions
 WIN+2.498
annabin+2.351
 backward+2.256
itably+2.202
oyd+2.173
Neg logit contributions
irement-4.414
akura-4.382
ashi-4.271
 Turing-4.245
Mother-4.222
Tokenassing
Token ID19696
Feature activation-2.04e+00

Pos logit contributions
 WIN+3.054
annabin+2.949
 backward+2.879
itably+2.837
oyd+2.817
Neg logit contributions
irement-1.987
akura-1.969
ashi-1.881
 Turing-1.860
Mother-1.855
Token (
Token ID357
Feature activation-1.42e+00

Pos logit contributions
 WIN+1.413
annabin+1.352
 backward+1.321
itably+1.294
oyd+1.281
Neg logit contributions
irement-1.269
akura-1.260
ashi-1.211
 Turing-1.201
Mother-1.198
Tokent
Token ID83
Feature activation-4.66e-01

Pos logit contributions
 WIN+0.845
annabin+0.808
 backward+0.780
itably+0.766
oyd+0.758
Neg logit contributions
irement-1.011
akura-1.005
ashi-0.971
 Turing-0.963
Mother-0.958
Token)
Token ID8
Feature activation-1.67e-01

Pos logit contributions
 WIN+0.243
annabin+0.230
 backward+0.223
itably+0.217
oyd+0.214
Neg logit contributions
irement-0.366
akura-0.364
ashi-0.353
 Turing-0.352
Mother-0.350
Token_
Token ID62
Feature activation-3.07e-01

Pos logit contributions
 WIN+0.689
annabin+0.664
 backward+0.646
itably+0.636
oyd+0.631
Neg logit contributions
irement-0.556
akura-0.552
ashi-0.530
 Turing-0.525
Mother-0.524
Tokenre
Token ID260
Feature activation-1.90e+00

Pos logit contributions
 WIN+0.147
annabin+0.138
 backward+0.133
itably+0.129
oyd+0.128
Neg logit contributions
irement-0.256
akura-0.254
ashi-0.246
 Turing-0.246
Mother-0.245
Tokenpos
Token ID1930
Feature activation-8.86e-03

Pos logit contributions
 WIN+1.103
annabin+1.051
 backward+1.016
itably+0.994
oyd+0.982
Neg logit contributions
irement-1.391
akura-1.380
ashi-1.339
mother-1.323
Mother-1.323
Token.
Token ID764
Feature activation-9.82e-01

Pos logit contributions
 WIN+0.005
annabin+0.005
 backward+0.004
itably+0.004
oyd+0.004
Neg logit contributions
irement-0.007
akura-0.007
ashi-0.006
 Turing-0.006
Mother-0.006
Token contents
Token ID10154
Feature activation-3.45e+00

Pos logit contributions
 WIN+0.632
annabin+0.603
 backward+0.592
itably+0.575
oyd+0.570
Neg logit contributions
irement-0.657
akura-0.653
ashi-0.632
Mother-0.625
 Turing-0.624
Token ]
Token ID2361
Feature activation-5.28e+00

Pos logit contributions
 WIN+1.593
annabin+1.440
 backward+1.433
oyd+1.352
itably+1.344
Neg logit contributions
irement-2.909
akura-2.901
ashi-2.840
 Turing-2.816
Mother-2.813
Token WHERE
Token ID33411
Feature activation-3.48e+00

Pos logit contributions
 WIN+3.403
 backward+3.163
annabin+3.142
 GOD+3.029
itably+3.008
Neg logit contributions
irement-3.542
akura-3.528
ashi-3.406
 Turing-3.353
Mother-3.347
Token id
Token ID4686
Feature activation-1.20e+00

Pos logit contributions
 WIN+1.976
annabin+1.828
 backward+1.819
oyd+1.726
itably+1.726
Neg logit contributions
akura-2.613
irement-2.612
ashi-2.528
Mother-2.501
mother-2.494
Token IN
Token ID3268
Feature activation-1.68e+00

Pos logit contributions
 WIN+0.787
annabin+0.754
 backward+0.733
itably+0.720
oyd+0.713
Neg logit contributions
irement-0.790
akura-0.784
ashi-0.756
 Turing-0.751
Mother-0.749
Token (
Token ID357
Feature activation-2.09e+00

Pos logit contributions
 WIN+1.213
annabin+1.152
 backward+1.135
oyd+1.106
itably+1.104
Neg logit contributions
irement-1.008
akura-1.001
ashi-0.964
 Turing-0.950
Mother-0.948
Token SELECT
Token ID33493
Feature activation-1.49e+00

Pos logit contributions
 WIN+1.424
annabin+1.361
 backward+1.332
itably+1.298
oyd+1.295
Neg logit contributions
irement-1.320
akura-1.312
ashi-1.265
 Turing-1.254
Mother-1.248
Token 100
Token ID1802
Feature activation-4.72e+00

Pos logit contributions
 WIN+1.863
annabin+1.748
 backward+1.698
itably+1.654
oyd+1.638
Neg logit contributions
irement-2.548
akura-2.533
ashi-2.450
Mother-2.430
 Turing-2.430
Token%
Token ID4
Feature activation-3.32e+00

Pos logit contributions
 WIN+2.732
annabin+2.524
 backward+2.449
itably+2.386
 GOD+2.381
Neg logit contributions
irement-3.504
akura-3.490
ashi-3.371
Mother-3.327
mother-3.316
Token ON
Token ID6177
Feature activation-1.64e+00

Pos logit contributions
 WIN+2.133
annabin+2.015
 backward+1.966
itably+1.921
oyd+1.898
Neg logit contributions
irement-2.244
akura-2.231
ashi-2.149
Mother-2.127
 Turing-2.123
Token ALL
Token ID11096
Feature activation-1.16e+00

Pos logit contributions
 WIN+0.917
annabin+0.899
 backward+0.859
itably+0.850
oyd+0.839
Neg logit contributions
irement-1.214
akura-1.202
ashi-1.169
 Turing-1.167
Mother-1.161
Token PERSON
Token ID46740
Feature activation-1.57e+00

Pos logit contributions
 WIN+0.658
annabin+0.638
 backward+0.614
itably+0.605
oyd+0.597
Neg logit contributions
irement-0.855
akura-0.848
ashi-0.823
 Turing-0.822
Mother-0.818
TokenEL
Token ID3698
Feature activation-5.26e+00

Pos logit contributions
 WIN+1.223
annabin+1.181
 backward+1.152
itably+1.135
oyd+1.127
Neg logit contributions
irement-0.836
akura-0.828
ashi-0.792
 Turing-0.785
Mother-0.782
Token UPDATE
Token ID35717
Feature activation-1.42e+00

Pos logit contributions
 WIN+3.175
annabin+2.893
 backward+2.846
 GOD+2.769
 AZ+2.735
Neg logit contributions
irement-3.786
akura-3.772
ashi-3.626
Mother-3.586
mother-3.578
Token:
Token ID25
Feature activation-1.30e+00

Pos logit contributions
 WIN+0.641
annabin+0.602
 backward+0.576
itably+0.560
oyd+0.553
Neg logit contributions
irement-1.218
akura-1.212
ashi-1.179
 Turing-1.172
Mother-1.170
Token 05
Token ID8870
Feature activation-1.25e+00

Pos logit contributions
 WIN+0.516
annabin+0.478
 backward+0.455
itably+0.440
oyd+0.433
Neg logit contributions
irement-1.195
akura-1.189
ashi-1.159
 Turing-1.152
Mother-1.151
Token45
Token ID2231
Feature activation-7.56e-01

Pos logit contributions
 WIN+0.952
annabin+0.919
 backward+0.896
itably+0.884
oyd+0.876
Neg logit contributions
irement-0.691
akura-0.686
ashi-0.657
 Turing-0.652
Mother-0.650
Tokenz
Token ID89
Feature activation+4.58e-01

Pos logit contributions
 WIN+0.572
annabin+0.555
 backward+0.539
itably+0.533
oyd+0.525
Neg logit contributions
irement-0.418
akura-0.418
ashi-0.402
mother-0.394
Mother-0.394
TokenP
Token ID47
Feature activation-3.96e-01

Pos logit contributions
 WIN+0.659
annabin+0.632
 backward+0.613
itably+0.602
oyd+0.597
Neg logit contributions
irement-0.668
akura-0.662
ashi-0.640
Mother-0.639
 Turing-0.635
Token R
Token ID371
Feature activation-1.26e+00

Pos logit contributions
 WIN+0.242
annabin+0.232
 backward+0.225
itably+0.220
oyd+0.218
Neg logit contributions
irement-0.277
akura-0.276
ashi-0.267
 Turing-0.265
Mother-0.264
TokenIM
Token ID3955
Feature activation-1.18e+00

Pos logit contributions
 WIN+0.907
annabin+0.868
 backward+0.845
itably+0.828
oyd+0.824
Neg logit contributions
irement-0.750
akura-0.746
Mother-0.711
ashi-0.708
mother-0.706
TokenA
Token ID32
Feature activation-5.32e-01

Pos logit contributions
 WIN+0.668
annabin+0.628
 backward+0.606
itably+0.592
 GOD+0.586
Neg logit contributions
irement-0.887
akura-0.878
ashi-0.850
 Turing-0.846
mother-0.841
Token S
Token ID311
Feature activation+1.39e-01

Pos logit contributions
 WIN+0.318
annabin+0.305
 backward+0.295
itably+0.289
oyd+0.286
Neg logit contributions
irement-0.379
akura-0.377
ashi-0.365
 Turing-0.362
Mother-0.361
Token EC
Token ID13182
Feature activation-5.22e+00

Pos logit contributions
irement+0.104
akura+0.103
ashi+0.101
 Turing+0.100
Mother+0.099
Neg logit contributions
 WIN-0.078
annabin-0.076
 backward-0.072
itably-0.072
oyd-0.071
TokenUN
Token ID4944
Feature activation-3.02e+00

Pos logit contributions
 WIN+3.154
annabin+2.960
 backward+2.872
 GOD+2.808
oyd+2.765
Neg logit contributions
irement-3.695
akura-3.674
ashi-3.536
 Turing-3.502
Mother-3.502
TokenD
Token ID35
Feature activation-1.11e+00

Pos logit contributions
 WIN+1.910
annabin+1.818
 backward+1.764
itably+1.730
oyd+1.711
Neg logit contributions
irement-2.059
akura-2.049
ashi-1.973
 Turing-1.956
Mother-1.949
Token
Token ID127
Feature activation-2.01e+00

Pos logit contributions
 WIN+0.697
annabin+0.671
 backward+0.649
itably+0.639
oyd+0.632
Neg logit contributions
irement-0.759
akura-0.753
ashi-0.728
 Turing-0.723
Mother-0.721
Token
Token ID228
Feature activation-3.88e+00

Pos logit contributions
 WIN+1.446
annabin+1.394
 backward+1.357
itably+1.336
oyd+1.325
Neg logit contributions
irement-1.185
akura-1.175
ashi-1.129
 Turing-1.120
Mother-1.117
Token P
Token ID350
Feature activation-1.96e+00

Pos logit contributions
 WIN+2.423
annabin+2.276
 backward+2.212
itably+2.162
 GOD+2.156
Neg logit contributions
irement-2.682
akura-2.673
ashi-2.573
 Turing-2.542
Mother-2.529
Token in
Token ID287
Feature activation-2.64e-01

Pos logit contributions
irement+0.151
akura+0.150
ashi+0.145
 Turing+0.145
Mother+0.143
Neg logit contributions
 WIN-0.141
annabin-0.137
itably-0.130
oyd-0.129
 backward-0.128
Token 3
Token ID513
Feature activation+1.06e+00

Pos logit contributions
 WIN+0.169
annabin+0.164
 backward+0.157
itably+0.155
oyd+0.155
Neg logit contributions
irement-0.175
akura-0.174
ashi-0.168
 Turing-0.168
Mother-0.167
TokenD
Token ID35
Feature activation+1.26e+00

Pos logit contributions
irement+0.701
akura+0.696
Mother+0.670
ashi+0.670
 Turing+0.668
Neg logit contributions
 WIN-0.675
annabin-0.652
 backward-0.627
itably-0.622
oyd-0.615
Token with
Token ID351
Feature activation+6.35e-01

Pos logit contributions
irement+0.806
akura+0.805
 Turing+0.773
ashi+0.773
Mother+0.764
Neg logit contributions
 WIN-0.832
annabin-0.809
itably-0.778
 backward-0.768
oyd-0.758
Token To
Token ID1675
Feature activation-3.36e+00

Pos logit contributions
irement+0.419
akura+0.414
ashi+0.401
 Turing+0.401
Mother+0.397
Neg logit contributions
 WIN-0.409
annabin-0.396
 backward-0.379
itably-0.378
oyd-0.375
Token make
Token ID787
Feature activation-5.20e+00

Pos logit contributions
 WIN+2.318
annabin+2.219
 backward+2.162
itably+2.115
oyd+2.097
Neg logit contributions
irement-2.096
akura-2.059
Mother-1.984
ashi-1.983
mother-1.968
Token this
Token ID428
Feature activation-4.33e+00

Pos logit contributions
 WIN+3.593
annabin+3.424
 backward+3.395
itably+3.293
 GOD+3.214
Neg logit contributions
irement-3.233
akura-3.213
ashi-3.039
Mother-3.021
 Turing-2.997
Token possible
Token ID1744
Feature activation-1.74e+00

Pos logit contributions
 WIN+1.635
annabin+1.504
 backward+1.504
itably+1.408
 GOD+1.341
Neg logit contributions
irement-4.010
akura-4.005
ashi-3.903
 Turing-3.854
Mother-3.847
Token,
Token ID11
Feature activation-3.03e+00

Pos logit contributions
 WIN+1.146
annabin+1.108
 backward+1.068
itably+1.053
oyd+1.049
Neg logit contributions
irement-1.129
akura-1.117
ashi-1.084
Mother-1.077
 Turing-1.077
Tokenin
Token ID259
Feature activation-2.18e+00

Pos logit contributions
 WIN+1.874
annabin+1.795
 backward+1.736
itably+1.708
oyd+1.692
Neg logit contributions
irement-2.102
akura-2.088
ashi-2.020
 Turing-2.008
Mother-2.001
Token the
Token ID262
Feature activation-2.48e+00

Pos logit contributions
 WIN+1.310
annabin+1.240
 backward+1.216
itably+1.175
oyd+1.163
Neg logit contributions
irement-1.547
akura-1.530
ashi-1.480
 Turing-1.471
Mother-1.469
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
 WIN+0.949
annabin+0.907
 backward+0.880
itably+0.863
oyd+0.855
Neg logit contributions
irement-1.039
akura-1.032
ashi-0.997
 Turing-0.989
Mother-0.987
Token -
Token ID532
Feature activation-2.83e+00

Pos logit contributions
 WIN+0.477
annabin+0.463
 backward+0.448
itably+0.443
oyd+0.438
Neg logit contributions
irement-0.412
akura-0.410
ashi-0.395
 Turing-0.394
Mother-0.393
Token July
Token ID2901
Feature activation-5.09e+00

Pos logit contributions
 WIN+1.239
annabin+1.215
 backward+1.148
itably+1.121
oyd+1.109
Neg logit contributions
akura-2.436
irement-2.434
ashi-2.386
 Turing-2.362
mother-2.350
Token 6
Token ID718
Feature activation-6.18e+00

Pos logit contributions
 WIN+3.234
annabin+2.993
 backward+2.937
itably+2.871
oyd+2.861
Neg logit contributions
irement-3.489
akura-3.425
ashi-3.300
Mother-3.274
mother-3.249
Token,
Token ID11
Feature activation-4.65e+00

Pos logit contributions
 WIN+3.601
annabin+3.327
 backward+3.297
itably+3.172
075+3.170
Neg logit contributions
akura-4.530
irement-4.508
ashi-4.313
 Turing-4.260
Mother-4.189
Token 2010
Token ID3050
Feature activation-5.16e+00

Pos logit contributions
 WIN+3.114
annabin+2.905
 backward+2.900
itably+2.795
oyd+2.772
Neg logit contributions
akura-3.027
irement-3.017
ashi-2.911
Mother-2.811
 Turing-2.802
TokenR
Token ID49
Feature activation-1.59e+00

Pos logit contributions
 WIN+2.943
annabin+2.801
 backward+2.674
oyd+2.625
itably+2.600
Neg logit contributions
akura-3.805
irement-3.799
 Turing-3.642
ashi-3.624
mother-3.522
Tokeniff
Token ID733
Feature activation-2.60e+00

Pos logit contributions
 WIN+1.066
annabin+1.015
 backward+0.988
itably+0.971
 GOD+0.952
Neg logit contributions
irement-1.033
akura-1.015
ashi-0.993
Mother-0.971
 Turing-0.969
TokenTra
Token ID15721
Feature activation-1.81e+00

Pos logit contributions
 WIN+2.075
annabin+2.003
 backward+1.957
itably+1.926
oyd+1.913
Neg logit contributions
irement-1.336
akura-1.325
ashi-1.262
 Turing-1.250
Mother-1.241
Tokenx
Token ID87
Feature activation-3.42e+00

Pos logit contributions
 WIN+1.586
annabin+1.535
 backward+1.505
itably+1.484
oyd+1.474
Neg logit contributions
irement-0.793
akura-0.785
ashi-0.742
 Turing-0.732
Mother-0.729
Token
Token ID198
Feature activation-5.07e+00

Pos logit contributions
 WIN+2.096
annabin+1.994
 backward+1.931
itably+1.894
oyd+1.886
Neg logit contributions
irement-2.389
akura-2.366
ashi-2.272
 Turing-2.260
Mother-2.249
Token b
Token ID275
Feature activation-1.70e+00

Pos logit contributions
akura+0.193
irement+0.193
 Turing+0.188
ashi+0.187
Mother+0.186
Neg logit contributions
 WIN-0.148
annabin-0.147
itably-0.138
 backward-0.137
oyd-0.136
Tokenxc
Token ID25306
Feature activation+4.02e-01

Pos logit contributions
 WIN+1.168
annabin+1.121
 backward+1.095
itably+1.068
oyd+1.055
Neg logit contributions
akura-1.066
irement-1.063
ashi-1.028
Mother-1.019
mother-1.018
Token6
Token ID21
Feature activation-1.20e+00

Pos logit contributions
irement+0.248
akura+0.247
ashi+0.238
Mother+0.235
 Turing+0.234
Neg logit contributions
 WIN-0.277
annabin-0.268
 backward-0.262
itably-0.256
oyd-0.253
Token {
Token ID1391
Feature activation-7.42e-01

Pos logit contributions
 WIN+0.750
annabin+0.735
 backward+0.705
itably+0.697
oyd+0.689
Neg logit contributions
irement-0.808
akura-0.804
ashi-0.776
 Turing-0.774
mother-0.771
Token(
Token ID7
Feature activation-1.37e+00

Pos logit contributions
 WIN+0.580
annabin+0.563
 backward+0.549
itably+0.541
oyd+0.537
Neg logit contributions
irement-0.391
akura-0.387
ashi-0.372
Mother-0.371
mother-0.369
Token 6
Token ID718
Feature activation-5.15e+00

Pos logit contributions
 WIN+0.915
annabin+0.879
 backward+0.854
itably+0.839
oyd+0.831
Neg logit contributions
irement-0.883
akura-0.877
ashi-0.846
 Turing-0.840
Mother-0.838
Tokens
Token ID82
Feature activation-3.18e+00

Pos logit contributions
 WIN+3.390
annabin+3.188
 backward+3.157
itably+3.037
oyd+3.032
Neg logit contributions
irement-3.376
akura-3.303
 Turing-3.180
ashi-3.167
Mother-3.160
Token)}
Token ID38165
Feature activation+1.21e-01

Pos logit contributions
 WIN+2.386
annabin+2.289
 backward+2.236
itably+2.200
oyd+2.184
Neg logit contributions
irement-1.786
akura-1.770
ashi-1.696
 Turing-1.682
Mother-1.673
Token 26
Token ID2608
Feature activation-7.08e-01

Pos logit contributions
irement+0.072
akura+0.071
ashi+0.068
 Turing+0.067
Mother+0.067
Neg logit contributions
 WIN-0.088
annabin-0.084
 backward-0.082
itably-0.081
oyd-0.080
Token.
Token ID13
Feature activation-2.51e+00

Pos logit contributions
 WIN+0.471
annabin+0.463
 backward+0.444
itably+0.442
oyd+0.439
Neg logit contributions
irement-0.446
akura-0.444
Mother-0.432
 Turing-0.431
ashi-0.431
Token Rd
Token ID20769
Feature activation-3.61e-01

Pos logit contributions
 WIN+1.358
annabin+1.308
 backward+1.246
itably+1.232
oyd+1.218
Neg logit contributions
irement-1.917
akura-1.912
ashi-1.852
 Turing-1.851
Mother-1.838
TokenD
Token ID35
Feature activation+1.30e+00

Pos logit contributions
irement+0.993
Mother+0.954
akura+0.950
mother+0.908
 Turing+0.905
Neg logit contributions
 WIN-1.208
annabin-1.159
 backward-1.150
oyd-1.112
itably-1.099
Tokenigg
Token ID6950
Feature activation+1.63e+00

Pos logit contributions
irement+0.966
akura+0.956
ashi+0.937
Mother+0.915
 Turing+0.913
Neg logit contributions
 WIN-0.734
annabin-0.708
 backward-0.683
itably-0.668
ospace-0.654
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
irement+1.026
akura+1.015
ashi+0.982
 Turing+0.981
Mother+0.977
Neg logit contributions
 WIN-1.088
annabin-1.063
 backward-1.037
itably-1.028
oyd-1.014
Token
Token ID198
Feature activation-2.40e+00

Pos logit contributions
 WIN+1.678
annabin+1.622
 backward+1.570
itably+1.544
oyd+1.538
Neg logit contributions
irement-1.645
akura-1.627
Mother-1.572
ashi-1.571
 Turing-1.571
Tokenreddit
Token ID10748
Feature activation+6.71e-01

Pos logit contributions
 WIN+2.124
annabin+2.054
 backward+2.017
itably+1.979
oyd+1.975
Neg logit contributions
irement-1.043
akura-1.027
ashi-0.971
Mother-0.965
 Turing-0.955
Token
Token ID198
Feature activation-5.13e+00

Pos logit contributions
irement+0.390
akura+0.388
ashi+0.384
 Turing+0.379
mother+0.376
Neg logit contributions
 WIN-0.470
annabin-0.466
 backward-0.451
ospace-0.444
oyd-0.442
Token
Token ID198
Feature activation-5.03e+00

Pos logit contributions
 WIN+3.133
annabin+2.937
 backward+2.872
itably+2.803
075+2.759
Neg logit contributions
irement-3.613
akura-3.601
ashi-3.463
 Turing-3.402
Mother-3.382
TokenFacebook
Token ID12025
Feature activation-1.20e+00

Pos logit contributions
 WIN+2.819
annabin+2.664
 backward+2.585
oyd+2.548
itably+2.532
Neg logit contributions
irement-3.721
akura-3.685
 Turing-3.582
ashi-3.572
mother-3.461
Token
Token ID198
Feature activation-2.67e+00

Pos logit contributions
 WIN+0.818
annabin+0.801
 backward+0.775
itably+0.771
oyd+0.765
Neg logit contributions
irement-0.730
akura-0.720
 Turing-0.712
ashi-0.708
mother-0.707
Token
Token ID198
Feature activation-2.46e+00

Pos logit contributions
 WIN+1.732
annabin+1.662
 backward+1.612
itably+1.584
oyd+1.570
Neg logit contributions
irement-1.771
akura-1.758
ashi-1.697
 Turing-1.687
Mother-1.682
TokenSt
Token ID1273
Feature activation+2.38e-01

Pos logit contributions
 WIN+1.678
annabin+1.610
 backward+1.565
itably+1.538
oyd+1.526
Neg logit contributions
irement-1.549
akura-1.543
ashi-1.487
Mother-1.476
 Turing-1.469
TokenD
Token ID35
Feature activation+2.11e+00

Pos logit contributions
irement+1.106
Mother+1.059
akura+1.052
 Turing+1.010
mother+1.008
Neg logit contributions
 WIN-1.330
annabin-1.275
 backward-1.261
oyd-1.222
itably-1.211
Tokenigg
Token ID6950
Feature activation+1.75e+00

Pos logit contributions
irement+1.555
akura+1.535
ashi+1.507
Mother+1.461
 Turing+1.461
Neg logit contributions
 WIN-1.218
annabin-1.180
 backward-1.137
itably-1.111
ospace-1.095
Token
Token ID198
Feature activation-2.82e+00

Pos logit contributions
irement+1.110
akura+1.096
ashi+1.062
 Turing+1.059
Mother+1.054
Neg logit contributions
 WIN-1.167
annabin-1.135
 backward-1.109
itably-1.097
oyd-1.084
Token
Token ID198
Feature activation-2.61e+00

Pos logit contributions
 WIN+1.847
annabin+1.779
 backward+1.725
itably+1.694
oyd+1.684
Neg logit contributions
irement-1.846
akura-1.828
ashi-1.765
Mother-1.759
 Turing-1.759
Tokenreddit
Token ID10748
Feature activation+1.05e+00

Pos logit contributions
 WIN+2.260
annabin+2.189
 backward+2.141
itably+2.114
oyd+2.099
Neg logit contributions
irement-1.169
akura-1.157
ashi-1.097
 Turing-1.086
Mother-1.078
Token
Token ID198
Feature activation-5.12e+00

Pos logit contributions
irement+0.604
akura+0.597
ashi+0.594
 Turing+0.589
mother+0.580
Neg logit contributions
 WIN-0.745
annabin-0.739
 backward-0.716
ospace-0.706
oyd-0.699
Token
Token ID198
Feature activation-5.02e+00

Pos logit contributions
 WIN+3.132
annabin+2.942
 backward+2.876
itably+2.807
oyd+2.766
Neg logit contributions
irement-3.600
akura-3.589
ashi-3.451
 Turing-3.394
Mother-3.371
TokenFacebook
Token ID12025
Feature activation-7.90e-01

Pos logit contributions
 WIN+2.806
annabin+2.651
 backward+2.571
oyd+2.541
itably+2.517
Neg logit contributions
irement-3.716
akura-3.695
 Turing-3.573
ashi-3.573
mother-3.467
Token
Token ID198
Feature activation-2.54e+00

Pos logit contributions
 WIN+0.544
annabin+0.534
 backward+0.516
itably+0.515
oyd+0.511
Neg logit contributions
irement-0.473
akura-0.466
 Turing-0.464
mother-0.460
ashi-0.460
Token
Token ID198
Feature activation-2.37e+00

Pos logit contributions
 WIN+1.652
annabin+1.587
 backward+1.538
itably+1.512
oyd+1.499
Neg logit contributions
irement-1.680
akura-1.667
ashi-1.610
 Turing-1.601
Mother-1.596
TokenSt
Token ID1273
Feature activation+1.76e-01

Pos logit contributions
 WIN+1.635
annabin+1.567
 backward+1.523
itably+1.498
oyd+1.487
Neg logit contributions
irement-1.478
akura-1.475
ashi-1.422
Mother-1.414
 Turing-1.403
Token fully
Token ID3938
Feature activation+1.16e+00

Pos logit contributions
irement+0.664
akura+0.654
ashi+0.638
 Turing+0.633
Mother+0.630
Neg logit contributions
 WIN-0.621
annabin-0.594
 backward-0.571
itably-0.570
oyd-0.560
Token clothed
Token ID45361
Feature activation+1.66e+00

Pos logit contributions
irement+0.674
akura+0.666
 Turing+0.644
ashi+0.643
Mother+0.635
Neg logit contributions
 WIN-0.841
annabin-0.813
itably-0.783
 backward-0.781
oyd-0.773
Token and
Token ID290
Feature activation+1.80e+00

Pos logit contributions
irement+1.096
akura+1.076
 Turing+1.051
ashi+1.049
Mother+1.040
Neg logit contributions
 WIN-1.067
annabin-1.030
itably-0.983
 backward-0.980
oyd-0.972
Token Le
Token ID1004
Feature activation+1.44e+00

Pos logit contributions
irement+1.131
akura+1.127
 Turing+1.113
ashi+1.100
Mother+1.088
Neg logit contributions
 WIN-1.197
annabin-1.161
itably-1.107
 backward-1.098
ospace-1.097
Token Ch
Token ID609
Feature activation+9.91e-01

Pos logit contributions
irement+0.980
akura+0.976
ashi+0.948
 Turing+0.940
Mother+0.935
Neg logit contributions
 WIN-0.905
annabin-0.872
 backward-0.847
itably-0.825
oyd-0.812
Tokeniff
Token ID733
Feature activation-5.12e+00

Pos logit contributions
irement+0.646
akura+0.641
ashi+0.615
 Turing+0.607
mother+0.607
Neg logit contributions
 WIN-0.659
annabin-0.630
 backward-0.613
itably-0.603
oyd-0.589
Tokenre
Token ID260
Feature activation+2.89e-01

Pos logit contributions
 WIN+2.459
annabin+2.303
 backward+2.222
itably+2.149
oyd+2.137
Neg logit contributions
irement-4.219
akura-4.210
 Turing-4.089
ashi-4.081
Mother-4.072
Token (
Token ID357
Feature activation+3.36e-01

Pos logit contributions
irement+0.187
akura+0.186
ashi+0.180
 Turing+0.178
Mother+0.178
Neg logit contributions
 WIN-0.192
annabin-0.184
 backward-0.178
itably-0.175
oyd-0.174
TokenPeter
Token ID19727
Feature activation+1.72e+00

Pos logit contributions
irement+0.218
akura+0.217
ashi+0.210
Mother+0.209
 Turing+0.208
Neg logit contributions
 WIN-0.222
annabin-0.214
 backward-0.207
itably-0.203
oyd-0.201
Token Lor
Token ID26068
Feature activation+1.28e+00

Pos logit contributions
irement+1.012
akura+1.011
 Turing+0.978
ashi+0.967
Mother+0.960
Neg logit contributions
 WIN-1.229
annabin-1.195
 backward-1.152
itably-1.149
oyd-1.129
Tokenre
Token ID260
Feature activation+2.44e-01

Pos logit contributions
irement+0.992
akura+0.973
ashi+0.962
 Turing+0.943
mother+0.938
Neg logit contributions
 WIN-0.707
annabin-0.652
 backward-0.640
itably-0.624
075-0.610
Token bad
Token ID2089
Feature activation-5.09e-01

Pos logit contributions
irement+0.209
akura+0.208
ashi+0.203
 Turing+0.201
Mother+0.200
Neg logit contributions
 WIN-0.190
annabin-0.185
 backward-0.176
itably-0.175
oyd-0.174
Token news
Token ID1705
Feature activation-3.46e-01

Pos logit contributions
 WIN+0.330
annabin+0.317
 backward+0.306
itably+0.301
oyd+0.299
Neg logit contributions
irement-0.338
akura-0.336
ashi-0.324
 Turing-0.322
Mother-0.321
Token pl
Token ID458
Feature activation+4.90e-01

Pos logit contributions
 WIN+0.222
annabin+0.214
 backward+0.206
itably+0.204
oyd+0.203
Neg logit contributions
irement-0.231
akura-0.229
ashi-0.222
 Turing-0.220
Mother-0.219
Tokens
Token ID82
Feature activation-9.52e-01

Pos logit contributions
akura+0.297
irement+0.297
ashi+0.287
Mother+0.280
mother+0.280
Neg logit contributions
 WIN-0.344
annabin-0.332
 backward-0.322
itably-0.318
oyd-0.312
Token.
Token ID13
Feature activation-6.73e-01

Pos logit contributions
 WIN+0.598
annabin+0.576
 backward+0.556
itably+0.547
oyd+0.542
Neg logit contributions
irement-0.648
akura-0.644
ashi-0.624
 Turing-0.619
Mother-0.617
Token Good
Token ID4599
Feature activation-5.12e+00

Pos logit contributions
 WIN+0.425
annabin+0.412
 backward+0.396
itably+0.392
oyd+0.388
Neg logit contributions
irement-0.453
akura-0.452
ashi-0.437
 Turing-0.434
Mother-0.432
Token updates
Token ID5992
Feature activation-3.36e+00

Pos logit contributions
 WIN+3.545
annabin+3.377
 backward+3.348
itably+3.229
oyd+3.199
Neg logit contributions
irement-3.167
akura-3.164
ashi-3.020
 Turing-2.999
Mother-2.982
Token?
Token ID30
Feature activation-1.11e+00

Pos logit contributions
 WIN+2.132
annabin+2.041
 backward+1.979
itably+1.943
oyd+1.925
Neg logit contributions
irement-2.283
akura-2.265
ashi-2.188
 Turing-2.173
Mother-2.167
Token No
Token ID1400
Feature activation-8.86e-01

Pos logit contributions
 WIN+0.834
annabin+0.806
 backward+0.785
itably+0.774
oyd+0.768
Neg logit contributions
irement-0.626
akura-0.620
ashi-0.595
 Turing-0.592
Mother-0.589
Token more
Token ID517
Feature activation-7.29e-01

Pos logit contributions
 WIN+0.515
annabin+0.500
itably+0.471
oyd+0.470
 backward+0.466
Neg logit contributions
irement-0.641
akura-0.634
ashi-0.625
 Turing-0.621
mother-0.615
Token bad
Token ID2089
Feature activation-8.71e-01

Pos logit contributions
 WIN+0.559
annabin+0.555
oyd+0.528
itably+0.525
ospace+0.519
Neg logit contributions
irement-0.391
akura-0.379
ashi-0.375
 Turing-0.373
mother-0.370
Token was
Token ID373
Feature activation-1.30e-02

Pos logit contributions
irement+0.002
akura+0.002
ashi+0.002
 Turing+0.002
Mother+0.002
Neg logit contributions
 WIN-0.002
annabin-0.002
 backward-0.002
itably-0.002
oyd-0.002
Token later
Token ID1568
Feature activation+8.74e-01

Pos logit contributions
 WIN+0.008
annabin+0.008
 backward+0.008
itably+0.008
oyd+0.008
Neg logit contributions
akura-0.008
irement-0.008
ashi-0.008
 Turing-0.008
Mother-0.008
Token dismissed
Token ID11126
Feature activation+5.47e-01

Pos logit contributions
irement+0.460
akura+0.458
ashi+0.443
 Turing+0.438
Mother+0.430
Neg logit contributions
 WIN-0.680
annabin-0.664
 backward-0.636
itably-0.634
oyd-0.630
Token by
Token ID416
Feature activation+1.32e-01

Pos logit contributions
irement+0.363
akura+0.362
ashi+0.351
 Turing+0.346
mother+0.346
Neg logit contributions
 WIN-0.352
annabin-0.340
itably-0.325
 backward-0.324
oyd-0.320
Token the
Token ID262
Feature activation+3.98e-01

Pos logit contributions
irement+0.077
akura+0.076
ashi+0.074
 Turing+0.073
Mother+0.073
Neg logit contributions
 WIN-0.096
annabin-0.093
 backward-0.090
itably-0.089
oyd-0.088
Token Judge
Token ID8974
Feature activation+3.29e+00

Pos logit contributions
irement+0.230
akura+0.229
ashi+0.220
 Turing+0.220
Mother+0.219
Neg logit contributions
 WIN-0.288
annabin-0.280
 backward-0.271
itably-0.269
oyd-0.265
Token due
Token ID2233
Feature activation-5.60e-01

Pos logit contributions
irement+2.093
akura+2.084
ashi+2.032
mother+2.019
Mother+2.007
Neg logit contributions
 WIN-2.145
annabin-2.098
itably-2.008
 backward-1.998
oyd-1.973
Token to
Token ID284
Feature activation+1.07e+00

Pos logit contributions
 WIN+0.319
annabin+0.299
 backward+0.293
itably+0.285
oyd+0.281
Neg logit contributions
irement-0.417
akura-0.415
ashi-0.400
Mother-0.400
 Turing-0.398
Token lack
Token ID3092
Feature activation+3.46e-01

Pos logit contributions
irement+0.699
akura+0.696
ashi+0.674
 Turing+0.673
Mother+0.666
Neg logit contributions
 WIN-0.697
annabin-0.673
 backward-0.644
itably-0.641
oyd-0.636
Token of
Token ID286
Feature activation+9.96e-01

Pos logit contributions
irement+0.221
akura+0.219
ashi+0.214
mother+0.213
 Turing+0.212
Neg logit contributions
 WIN-0.230
annabin-0.222
itably-0.214
oyd-0.213
 backward-0.212
Token evidence
Token ID2370
Feature activation+1.23e-01

Pos logit contributions
irement+0.746
akura+0.746
 Turing+0.729
ashi+0.726
Mother+0.720
Neg logit contributions
 WIN-0.545
annabin-0.525
itably-0.493
oyd-0.493
 backward-0.492
TokenPath
Token ID15235
Feature activation+1.77e+00

Pos logit contributions
 WIN+0.946
annabin+0.908
 backward+0.886
itably+0.871
oyd+0.862
Neg logit contributions
irement-0.848
akura-0.845
ashi-0.811
 Turing-0.807
Mother-0.800
TokenType
Token ID6030
Feature activation+2.10e+00

Pos logit contributions
irement+1.145
akura+1.133
 Turing+1.095
Mother+1.094
ashi+1.092
Neg logit contributions
 WIN-1.142
annabin-1.127
 backward-1.074
itably-1.070
oyd-1.052
Token Leaf
Token ID14697
Feature activation-6.33e-01

Pos logit contributions
irement+1.313
akura+1.302
Mother+1.259
 Turing+1.253
ashi+1.243
Neg logit contributions
annabin-1.392
 WIN-1.383
 backward-1.323
itably-1.322
oyd-1.309
Token)
Token ID8
Feature activation-1.30e+00

Pos logit contributions
 WIN+0.433
annabin+0.414
 backward+0.403
itably+0.397
oyd+0.393
Neg logit contributions
irement-0.399
akura-0.395
ashi-0.381
 Turing-0.378
Mother-0.376
Token {
Token ID1391
Feature activation+2.05e+00

Pos logit contributions
 WIN+0.809
annabin+0.773
 backward+0.750
itably+0.735
oyd+0.728
Neg logit contributions
irement-0.893
akura-0.887
ashi-0.857
 Turing-0.851
Mother-0.848
Token $
Token ID720
Feature activation+3.26e+00

Pos logit contributions
irement+1.283
akura+1.273
ashi+1.224
Mother+1.223
 Turing+1.223
Neg logit contributions
 WIN-1.364
annabin-1.352
itably-1.289
 backward-1.280
oyd-1.265
Tokenb
Token ID65
Feature activation+4.96e-01

Pos logit contributions
irement+2.237
akura+2.235
ashi+2.168
Mother+2.167
mother+2.146
Neg logit contributions
 WIN-1.984
annabin-1.931
 backward-1.869
itably-1.821
oyd-1.794
Tokenmp
Token ID3149
Feature activation+4.03e-01

Pos logit contributions
irement+0.319
akura+0.316
ashi+0.305
Mother+0.303
mother+0.301
Neg logit contributions
 WIN-0.333
annabin-0.320
 backward-0.312
itably-0.305
oyd-0.300
Token =
Token ID796
Feature activation+1.88e+00

Pos logit contributions
irement+0.267
akura+0.265
ashi+0.255
Mother+0.253
 Turing+0.253
Neg logit contributions
 WIN-0.262
annabin-0.252
 backward-0.244
itably-0.239
oyd-0.237
Token new
Token ID649
Feature activation+2.23e+00

Pos logit contributions
irement+1.160
akura+1.154
ashi+1.108
 Turing+1.105
Mother+1.103
Neg logit contributions
 WIN-1.285
annabin-1.253
 backward-1.208
itably-1.205
oyd-1.188
Token-
Token ID12
Feature activation+1.58e+00

Pos logit contributions
irement+1.603
akura+1.590
ashi+1.539
 Turing+1.536
Mother+1.529
Neg logit contributions
 WIN-1.309
annabin-1.270
 backward-1.222
itably-1.199
oyd-1.188
Token thing
Token ID1517
Feature activation-2.76e-01

Pos logit contributions
irement+0.199
akura+0.198
ashi+0.192
 Turing+0.191
Mother+0.189
Neg logit contributions
 WIN-0.194
annabin-0.187
 backward-0.179
itably-0.178
oyd-0.177
Token.
Token ID13
Feature activation+5.44e-01

Pos logit contributions
 WIN+0.172
annabin+0.166
 backward+0.158
itably+0.158
oyd+0.156
Neg logit contributions
irement-0.190
akura-0.189
ashi-0.183
 Turing-0.182
mother-0.181
Token
Token ID198
Feature activation+7.38e-01

Pos logit contributions
irement+0.326
akura+0.326
 Turing+0.314
ashi+0.313
Mother+0.312
Neg logit contributions
 WIN-0.380
annabin-0.373
 backward-0.361
itably-0.356
oyd-0.355
Token
Token ID198
Feature activation+4.17e-01

Pos logit contributions
irement+0.437
akura+0.430
Mother+0.425
mother+0.424
 Turing+0.421
Neg logit contributions
 WIN-0.515
annabin-0.510
 backward-0.489
oyd-0.487
itably-0.486
TokenIGN
Token ID16284
Feature activation+4.45e+00

Pos logit contributions
irement+0.274
akura+0.273
Mother+0.266
ashi+0.264
mother+0.262
Neg logit contributions
 WIN-0.271
annabin-0.262
 backward-0.254
itably-0.249
oyd-0.247
Token:
Token ID25
Feature activation+4.07e+00

Pos logit contributions
Mother+3.232
irement+3.214
mother+3.209
 Turing+3.180
ashi+3.147
Neg logit contributions
annabin-2.375
 WIN-2.345
oyd-2.267
 backward-2.229
itably-2.218
Token The
Token ID383
Feature activation+7.51e-01

Pos logit contributions
 Turing+2.270
akura+2.205
irement+2.203
Mother+2.184
ashi+2.162
Neg logit contributions
annabin-2.943
 WIN-2.881
itably-2.832
oyd-2.816
 backward-2.810
Token armour
Token ID18588
Feature activation-8.43e-01

Pos logit contributions
irement+0.487
akura+0.483
ashi+0.468
 Turing+0.467
Mother+0.462
Neg logit contributions
 WIN-0.493
annabin-0.478
 backward-0.459
itably-0.457
oyd-0.453
Token option
Token ID3038
Feature activation+6.27e-01

Pos logit contributions
 WIN+0.532
annabin+0.510
 backward+0.494
itably+0.486
oyd+0.481
Neg logit contributions
irement-0.573
akura-0.569
ashi-0.550
 Turing-0.546
Mother-0.544
Token
Token ID447
Feature activation+1.48e+00

Pos logit contributions
irement+0.400
akura+0.396
ashi+0.383
 Turing+0.380
Mother+0.377
Neg logit contributions
 WIN-0.421
annabin-0.407
 backward-0.391
itably-0.388
oyd-0.386
Token
Token ID247
Feature activation+4.45e-01

Pos logit contributions
irement+0.947
akura+0.930
ashi+0.917
 Turing+0.893
Mother+0.888
Neg logit contributions
 WIN-0.990
annabin-0.974
 backward-0.919
itably-0.914
oyd-0.906
Tokenorted
Token ID9741
Feature activation-1.78e+00

Pos logit contributions
irement+0.003
akura+0.003
ashi+0.003
 Turing+0.003
Mother+0.003
Neg logit contributions
 WIN-0.006
annabin-0.006
 backward-0.006
itably-0.006
oyd-0.006
Token by
Token ID416
Feature activation-6.37e-02

Pos logit contributions
 WIN+1.135
annabin+1.091
 backward+1.056
itably+1.040
oyd+1.030
Neg logit contributions
irement-1.190
akura-1.180
ashi-1.145
 Turing-1.135
Mother-1.128
Token
Token ID198
Feature activation-6.33e-01

Pos logit contributions
 WIN+0.043
annabin+0.041
 backward+0.040
itably+0.039
oyd+0.039
Neg logit contributions
irement-0.040
akura-0.040
ashi-0.039
 Turing-0.039
Mother-0.038
Token
Token ID198
Feature activation-5.05e-01

Pos logit contributions
 WIN+0.429
annabin+0.417
 backward+0.402
itably+0.398
oyd+0.396
Neg logit contributions
irement-0.397
akura-0.392
ashi-0.381
Mother-0.381
 Turing-0.380
TokenL
Token ID43
Feature activation+4.54e-01

Pos logit contributions
 WIN+0.334
annabin+0.321
 backward+0.312
itably+0.306
oyd+0.303
Neg logit contributions
irement-0.328
akura-0.326
ashi-0.316
Mother-0.314
 Turing-0.312
Tokeninda
Token ID22261
Feature activation+2.70e+00

Pos logit contributions
irement+0.298
akura+0.297
ashi+0.289
 Turing+0.282
Mother+0.281
Neg logit contributions
 WIN-0.299
annabin-0.284
 backward-0.279
itably-0.272
075-0.267
Token C
Token ID327
Feature activation+2.18e-01

Pos logit contributions
irement+1.756
akura+1.746
 Turing+1.735
ashi+1.713
Mother+1.679
Neg logit contributions
 WIN-1.708
annabin-1.675
 backward-1.617
itably-1.600
ospace-1.589
Token.
Token ID13
Feature activation-5.13e-01

Pos logit contributions
irement+0.147
akura+0.146
ashi+0.141
 Turing+0.140
Mother+0.139
Neg logit contributions
 WIN-0.140
annabin-0.133
 backward-0.130
itably-0.127
oyd-0.126
Token Deg
Token ID25905
Feature activation-6.81e-01

Pos logit contributions
 WIN+0.354
annabin+0.340
 backward+0.331
itably+0.325
oyd+0.322
Neg logit contributions
irement-0.319
akura-0.317
ashi-0.305
 Turing-0.302
Mother-0.302
Tokenut
Token ID315
Feature activation-7.28e-01

Pos logit contributions
 WIN+0.450
annabin+0.428
 backward+0.418
itably+0.408
oyd+0.404
Neg logit contributions
irement-0.448
akura-0.445
ashi-0.430
 Turing-0.423
Mother-0.423
Tokenis
Token ID271
Feature activation-2.88e-01

Pos logit contributions
 WIN+0.487
annabin+0.467
 backward+0.454
itably+0.446
oyd+0.442
Neg logit contributions
irement-0.468
akura-0.464
ashi-0.447
 Turing-0.444
Mother-0.443
Token I
Token ID314
Feature activation+5.14e-01

Pos logit contributions
 WIN+0.381
annabin+0.366
 backward+0.354
itably+0.348
oyd+0.346
Neg logit contributions
irement-0.392
akura-0.389
ashi-0.377
 Turing-0.375
Mother-0.373
Token believe
Token ID1975
Feature activation+2.77e+00

Pos logit contributions
akura+0.334
irement+0.334
ashi+0.325
 Turing+0.321
Mother+0.317
Neg logit contributions
 WIN-0.335
annabin-0.325
 backward-0.314
itably-0.310
oyd-0.308
Token is
Token ID318
Feature activation+1.77e-01

Pos logit contributions
irement+1.757
akura+1.752
 Turing+1.730
ashi+1.693
Mother+1.688
Neg logit contributions
 WIN-1.803
annabin-1.775
 backward-1.686
oyd-1.679
itably-1.676
Token the
Token ID262
Feature activation+1.54e+00

Pos logit contributions
irement+0.118
akura+0.117
 Turing+0.114
ashi+0.114
Mother+0.113
Neg logit contributions
 WIN-0.113
annabin-0.110
 backward-0.105
itably-0.104
oyd-0.104
Token first
Token ID717
Feature activation+7.29e-01

Pos logit contributions
irement+1.028
akura+1.016
 Turing+0.995
ashi+0.990
Mother+0.980
Neg logit contributions
 WIN-0.975
annabin-0.947
 backward-0.899
oyd-0.897
itably-0.896
Token national
Token ID2260
Feature activation+2.72e+00

Pos logit contributions
irement+0.504
akura+0.499
 Turing+0.488
ashi+0.484
Mother+0.480
Neg logit contributions
 WIN-0.447
annabin-0.433
 backward-0.411
itably-0.411
oyd-0.408
Token survey
Token ID5526
Feature activation+4.40e-02

Pos logit contributions
irement+1.807
 Turing+1.801
akura+1.779
ashi+1.755
Mother+1.725
Neg logit contributions
 WIN-1.681
annabin-1.656
itably-1.585
oyd-1.566
ospace-1.556
Token on
Token ID319
Feature activation-4.93e-01

Pos logit contributions
irement+0.030
akura+0.030
ashi+0.029
 Turing+0.029
Mother+0.029
Neg logit contributions
 WIN-0.027
annabin-0.026
itably-0.025
 backward-0.025
oyd-0.025
Token the
Token ID262
Feature activation+6.19e-03

Pos logit contributions
 WIN+0.312
annabin+0.299
 backward+0.289
itably+0.285
oyd+0.283
Neg logit contributions
irement-0.334
akura-0.331
ashi-0.321
 Turing-0.319
Mother-0.317
Token pending
Token ID13310
Feature activation-2.81e-01

Pos logit contributions
irement+0.005
akura+0.005
 Turing+0.005
ashi+0.005
Mother+0.005
Neg logit contributions
 WIN-0.003
annabin-0.003
 backward-0.003
oyd-0.003
itably-0.003
Token proposal
Token ID6961
Feature activation+2.88e-01

Pos logit contributions
 WIN+0.177
annabin+0.171
 backward+0.165
itably+0.163
oyd+0.162
Neg logit contributions
irement-0.191
akura-0.189
 Turing-0.183
ashi-0.183
Mother-0.181
Token left
Token ID1364
Feature activation-5.72e-01

Pos logit contributions
irement+0.366
akura+0.360
 Turing+0.354
ashi+0.353
Mother+0.352
Neg logit contributions
 WIN-0.351
annabin-0.341
 backward-0.322
oyd-0.322
itably-0.321
Token,
Token ID11
Feature activation-5.05e-01

Pos logit contributions
 WIN+0.357
annabin+0.345
 backward+0.330
itably+0.327
oyd+0.325
Neg logit contributions
irement-0.391
akura-0.387
ashi-0.375
Mother-0.372
mother-0.372
Token other
Token ID584
Feature activation-2.02e+00

Pos logit contributions
 WIN+0.318
annabin+0.311
oyd+0.294
itably+0.294
 backward+0.293
Neg logit contributions
irement-0.339
akura-0.338
ashi-0.326
Mother-0.326
 Turing-0.325
Token than
Token ID621
Feature activation+4.09e-02

Pos logit contributions
 WIN+1.397
 backward+1.317
annabin+1.315
itably+1.283
oyd+1.250
Neg logit contributions
irement-1.254
akura-1.249
ashi-1.193
 Turing-1.191
Mother-1.177
Token what
Token ID644
Feature activation+4.75e-01

Pos logit contributions
irement+0.027
akura+0.026
ashi+0.026
 Turing+0.026
Mother+0.025
Neg logit contributions
 WIN-0.027
annabin-0.026
 backward-0.025
itably-0.025
oyd-0.025
Token was
Token ID373
Feature activation-1.76e-01

Pos logit contributions
irement+0.295
akura+0.294
ashi+0.285
 Turing+0.283
Mother+0.282
Neg logit contributions
 WIN-0.325
annabin-0.316
 backward-0.303
oyd-0.300
itably-0.300
Token reported
Token ID2098
Feature activation-2.58e-01

Pos logit contributions
 WIN+0.115
annabin+0.111
 backward+0.107
itably+0.105
oyd+0.105
Neg logit contributions
irement-0.115
akura-0.114
ashi-0.110
 Turing-0.110
Mother-0.109
Token by
Token ID416
Feature activation-1.29e-01

Pos logit contributions
 WIN+0.159
annabin+0.153
 backward+0.146
itably+0.145
oyd+0.144
Neg logit contributions
irement-0.180
akura-0.178
ashi-0.173
 Turing-0.172
Mother-0.171
Token these
Token ID777
Feature activation+3.67e-01

Pos logit contributions
 WIN+0.084
annabin+0.081
 backward+0.077
itably+0.076
oyd+0.076
Neg logit contributions
irement-0.085
akura-0.085
ashi-0.082
 Turing-0.082
Mother-0.082
Token mainstream
Token ID8661
Feature activation+4.54e-01

Pos logit contributions
irement+0.249
akura+0.246
ashi+0.240
 Turing+0.239
Mother+0.239
Neg logit contributions
 WIN-0.229
annabin-0.223
 backward-0.210
oyd-0.209
itably-0.209
Token outlets
Token ID12527
Feature activation-1.66e-02

Pos logit contributions
irement+0.303
akura+0.298
ashi+0.292
Mother+0.290
 Turing+0.289
Neg logit contributions
 WIN-0.290
annabin-0.281
 backward-0.267
itably-0.266
oyd-0.264
Token.
Token ID13
Feature activation+1.22e+00

Pos logit contributions
irement+0.573
akura+0.568
ashi+0.551
 Turing+0.549
Mother+0.541
Neg logit contributions
 WIN-0.534
annabin-0.516
itably-0.493
 backward-0.493
oyd-0.487
Token And
Token ID843
Feature activation-5.28e-02

Pos logit contributions
irement+0.744
akura+0.740
 Turing+0.730
ashi+0.717
Mother+0.716
Neg logit contributions
 WIN-0.831
annabin-0.817
 backward-0.792
itably-0.782
oyd-0.775
Token it
Token ID340
Feature activation-4.63e-01

Pos logit contributions
 WIN+0.033
annabin+0.032
 backward+0.031
itably+0.031
oyd+0.030
Neg logit contributions
irement-0.035
akura-0.035
ashi-0.034
 Turing-0.034
Mother-0.034
Token is
Token ID318
Feature activation+3.11e-01

Pos logit contributions
 WIN+0.310
annabin+0.297
 backward+0.289
itably+0.284
oyd+0.282
Neg logit contributions
irement-0.297
akura-0.295
ashi-0.284
 Turing-0.282
Mother-0.281
Token obvious
Token ID3489
Feature activation+1.09e+00

Pos logit contributions
irement+0.212
akura+0.210
 Turing+0.206
ashi+0.205
Mother+0.202
Neg logit contributions
 WIN-0.193
annabin-0.189
 backward-0.178
itably-0.178
oyd-0.176
Token that
Token ID326
Feature activation-1.20e-01

Pos logit contributions
irement+0.762
akura+0.757
 Turing+0.739
ashi+0.737
Mother+0.724
Neg logit contributions
 WIN-0.653
annabin-0.634
 backward-0.601
itably-0.596
oyd-0.590
Token the
Token ID262
Feature activation+4.29e-01

Pos logit contributions
 WIN+0.077
annabin+0.074
 backward+0.071
itably+0.070
oyd+0.070
Neg logit contributions
irement-0.080
akura-0.079
ashi-0.077
 Turing-0.077
Mother-0.076
Token player
Token ID2137
Feature activation+8.65e-01

Pos logit contributions
irement+0.315
akura+0.311
 Turing+0.304
ashi+0.304
Mother+0.300
Neg logit contributions
 WIN-0.246
annabin-0.236
 backward-0.227
itably-0.225
oyd-0.221
Token did
Token ID750
Feature activation-6.99e-01

Pos logit contributions
irement+0.560
akura+0.553
ashi+0.539
 Turing+0.537
Mother+0.531
Neg logit contributions
 WIN-0.569
annabin-0.552
 backward-0.529
itably-0.528
oyd-0.519
Token something
Token ID1223
Feature activation+6.33e-01

Pos logit contributions
 WIN+0.421
annabin+0.404
 backward+0.389
itably+0.383
oyd+0.379
Neg logit contributions
irement-0.495
akura-0.492
ashi-0.477
 Turing-0.474
Mother-0.471
Token bad
Token ID2089
Feature activation-1.52e+00

Pos logit contributions
irement+0.435
akura+0.433
 Turing+0.421
ashi+0.421
Mother+0.415
Neg logit contributions
 WIN-0.389
annabin-0.378
itably-0.359
 backward-0.355
oyd-0.353
Token more
Token ID517
Feature activation-5.61e-01

Pos logit contributions
irement+0.457
akura+0.453
ashi+0.440
Mother+0.437
 Turing+0.435
Neg logit contributions
 WIN-0.511
annabin-0.501
oyd-0.477
 backward-0.477
itably-0.475
Token than
Token ID621
Feature activation-1.44e-01

Pos logit contributions
 WIN+0.375
annabin+0.360
 backward+0.349
itably+0.344
oyd+0.341
Neg logit contributions
irement-0.361
akura-0.358
ashi-0.345
 Turing-0.342
Mother-0.342
Token Paul
Token ID3362
Feature activation-1.21e+00

Pos logit contributions
 WIN+0.096
annabin+0.093
 backward+0.089
itably+0.088
oyd+0.088
Neg logit contributions
irement-0.093
akura-0.092
ashi-0.089
 Turing-0.089
Mother-0.088
Token McCartney
Token ID44677
Feature activation-1.81e+00

Pos logit contributions
 WIN+0.865
annabin+0.830
 backward+0.809
itably+0.795
oyd+0.789
Neg logit contributions
irement-0.723
akura-0.717
ashi-0.689
 Turing-0.682
Mother-0.681
Token has
Token ID468
Feature activation-1.92e-01

Pos logit contributions
 WIN+1.139
annabin+1.086
 backward+1.056
itably+1.031
oyd+1.021
Neg logit contributions
irement-1.237
akura-1.227
ashi-1.185
 Turing-1.176
Mother-1.170
Token pulled
Token ID5954
Feature activation-5.78e-02

Pos logit contributions
 WIN+0.112
annabin+0.108
 backward+0.104
itably+0.103
oyd+0.102
Neg logit contributions
irement-0.138
akura-0.137
ashi-0.133
 Turing-0.133
Mother-0.132
Token in
Token ID287
Feature activation-2.28e+00

Pos logit contributions
 WIN+0.034
annabin+0.033
itably+0.032
 backward+0.032
oyd+0.031
Neg logit contributions
irement-0.041
akura-0.041
ashi-0.040
Mother-0.040
 Turing-0.040
Token,
Token ID11
Feature activation-1.20e+00

Pos logit contributions
 WIN+1.457
annabin+1.395
 backward+1.354
itably+1.329
oyd+1.317
Neg logit contributions
irement-1.536
akura-1.524
ashi-1.472
 Turing-1.461
Mother-1.457
Token
Token ID447
Feature activation-1.52e+00

Pos logit contributions
 WIN+0.778
annabin+0.746
 backward+0.723
itably+0.711
oyd+0.704
Neg logit contributions
irement-0.795
akura-0.790
ashi-0.762
 Turing-0.757
Mother-0.755
Token
Token ID251
Feature activation-3.57e-01

Pos logit contributions
 WIN+1.008
annabin+0.968
 backward+0.939
itably+0.923
oyd+0.915
Neg logit contributions
irement-0.992
akura-0.984
ashi-0.950
 Turing-0.942
Mother-0.939
Token Mr
Token ID1770
Feature activation-1.91e+00

Pos logit contributions
 WIN+0.233
annabin+0.225
 backward+0.217
itably+0.215
oyd+0.212
Neg logit contributions
irement-0.234
akura-0.232
ashi-0.226
Mother-0.223
 Turing-0.223
Token I
Token ID314
Feature activation+1.42e+00

Pos logit contributions
akura+0.484
irement+0.481
ashi+0.465
 Turing+0.463
Mother+0.460
Neg logit contributions
 WIN-0.560
annabin-0.545
 backward-0.523
itably-0.521
oyd-0.514
Token can
Token ID460
Feature activation+4.17e-02

Pos logit contributions
akura+0.911
irement+0.907
ashi+0.883
Mother+0.868
 Turing+0.867
Neg logit contributions
 WIN-0.939
annabin-0.911
 backward-0.879
oyd-0.871
itably-0.871
Token use
Token ID779
Feature activation-1.54e-01

Pos logit contributions
irement+0.026
akura+0.026
ashi+0.025
 Turing+0.025
Mother+0.025
Neg logit contributions
 WIN-0.028
annabin-0.027
 backward-0.026
itably-0.026
oyd-0.026
Token it
Token ID340
Feature activation+1.59e-01

Pos logit contributions
 WIN+0.101
annabin+0.098
 backward+0.094
itably+0.093
oyd+0.092
Neg logit contributions
irement-0.099
akura-0.099
ashi-0.096
 Turing-0.095
Mother-0.094
Token as
Token ID355
Feature activation+2.57e-01

Pos logit contributions
irement+0.108
akura+0.108
ashi+0.104
 Turing+0.103
Mother+0.103
Neg logit contributions
 WIN-0.101
annabin-0.097
 backward-0.092
itably-0.092
oyd-0.091
Token lap
Token ID14779
Feature activation-1.08e-01

Pos logit contributions
irement+0.170
akura+0.169
ashi+0.164
 Turing+0.162
Mother+0.161
Neg logit contributions
 WIN-0.166
annabin-0.160
 backward-0.154
itably-0.152
oyd-0.151
Tokenboard
Token ID3526
Feature activation+3.27e-01

Pos logit contributions
 WIN+0.083
annabin+0.080
 backward+0.078
itably+0.077
oyd+0.076
Neg logit contributions
irement-0.059
akura-0.058
ashi-0.056
 Turing-0.055
Mother-0.055
Token,
Token ID11
Feature activation+1.93e-01

Pos logit contributions
irement+0.216
akura+0.215
ashi+0.208
 Turing+0.206
Mother+0.206
Neg logit contributions
 WIN-0.213
annabin-0.205
 backward-0.197
itably-0.195
oyd-0.193
Token and
Token ID290
Feature activation+4.50e-02

Pos logit contributions
irement+0.125
akura+0.124
ashi+0.120
 Turing+0.119
Mother+0.118
Neg logit contributions
 WIN-0.128
annabin-0.123
 backward-0.118
itably-0.118
oyd-0.116
Token I
Token ID314
Feature activation-5.05e-02

Pos logit contributions
irement+0.027
akura+0.027
ashi+0.026
 Turing+0.025
Mother+0.025
Neg logit contributions
 WIN-0.032
annabin-0.031
 backward-0.030
itably-0.030
oyd-0.030
Token can
Token ID460
Feature activation+3.29e-01

Pos logit contributions
 WIN+0.032
annabin+0.031
 backward+0.030
itably+0.029
oyd+0.029
Neg logit contributions
akura-0.034
irement-0.034
ashi-0.033
Mother-0.033
 Turing-0.033
Token I
Token ID314
Feature activation+4.52e-01

Pos logit contributions
irement+0.014
akura+0.014
ashi+0.013
 Turing+0.013
Mother+0.013
Neg logit contributions
 WIN-0.014
annabin-0.014
 backward-0.013
itably-0.013
oyd-0.013
Token was
Token ID373
Feature activation+9.31e-01

Pos logit contributions
irement+0.284
akura+0.282
ashi+0.273
 Turing+0.271
Mother+0.269
Neg logit contributions
 WIN-0.307
annabin-0.297
 backward-0.286
itably-0.284
oyd-0.281
Token young
Token ID1862
Feature activation+1.35e+00

Pos logit contributions
irement+0.587
akura+0.583
ashi+0.565
 Turing+0.564
Mother+0.561
Neg logit contributions
 WIN-0.626
annabin-0.612
itably-0.582
 backward-0.582
oyd-0.576
Token and
Token ID290
Feature activation+3.69e-02

Pos logit contributions
irement+0.924
akura+0.918
ashi+0.887
 Turing+0.881
Mother+0.879
Neg logit contributions
 WIN-0.851
annabin-0.815
 backward-0.788
itably-0.775
oyd-0.768
Token vulnerable
Token ID8826
Feature activation+4.47e-01

Pos logit contributions
irement+0.023
akura+0.023
ashi+0.022
 Turing+0.022
Mother+0.022
Neg logit contributions
 WIN-0.025
annabin-0.024
 backward-0.023
itably-0.023
oyd-0.023
Token,
Token ID11
Feature activation+2.75e-01

Pos logit contributions
irement+0.301
akura+0.298
ashi+0.288
 Turing+0.287
Mother+0.286
Neg logit contributions
 WIN-0.285
annabin-0.274
 backward-0.264
itably-0.260
oyd-0.258
Token and
Token ID290
Feature activation-1.00e-01

Pos logit contributions
irement+0.186
akura+0.185
ashi+0.179
 Turing+0.178
Mother+0.178
Neg logit contributions
 WIN-0.173
annabin-0.167
 backward-0.160
itably-0.159
oyd-0.158
Token didn
Token ID1422
Feature activation-9.45e-01

Pos logit contributions
 WIN+0.068
annabin+0.066
 backward+0.063
itably+0.063
oyd+0.062
Neg logit contributions
irement-0.063
akura-0.063
ashi-0.061
 Turing-0.060
Mother-0.060
Token't
Token ID470
Feature activation-8.21e-01

Pos logit contributions
 WIN+0.761
annabin+0.742
 backward+0.719
itably+0.711
oyd+0.711
Neg logit contributions
irement-0.475
akura-0.466
ashi-0.450
 Turing-0.449
Mother-0.448
Token have
Token ID423
Feature activation-6.43e-02

Pos logit contributions
 WIN+0.554
annabin+0.541
 backward+0.516
itably+0.516
oyd+0.514
Neg logit contributions
irement-0.517
akura-0.513
ashi-0.495
 Turing-0.492
mother-0.488
Token anywhere
Token ID6609
Feature activation+4.75e-01

Pos logit contributions
 WIN+0.041
annabin+0.040
 backward+0.038
itably+0.038
oyd+0.038
Neg logit contributions
irement-0.043
akura-0.043
ashi-0.041
 Turing-0.041
Mother-0.041
Token can
Token ID460
Feature activation-7.40e-01

Pos logit contributions
 WIN+0.115
annabin+0.110
 backward+0.106
itably+0.105
oyd+0.104
Neg logit contributions
irement-0.137
akura-0.136
ashi-0.132
 Turing-0.131
Mother-0.130
Token find
Token ID1064
Feature activation-5.56e-01

Pos logit contributions
 WIN+0.490
annabin+0.473
 backward+0.456
itably+0.451
oyd+0.448
Neg logit contributions
irement-0.480
akura-0.475
ashi-0.460
 Turing-0.456
Mother-0.453
Token a
Token ID257
Feature activation-3.79e-01

Pos logit contributions
 WIN+0.345
annabin+0.335
 backward+0.318
itably+0.316
oyd+0.315
Neg logit contributions
irement-0.380
akura-0.378
ashi-0.367
 Turing-0.365
Mother-0.363
Token static
Token ID9037
Feature activation-1.21e+00

Pos logit contributions
 WIN+0.234
annabin+0.229
itably+0.218
oyd+0.216
 backward+0.215
Neg logit contributions
irement-0.259
akura-0.256
 Turing-0.250
ashi-0.248
Mother-0.247
Token version
Token ID2196
Feature activation+3.12e-01

Pos logit contributions
 WIN+0.870
annabin+0.843
 backward+0.814
itably+0.808
oyd+0.802
Neg logit contributions
irement-0.720
akura-0.714
ashi-0.687
 Turing-0.683
Mother-0.679
Token of
Token ID286
Feature activation-8.21e-01

Pos logit contributions
irement+0.182
akura+0.181
ashi+0.174
 Turing+0.173
Mother+0.171
Neg logit contributions
 WIN-0.226
annabin-0.219
itably-0.211
 backward-0.210
oyd-0.209
Token it
Token ID340
Feature activation-1.05e+00

Pos logit contributions
 WIN+0.552
annabin+0.533
 backward+0.515
itably+0.508
oyd+0.505
Neg logit contributions
irement-0.523
akura-0.519
ashi-0.500
 Turing-0.497
Mother-0.495
Token here
Token ID994
Feature activation-8.19e-01

Pos logit contributions
 WIN+0.795
annabin+0.768
 backward+0.746
itably+0.738
oyd+0.732
Neg logit contributions
irement-0.586
akura-0.581
ashi-0.556
 Turing-0.552
Mother-0.549
Token.
Token ID13
Feature activation-1.04e+00

Pos logit contributions
 WIN+0.484
annabin+0.464
 backward+0.448
itably+0.440
oyd+0.436
Neg logit contributions
irement-0.590
akura-0.586
ashi-0.567
 Turing-0.564
Mother-0.562
Token
Token ID198
Feature activation-1.24e+00

Pos logit contributions
 WIN+0.697
annabin+0.671
 backward+0.651
itably+0.641
oyd+0.635
Neg logit contributions
irement-0.670
akura-0.665
ashi-0.641
 Turing-0.638
Mother-0.636
Token
Token ID198
Feature activation-1.50e+00

Pos logit contributions
 WIN+0.820
annabin+0.792
 backward+0.767
itably+0.756
oyd+0.751
Neg logit contributions
irement-0.808
akura-0.802
Mother-0.776
ashi-0.774
 Turing-0.771
Token than
Token ID621
Feature activation-4.02e-01

Pos logit contributions
 WIN+0.261
annabin+0.251
 backward+0.242
itably+0.239
oyd+0.237
Neg logit contributions
irement-0.276
akura-0.275
ashi-0.265
 Turing-0.263
Mother-0.262
Token
Token ID564
Feature activation+7.90e-01

Pos logit contributions
 WIN+0.264
annabin+0.256
 backward+0.245
itably+0.244
oyd+0.242
Neg logit contributions
akura-0.261
irement-0.261
ashi-0.253
 Turing-0.251
Mother-0.249
Token
Token ID246
Feature activation-9.83e-01

Pos logit contributions
irement+0.402
akura+0.399
ashi+0.383
 Turing+0.377
Mother+0.377
Neg logit contributions
 WIN-0.632
annabin-0.616
 backward-0.593
itably-0.592
oyd-0.587
TokenFear
Token ID37798
Feature activation-3.02e-01

Pos logit contributions
 WIN+0.652
annabin+0.626
 backward+0.607
itably+0.598
oyd+0.593
Neg logit contributions
irement-0.637
akura-0.633
ashi-0.611
Mother-0.606
 Turing-0.606
Token of
Token ID286
Feature activation-3.39e+00

Pos logit contributions
 WIN+0.191
annabin+0.184
 backward+0.178
itably+0.176
oyd+0.174
Neg logit contributions
irement-0.204
akura-0.204
ashi-0.197
Mother-0.195
 Turing-0.195
Token the
Token ID262
Feature activation-2.97e+00

Pos logit contributions
 WIN+2.186
annabin+2.034
 backward+2.010
 GOD+1.950
itably+1.939
Neg logit contributions
irement-2.289
akura-2.253
ashi-2.167
 Turing-2.150
Mother-2.149
Token Heaven
Token ID11225
Feature activation-6.05e-01

Pos logit contributions
 WIN+2.161
annabin+2.049
 backward+2.015
oyd+1.958
itably+1.954
Neg logit contributions
irement-1.762
akura-1.731
ashi-1.655
Mother-1.632
 Turing-1.629
Tokens
Token ID82
Feature activation-1.87e+00

Pos logit contributions
 WIN+0.367
annabin+0.350
 backward+0.339
itably+0.333
oyd+0.330
Neg logit contributions
irement-0.426
akura-0.424
ashi-0.410
 Turing-0.407
Mother-0.406
Token
Token ID447
Feature activation+7.69e-01

Pos logit contributions
 WIN+1.176
annabin+1.122
 backward+1.092
itably+1.061
oyd+1.051
Neg logit contributions
irement-1.283
akura-1.269
ashi-1.224
 Turing-1.218
Mother-1.211
Token
Token ID247
Feature activation-3.94e-01

Pos logit contributions
irement+0.553
akura+0.547
ashi+0.533
Mother+0.528
mother+0.526
Neg logit contributions
 WIN-0.455
annabin-0.438
 backward-0.416
itably-0.413
oyd-0.410
Token and
Token ID290
Feature activation-1.26e+00

Pos logit contributions
 WIN+0.246
annabin+0.235
 backward+0.228
itably+0.224
oyd+0.222
Neg logit contributions
irement-0.271
akura-0.269
ashi-0.260
 Turing-0.258
Mother-0.257
Token.)
Token ID2014
Feature activation+6.28e-01

Pos logit contributions
 WIN+0.559
annabin+0.533
 backward+0.514
itably+0.504
oyd+0.499
Neg logit contributions
irement-0.706
akura-0.701
ashi-0.679
 Turing-0.674
Mother-0.673
Token
Token ID198
Feature activation+8.73e-01

Pos logit contributions
irement+0.418
akura+0.415
ashi+0.400
 Turing+0.398
Mother+0.396
Neg logit contributions
 WIN-0.406
annabin-0.389
 backward-0.378
itably-0.371
oyd-0.368
Token
Token ID198
Feature activation+6.25e-01

Pos logit contributions
irement+0.517
akura+0.509
Mother+0.505
mother+0.503
 Turing+0.500
Neg logit contributions
 WIN-0.609
annabin-0.602
 backward-0.576
oyd-0.575
itably-0.574
TokenSo
Token ID2396
Feature activation+1.68e+00

Pos logit contributions
irement+0.422
akura+0.418
Mother+0.407
ashi+0.406
mother+0.402
Neg logit contributions
 WIN-0.398
annabin-0.381
 backward-0.369
itably-0.362
oyd-0.359
Token Mark
Token ID2940
Feature activation+8.63e-01

Pos logit contributions
irement+1.054
akura+1.049
ashi+1.017
 Turing+1.011
Mother+1.003
Neg logit contributions
 WIN-1.141
annabin-1.104
 backward-1.053
itably-1.053
oyd-1.043
Token Ham
Token ID4345
Feature activation-2.42e+00

Pos logit contributions
akura+0.223
irement+0.218
ashi+0.205
 Turing+0.198
mother+0.184
Neg logit contributions
 WIN-0.917
annabin-0.883
 backward-0.879
oyd-0.859
 GOD-0.859
Tokenill
Token ID359
Feature activation-1.05e+00

Pos logit contributions
 WIN+1.379
annabin+1.317
 backward+1.272
itably+1.243
oyd+1.229
Neg logit contributions
irement-1.792
akura-1.777
ashi-1.719
 Turing-1.716
Mother-1.708
Token,
Token ID11
Feature activation-4.00e-01

Pos logit contributions
 WIN+0.676
annabin+0.647
 backward+0.628
itably+0.616
oyd+0.610
Neg logit contributions
irement-0.700
akura-0.696
ashi-0.670
 Turing-0.666
Mother-0.664
Token best
Token ID1266
Feature activation-2.86e-01

Pos logit contributions
 WIN+0.268
annabin+0.257
 backward+0.250
itably+0.245
oyd+0.243
Neg logit contributions
irement-0.257
akura-0.255
ashi-0.246
 Turing-0.244
Mother-0.243
Token known
Token ID1900
Feature activation+1.02e+00

Pos logit contributions
 WIN+0.175
annabin+0.168
 backward+0.162
itably+0.160
oyd+0.159
Neg logit contributions
irement-0.200
akura-0.198
ashi-0.192
 Turing-0.191
Mother-0.190
Token for
Token ID329
Feature activation+2.11e+00

Pos logit contributions
irement+0.693
akura+0.687
ashi+0.665
 Turing+0.660
Mother+0.658
Neg logit contributions
 WIN-0.644
annabin-0.617
 backward-0.597
itably-0.587
oyd-0.582
Tokenly
Token ID306
Feature activation-1.68e+00

Pos logit contributions
 WIN+0.825
annabin+0.790
 backward+0.766
itably+0.749
oyd+0.747
Neg logit contributions
irement-0.859
akura-0.851
ashi-0.825
 Turing-0.817
Mother-0.814
Token.
Token ID13
Feature activation-7.21e-01

Pos logit contributions
 WIN+1.055
annabin+1.009
 backward+0.978
itably+0.960
oyd+0.951
Neg logit contributions
irement-1.150
akura-1.142
ashi-1.103
 Turing-1.096
Mother-1.093
Token
Token ID447
Feature activation-7.36e-01

Pos logit contributions
 WIN+0.477
annabin+0.457
 backward+0.444
itably+0.436
oyd+0.432
Neg logit contributions
irement-0.470
akura-0.466
ashi-0.450
 Turing-0.446
Mother-0.445
Token
Token ID251
Feature activation-1.31e+00

Pos logit contributions
 WIN+0.497
annabin+0.475
 backward+0.463
itably+0.454
oyd+0.449
Neg logit contributions
irement-0.469
akura-0.467
ashi-0.447
 Turing-0.445
Mother-0.444
Token The
Token ID383
Feature activation-1.20e-01

Pos logit contributions
 WIN+0.861
annabin+0.821
 backward+0.798
itably+0.784
oyd+0.776
Neg logit contributions
irement-0.857
akura-0.852
ashi-0.820
 Turing-0.812
Mother-0.810
Token agency
Token ID4086
Feature activation-1.55e+00

Pos logit contributions
 WIN+0.078
annabin+0.076
 backward+0.073
itably+0.072
oyd+0.071
Neg logit contributions
irement-0.079
akura-0.078
ashi-0.076
 Turing-0.075
Mother-0.075
Token also
Token ID635
Feature activation-4.58e-01

Pos logit contributions
 WIN+1.022
annabin+0.979
 backward+0.957
itably+0.936
oyd+0.926
Neg logit contributions
irement-1.008
akura-1.002
ashi-0.962
 Turing-0.958
Mother-0.954
Token would
Token ID561
Feature activation-2.42e-02

Pos logit contributions
 WIN+0.302
annabin+0.289
 backward+0.280
itably+0.276
oyd+0.273
Neg logit contributions
irement-0.300
akura-0.298
ashi-0.288
 Turing-0.285
Mother-0.284
Token be
Token ID307
Feature activation+7.22e-02

Pos logit contributions
 WIN+0.015
annabin+0.015
 backward+0.014
itably+0.014
oyd+0.014
Neg logit contributions
irement-0.016
akura-0.016
ashi-0.016
 Turing-0.015
mother-0.015
Token given
Token ID1813
Feature activation-5.73e-01

Pos logit contributions
irement+0.049
akura+0.048
ashi+0.047
 Turing+0.046
Mother+0.046
Neg logit contributions
 WIN-0.046
annabin-0.044
itably-0.042
 backward-0.042
oyd-0.042
Token the
Token ID262
Feature activation+9.01e-01

Pos logit contributions
 WIN+0.344
annabin+0.331
 backward+0.316
itably+0.313
oyd+0.311
Neg logit contributions
irement-0.406
akura-0.403
ashi-0.391
 Turing-0.389
Mother-0.386
Token RC
Token ID13987
Feature activation+3.54e-01

Pos logit contributions
irement+0.090
akura+0.089
 Turing+0.087
ashi+0.086
Mother+0.086
Neg logit contributions
 WIN-0.090
annabin-0.087
itably-0.084
 backward-0.083
oyd-0.083
Token4
Token ID19
Feature activation+1.27e+00

Pos logit contributions
irement+0.203
akura+0.201
ashi+0.193
 Turing+0.192
Mother+0.190
Neg logit contributions
 WIN-0.260
annabin-0.252
 backward-0.244
itably-0.243
oyd-0.239
Token key
Token ID1994
Feature activation+5.00e-01

Pos logit contributions
irement+0.821
akura+0.811
 Turing+0.788
ashi+0.781
Mother+0.775
Neg logit contributions
 WIN-0.831
annabin-0.812
itably-0.777
 backward-0.773
oyd-0.765
Tokenstream
Token ID5532
Feature activation-1.34e+00

Pos logit contributions
irement+0.351
akura+0.347
ashi+0.337
 Turing+0.335
mother+0.331
Neg logit contributions
 WIN-0.300
annabin-0.292
 backward-0.276
itably-0.276
oyd-0.275
Token X
Token ID1395
Feature activation+1.99e+00

Pos logit contributions
 WIN+0.867
annabin+0.832
 backward+0.806
itably+0.793
oyd+0.786
Neg logit contributions
irement-0.892
akura-0.885
ashi-0.854
 Turing-0.849
Mother-0.845
TokenOR
Token ID1581
Feature activation-1.07e+00

Pos logit contributions
irement+1.303
akura+1.300
ashi+1.254
 Turing+1.242
Mother+1.237
Neg logit contributions
 WIN-1.273
annabin-1.237
 backward-1.185
itably-1.184
oyd-1.163
Tokened
Token ID276
Feature activation+1.02e+00

Pos logit contributions
 WIN+0.705
annabin+0.679
 backward+0.655
itably+0.648
oyd+0.640
Neg logit contributions
irement-0.702
akura-0.695
ashi-0.671
 Turing-0.669
Mother-0.664
Token with
Token ID351
Feature activation-1.18e+00

Pos logit contributions
irement+0.680
akura+0.669
ashi+0.649
 Turing+0.647
Mother+0.641
Neg logit contributions
 WIN-0.644
annabin-0.628
itably-0.596
 backward-0.595
oyd-0.591
Token plain
Token ID8631
Feature activation-1.06e+00

Pos logit contributions
 WIN+0.751
annabin+0.727
 backward+0.699
itably+0.692
oyd+0.686
Neg logit contributions
irement-0.790
akura-0.784
ashi-0.757
 Turing-0.755
Mother-0.749
Tokentext
Token ID5239
Feature activation-2.97e-02

Pos logit contributions
 WIN+0.766
annabin+0.740
 backward+0.719
itably+0.709
oyd+0.703
Neg logit contributions
irement-0.617
akura-0.612
ashi-0.588
 Turing-0.583
Mother-0.580
Token
Token ID198
Feature activation+2.27e-01

Pos logit contributions
 WIN+0.019
annabin+0.018
 backward+0.018
itably+0.018
oyd+0.017
Neg logit contributions
irement-0.020
akura-0.020
 Turing-0.019
ashi-0.019
Mother-0.019
Tokenels
Token ID1424
Feature activation-6.04e-01

Pos logit contributions
 WIN+0.019
annabin+0.019
 backward+0.018
itably+0.018
oyd+0.018
Neg logit contributions
irement-0.020
akura-0.020
ashi-0.019
 Turing-0.019
Mother-0.019
Token bear
Token ID6842
Feature activation-2.09e+00

Pos logit contributions
 WIN+0.388
annabin+0.375
 backward+0.358
itably+0.355
oyd+0.352
Neg logit contributions
irement-0.403
akura-0.401
ashi-0.388
Mother-0.384
 Turing-0.384
Token,
Token ID11
Feature activation-7.33e-01

Pos logit contributions
 WIN+1.308
annabin+1.244
 backward+1.219
itably+1.190
oyd+1.177
Neg logit contributions
irement-1.435
akura-1.422
ashi-1.372
 Turing-1.364
Mother-1.357
Token
Token ID198
Feature activation-3.21e+00

Pos logit contributions
 WIN+0.448
annabin+0.421
 backward+0.416
itably+0.406
oyd+0.401
Neg logit contributions
irement-0.514
akura-0.509
ashi-0.494
 Turing-0.489
Mother-0.485
Token
Token ID198
Feature activation-2.74e+00

Pos logit contributions
 WIN+2.051
annabin+1.955
 backward+1.902
itably+1.863
oyd+1.844
Neg logit contributions
irement-2.171
akura-2.158
ashi-2.081
 Turing-2.061
Mother-2.053
TokenTo
Token ID2514
Feature activation-4.09e+00

Pos logit contributions
 WIN+1.687
annabin+1.609
 backward+1.562
itably+1.533
oyd+1.516
Neg logit contributions
irement-1.911
akura-1.896
ashi-1.833
 Turing-1.819
mother-1.793
Token grunt
Token ID45836
Feature activation+1.16e+00

Pos logit contributions
 WIN+2.649
annabin+2.507
 backward+2.482
itably+2.400
oyd+2.379
Neg logit contributions
irement-2.715
akura-2.690
ashi-2.589
 Turing-2.567
Mother-2.550
Token and
Token ID290
Feature activation-3.44e-01

Pos logit contributions
irement+0.777
akura+0.769
ashi+0.748
Mother+0.745
mother+0.743
Neg logit contributions
 WIN-0.736
annabin-0.715
itably-0.673
 backward-0.670
oyd-0.669
Token sweat
Token ID15488
Feature activation+2.68e-01

Pos logit contributions
 WIN+0.221
annabin+0.213
 backward+0.204
itably+0.202
oyd+0.201
Neg logit contributions
irement-0.230
akura-0.228
ashi-0.221
 Turing-0.219
Mother-0.218
Token under
Token ID739
Feature activation-5.35e-01

Pos logit contributions
irement+0.179
akura+0.178
ashi+0.173
Mother+0.172
mother+0.172
Neg logit contributions
 WIN-0.170
annabin-0.165
itably-0.155
 backward-0.155
oyd-0.155
Token a
Token ID257
Feature activation+2.61e-01

Pos logit contributions
 WIN+0.342
annabin+0.329
 backward+0.316
itably+0.312
oyd+0.310
Neg logit contributions
irement-0.359
akura-0.357
ashi-0.346
 Turing-0.343
Mother-0.342
Token."
Token ID526
Feature activation+5.40e-01

Pos logit contributions
 WIN+0.206
annabin+0.199
 backward+0.191
itably+0.190
oyd+0.187
Neg logit contributions
irement-0.212
akura-0.210
ashi-0.204
 Turing-0.202
Mother-0.201
Token And
Token ID843
Feature activation-1.94e+00

Pos logit contributions
irement+0.321
akura+0.318
 Turing+0.311
Mother+0.307
ashi+0.307
Neg logit contributions
 WIN-0.378
annabin-0.374
 backward-0.359
itably-0.355
oyd-0.353
Token we
Token ID356
Feature activation-1.51e+00

Pos logit contributions
 WIN+1.359
annabin+1.310
 backward+1.270
itably+1.252
oyd+1.241
Neg logit contributions
irement-1.178
akura-1.169
ashi-1.127
 Turing-1.118
Mother-1.114
Token went
Token ID1816
Feature activation-1.00e+00

Pos logit contributions
 WIN+0.968
annabin+0.929
 backward+0.899
itably+0.885
oyd+0.877
Neg logit contributions
irement-1.016
akura-1.009
ashi-0.974
 Turing-0.968
Mother-0.964
Token on
Token ID319
Feature activation-2.24e+00

Pos logit contributions
 WIN+0.600
annabin+0.585
itably+0.554
 backward+0.549
oyd+0.545
Neg logit contributions
irement-0.705
akura-0.701
ashi-0.679
 Turing-0.679
Mother-0.674
Token to
Token ID284
Feature activation-4.02e+00

Pos logit contributions
 WIN+1.399
annabin+1.334
 backward+1.298
itably+1.269
oyd+1.258
Neg logit contributions
irement-1.535
akura-1.527
ashi-1.473
 Turing-1.461
Mother-1.459
Token discuss
Token ID2112
Feature activation-1.54e+00

Pos logit contributions
 WIN+2.634
annabin+2.474
 backward+2.437
itably+2.364
oyd+2.349
Neg logit contributions
irement-2.661
akura-2.645
ashi-2.536
Mother-2.510
mother-2.502
Token pattern
Token ID3912
Feature activation-2.06e-01

Pos logit contributions
 WIN+0.992
annabin+0.950
 backward+0.922
itably+0.905
oyd+0.897
Neg logit contributions
irement-1.035
akura-1.027
ashi-0.992
 Turing-0.984
Mother-0.982
Token speed
Token ID2866
Feature activation+5.12e-01

Pos logit contributions
 WIN+0.127
annabin+0.123
 backward+0.117
itably+0.116
oyd+0.115
Neg logit contributions
irement-0.143
akura-0.141
ashi-0.138
 Turing-0.137
Mother-0.135
Token (
Token ID357
Feature activation-1.24e+00

Pos logit contributions
irement+0.345
akura+0.341
ashi+0.331
 Turing+0.328
Mother+0.327
Neg logit contributions
 WIN-0.324
annabin-0.316
itably-0.301
 backward-0.299
oyd-0.296
Tokenabout
Token ID10755
Feature activation-1.91e-01

Pos logit contributions
 WIN+0.780
annabin+0.749
 backward+0.724
itably+0.712
oyd+0.705
Neg logit contributions
irement-0.842
akura-0.835
ashi-0.809
Mother-0.804
 Turing-0.801
Token and
Token ID290
Feature activation-6.45e-02

Pos logit contributions
irement+0.061
akura+0.060
ashi+0.058
 Turing+0.058
Mother+0.058
Neg logit contributions
 WIN-0.056
annabin-0.053
 backward-0.052
itably-0.051
oyd-0.050
Token the
Token ID262
Feature activation-3.88e-01

Pos logit contributions
 WIN+0.041
annabin+0.039
 backward+0.038
itably+0.037
oyd+0.037
Neg logit contributions
irement-0.044
akura-0.043
ashi-0.042
 Turing-0.042
Mother-0.042
Token upcoming
Token ID7865
Feature activation+2.84e-01

Pos logit contributions
 WIN+0.245
annabin+0.238
 backward+0.228
itably+0.226
oyd+0.224
Neg logit contributions
irement-0.261
akura-0.261
ashi-0.252
 Turing-0.251
Mother-0.248
Token "
Token ID366
Feature activation+3.72e-01

Pos logit contributions
irement+0.188
akura+0.186
ashi+0.181
 Turing+0.180
Mother+0.178
Neg logit contributions
 WIN-0.182
annabin-0.177
 backward-0.170
itably-0.168
oyd-0.167
TokenHi
Token ID17250
Feature activation-1.95e+00

Pos logit contributions
irement+0.239
akura+0.238
Mother+0.230
ashi+0.229
 Turing+0.228
Neg logit contributions
 WIN-0.247
annabin-0.238
 backward-0.231
itably-0.227
oyd-0.226
Token Hi
Token ID15902
Feature activation-4.48e+00

Pos logit contributions
 WIN+1.221
annabin+1.163
 backward+1.130
itably+1.108
oyd+1.100
Neg logit contributions
irement-1.338
akura-1.328
ashi-1.278
 Turing-1.274
Mother-1.270
Token P
Token ID350
Feature activation-1.13e-01

Pos logit contributions
 WIN+2.939
annabin+2.756
 backward+2.712
itably+2.640
 GOD+2.639
Neg logit contributions
irement-2.920
akura-2.872
Mother-2.762
ashi-2.759
mother-2.746
Tokenuffy
Token ID15352
Feature activation-1.06e+00

Pos logit contributions
 WIN+0.073
annabin+0.070
 backward+0.068
itably+0.067
oyd+0.066
Neg logit contributions
irement-0.076
akura-0.075
ashi-0.073
Mother-0.072
 Turing-0.072
Token Am
Token ID1703
Feature activation+1.52e+00

Pos logit contributions
 WIN+0.670
annabin+0.643
 backward+0.624
itably+0.613
oyd+0.605
Neg logit contributions
irement-0.720
akura-0.716
ashi-0.691
 Turing-0.686
Mother-0.685
Tokeni
Token ID72
Feature activation-2.92e+00

Pos logit contributions
akura+1.006
irement+1.000
ashi+0.971
mother+0.944
Mother+0.943
Neg logit contributions
 WIN-0.996
annabin-0.957
 backward-0.932
itably-0.909
oyd-0.899
Token Y
Token ID575
Feature activation+8.32e-02

Pos logit contributions
 WIN+2.442
annabin+2.361
 backward+2.312
itably+2.279
oyd+2.276
Neg logit contributions
irement-1.376
akura-1.360
 Turing-1.279
Mother-1.277
ashi-1.268
Token bad
Token ID2089
Feature activation-5.09e-01

Pos logit contributions
irement+0.209
akura+0.208
ashi+0.203
 Turing+0.201
Mother+0.200
Neg logit contributions
 WIN-0.190
annabin-0.185
 backward-0.176
itably-0.175
oyd-0.174
Token news
Token ID1705
Feature activation-3.46e-01

Pos logit contributions
 WIN+0.330
annabin+0.317
 backward+0.306
itably+0.301
oyd+0.299
Neg logit contributions
irement-0.338
akura-0.336
ashi-0.324
 Turing-0.322
Mother-0.321
Token pl
Token ID458
Feature activation+4.90e-01

Pos logit contributions
 WIN+0.222
annabin+0.214
 backward+0.206
itably+0.204
oyd+0.203
Neg logit contributions
irement-0.231
akura-0.229
ashi-0.222
 Turing-0.220
Mother-0.219
Tokens
Token ID82
Feature activation-9.52e-01

Pos logit contributions
akura+0.297
irement+0.297
ashi+0.287
Mother+0.280
mother+0.280
Neg logit contributions
 WIN-0.344
annabin-0.332
 backward-0.322
itably-0.318
oyd-0.312
Token.
Token ID13
Feature activation-6.73e-01

Pos logit contributions
 WIN+0.598
annabin+0.576
 backward+0.556
itably+0.547
oyd+0.542
Neg logit contributions
irement-0.648
akura-0.644
ashi-0.624
 Turing-0.619
Mother-0.617
Token Good
Token ID4599
Feature activation-5.12e+00

Pos logit contributions
 WIN+0.425
annabin+0.412
 backward+0.396
itably+0.392
oyd+0.388
Neg logit contributions
irement-0.453
akura-0.452
ashi-0.437
 Turing-0.434
Mother-0.432
Token updates
Token ID5992
Feature activation-3.36e+00

Pos logit contributions
 WIN+3.545
annabin+3.377
 backward+3.348
itably+3.229
oyd+3.199
Neg logit contributions
irement-3.167
akura-3.164
ashi-3.020
 Turing-2.999
Mother-2.982
Token?
Token ID30
Feature activation-1.11e+00

Pos logit contributions
 WIN+2.132
annabin+2.041
 backward+1.979
itably+1.943
oyd+1.925
Neg logit contributions
irement-2.283
akura-2.265
ashi-2.188
 Turing-2.173
Mother-2.167
Token No
Token ID1400
Feature activation-8.86e-01

Pos logit contributions
 WIN+0.834
annabin+0.806
 backward+0.785
itably+0.774
oyd+0.768
Neg logit contributions
irement-0.626
akura-0.620
ashi-0.595
 Turing-0.592
Mother-0.589
Token more
Token ID517
Feature activation-7.29e-01

Pos logit contributions
 WIN+0.515
annabin+0.500
itably+0.471
oyd+0.470
 backward+0.466
Neg logit contributions
irement-0.641
akura-0.634
ashi-0.625
 Turing-0.621
mother-0.615
Token bad
Token ID2089
Feature activation-8.71e-01

Pos logit contributions
 WIN+0.559
annabin+0.555
oyd+0.528
itably+0.525
ospace+0.519
Neg logit contributions
irement-0.391
akura-0.379
ashi-0.375
 Turing-0.373
mother-0.370
Token December
Token ID3426
Feature activation-4.22e+00

Pos logit contributions
 WIN+1.002
annabin+0.952
 backward+0.938
itably+0.907
oyd+0.902
Neg logit contributions
irement-0.892
akura-0.891
ashi-0.854
Mother-0.844
 Turing-0.840
Token 5
Token ID642
Feature activation-4.35e+00

Pos logit contributions
 WIN+2.669
annabin+2.544
 backward+2.467
itably+2.420
oyd+2.396
Neg logit contributions
irement-2.883
akura-2.862
ashi-2.760
Mother-2.731
 Turing-2.730
Token,
Token ID11
Feature activation-2.87e+00

Pos logit contributions
 WIN+2.676
annabin+2.526
 backward+2.465
itably+2.399
oyd+2.376
Neg logit contributions
irement-3.042
akura-3.034
ashi-2.912
 Turing-2.884
Mother-2.851
Token 2009
Token ID3717
Feature activation-4.41e+00

Pos logit contributions
 WIN+1.682
annabin+1.598
 backward+1.552
itably+1.516
oyd+1.499
Neg logit contributions
irement-2.085
akura-2.071
ashi-2.002
 Turing-1.982
Mother-1.982
TokenQue
Token ID15681
Feature activation-3.00e+00

Pos logit contributions
 WIN+2.696
annabin+2.582
 backward+2.490
itably+2.446
oyd+2.431
Neg logit contributions
irement-3.078
akura-3.061
ashi-2.952
 Turing-2.934
mother-2.862
Tokeners
Token ID364
Feature activation-4.75e+00

Pos logit contributions
 WIN+2.232
annabin+2.149
 backward+2.095
itably+2.062
oyd+2.046
Neg logit contributions
irement-1.711
akura-1.696
ashi-1.627
 Turing-1.613
Mother-1.608
Token are
Token ID389
Feature activation-2.74e+00

Pos logit contributions
 WIN+2.865
annabin+2.740
 backward+2.664
itably+2.561
oyd+2.541
Neg logit contributions
irement-3.360
akura-3.339
ashi-3.228
Mother-3.190
 Turing-3.189
Token Here
Token ID3423
Feature activation-4.44e+00

Pos logit contributions
 WIN+2.022
annabin+1.938
 backward+1.912
itably+1.853
oyd+1.841
Neg logit contributions
irement-1.572
akura-1.564
ashi-1.499
Mother-1.472
mother-1.460
Token....
Token ID1106
Feature activation-3.22e+00

Pos logit contributions
 WIN+2.559
annabin+2.438
 backward+2.392
itably+2.329
oyd+2.319
Neg logit contributions
irement-3.217
akura-3.199
 Turing-3.084
ashi-3.079
Mother-3.063
Token
Token ID198
Feature activation-2.86e+00

Pos logit contributions
 WIN+1.929
annabin+1.775
 backward+1.755
 GOD+1.703
oyd+1.702
Neg logit contributions
irement-2.300
akura-2.272
ashi-2.202
Mother-2.191
mother-2.175
Token
Token ID198
Feature activation-2.27e+00

Pos logit contributions
 WIN+1.835
annabin+1.754
 backward+1.704
itably+1.672
oyd+1.656
Neg logit contributions
irement-1.917
akura-1.903
ashi-1.836
 Turing-1.822
Mother-1.817