TOP ACTIVATIONS
MAX = 1.517

о� и и
towel - lined plate . Serve hot ! <|endoftext|> Image copyright
into a pint of Ben and Jerry s .
т ь и е л ь
" vi : ", " vim :" or " ex :"
accepted year - round ; the deadline for consideration for the
deadline for consideration for the 2019 award is February 28 ,
in February 2017 Average annual value : $ 8
dead and ' Pet Sem at ary ' is real ,"
include : Uber , Ben & Jerry s and
earning his fifth ACC Rookie of the Week , first ACC
in March 2016 Average annual value : $ 3
freshman to win ACC Player of the Year .[ 117 ]
a breathing problem . Man j ula 's husband Nat v
kowski during 7 - on - 7 drills and then intercept
Sheen 2009 Alec Baldwin 2009 Steve Care ll
for his fourth ACC Rookie of the Week Award and first
in March 2016 Average annual value : $ 20
a battery operated portable AM / SW receiver with an internal
their own and do not represent the views of Town hall

MIN ACTIVATIONS
MIN = -1.244

0 s )} c xd 4 { ( 341 s )}
of a public street shall maintain any sidewalk in such condition
:// www . ac ep il ots . com / usa
\ [ + \ S + \ s + "( \
streets of the city , or parked in front of buildings
bitcoin / h ong - k ong / a - flex
frame unique to Ford F - 150 Rapt or , and
before { vers } vim ={ vers }: version { vers
, your character now receives secret voices in his or her
posted by James Harrison (@ j hh arrison 92 ) on
shows in order to win .[ 156 ] Sound
character Erin is a professor at Columbia University , where some
= [ system . windows . forms . text rend erer
. forms . text rend erer ] :: Measure Text ($
is history . <|endoftext|> About This Game Choose from
. 5 - liter Eco Boost ® engine that produces more
} or later vim < { vers }: version before {
created such music ( Ne h ring 99 ).
Photo : Erin Bre th auer , The Chronicle 11 of
the exception of season 14 ).[ 157 ] Th

INTERVAL 0.827 - 1.517
CONTAINS 0.060%

--+ | ... | + -------------------- --+ Uh oh
on the address https :// old . nice hash .
had a collection of autom at ons that was eventually thrown
Kenya African Democratic Union ( K AD U ), which was
" vim :" or " ex :" [ white ] optional

INTERVAL 0.136 - 0.827
CONTAINS 26.224%

it will panic and reboot endlessly . That means that if
) -> Own ed < T > ; } impl <
daughter . My husband was still abusive - I almost had
she te ases . I tried to hide my
: Half Shell still in full swing , it

INTERVAL -0.554 - 0.136
CONTAINS 72.860%

Studies at the Cand ler School of The ology of Em
on big projects such as the Kee y ask dam or
Lee and Laura Ling who sn uck into North Korea via
t figure any of this out . (
open enrollment period and tighten enrollment eligibility " to deter people

INTERVAL -1.244 - -0.554
CONTAINS 0.857%

s eventual blood alcohol level was below the . 08 threshold
generous 10 metre setback along Y onge Street and a 5
] B ionic Commando Rear med [/ b ][/ color ]
between £ 200 , 000 and £ 700 , 000 .
favorite favorite - July 6 , 2010 Subject :

Token
Token ID12466
Feature activation+3.33e-01

Pos logit contributions
PDATE+4.271
atism+4.060
+3.766
 mathemat+3.684
ADRA+3.668
Neg logit contributions
lly-3.390
 liqu-3.339
 Castro-3.302
bred-2.977
 Honour-2.910
Token
Token ID123
Feature activation-1.96e-01

Pos logit contributions
PDATE+2.645
atism+2.431
+2.225
ADRA+2.224
 mathemat+2.207
Neg logit contributions
 liqu-2.989
lly-2.977
 Castro-2.932
bred-2.684
 Honour-2.635
Tokenо�
Token ID25443
Feature activation+6.33e-03

Pos logit contributions
lly+1.664
 liqu+1.647
 Castro+1.627
bred+1.475
ful+1.463
Neg logit contributions
PDATE-1.688
atism-1.536
 mathemat-1.424
ADRA-1.420
-1.397
Token
Token ID111
Feature activation+9.87e-02

Pos logit contributions
PDATE+0.054
atism+0.049
 mathemat+0.046
ADRA+0.046
+0.045
Neg logit contributions
lly-0.054
 liqu-0.054
 Castro-0.053
bred-0.048
ful-0.047
Tokenи
Token ID18849
Feature activation+3.44e-01

Pos logit contributions
PDATE+0.870
atism+0.803
+0.743
ADRA+0.742
 mathemat+0.741
Neg logit contributions
lly-0.810
 liqu-0.805
 Castro-0.792
bred-0.719
 Honour-0.704
Token
Token ID140
Feature activation+1.52e+00

Pos logit contributions
PDATE+2.647
atism+2.454
+2.272
ADRA+2.231
 mathemat+2.205
Neg logit contributions
 liqu-3.105
lly-3.105
 Castro-3.050
bred-2.811
 Honour-2.757
Token
Token ID109
Feature activation+5.77e-02

Pos logit contributions
PDATE+6.119
atism+5.935
+5.412
 mathemat+4.609
+4.577
Neg logit contributions
 liqu-16.257
lly-15.799
 Castro-15.551
 Honour-14.932
bred-14.753
Token
Token ID141
Feature activation-4.81e-01

Pos logit contributions
PDATE+0.526
atism+0.487
ADRA+0.450
+0.449
 mathemat+0.449
Neg logit contributions
lly-0.457
 liqu-0.452
 Castro-0.447
bred-0.404
 Honour-0.396
Token
Token ID230
Feature activation-3.96e-01

Pos logit contributions
lly+3.859
 Castro+3.835
 liqu+3.809
ful+3.399
bred+3.365
Neg logit contributions
PDATE-4.454
atism-3.922
 mathemat-3.816
 volunte-3.660
ADRA-3.613
Tokenи
Token ID18849
Feature activation-3.74e-01

Pos logit contributions
lly+2.992
 liqu+2.941
 Castro+2.936
bred+2.596
ful+2.587
Neg logit contributions
PDATE-3.846
atism-3.483
 mathemat-3.386
 volunte-3.243
ADRA-3.239
Token
Token ID141
Feature activation+1.02e-02

Pos logit contributions
lly+2.789
 liqu+2.759
 Castro+2.720
bred+2.418
ful+2.395
Neg logit contributions
PDATE-3.611
atism-3.293
 mathemat-3.115
ADRA-3.103
-3.062
Token towel
Token ID24808
Feature activation-2.84e-01

Pos logit contributions
PDATE+1.624
atism+1.559
+1.189
 mathemat+0.935
ADRA+0.909
Neg logit contributions
lly-8.976
 liqu-8.976
 Castro-8.941
bred-8.435
 Honour-8.382
Token-
Token ID12
Feature activation-1.56e-01

Pos logit contributions
lly+1.549
 liqu+1.528
 Castro+1.494
bred+1.277
ful+1.255
Neg logit contributions
PDATE-3.352
atism-3.123
ADRA-2.984
 mathemat-2.975
-2.923
Tokenlined
Token ID10837
Feature activation+1.49e-01

Pos logit contributions
lly+0.944
 liqu+0.931
 Castro+0.915
bred+0.797
ful+0.777
Neg logit contributions
PDATE-1.719
atism-1.606
 mathemat-1.514
ADRA-1.513
-1.505
Token plate
Token ID7480
Feature activation+9.23e-02

Pos logit contributions
PDATE+0.769
atism+0.725
+0.632
 mathemat+0.602
ADRA+0.599
Neg logit contributions
lly-1.680
 liqu-1.667
 Castro-1.649
bred-1.546
 Honour-1.527
Token.
Token ID13
Feature activation-1.90e-01

Pos logit contributions
PDATE+0.912
atism+0.877
+0.830
ADRA+0.801
Buff+0.793
Neg logit contributions
lly-0.611
 liqu-0.610
 Castro-0.592
bred-0.524
 Honour-0.508
Token Serve
Token ID35557
Feature activation+1.48e+00

Pos logit contributions
lly+1.763
 liqu+1.748
 Castro+1.729
bred+1.584
ful+1.560
Neg logit contributions
PDATE-1.477
atism-1.341
 mathemat-1.227
ADRA-1.226
-1.217
Token hot
Token ID3024
Feature activation+9.05e-01

Pos logit contributions
atism+4.352
+4.043
Buff+3.708
usha+3.537
PDATE+3.245
Neg logit contributions
lly-16.536
 Castro-15.730
 Honour-15.469
 liqu-15.362
vc-15.161
Token!
Token ID0
Feature activation-3.42e-02

Pos logit contributions
atism+6.020
PDATE+5.875
+5.626
Buff+5.423
ESCO+5.400
Neg logit contributions
 liqu-7.495
 Castro-7.394
lly-7.346
bred-6.847
 Chin-6.802
Token<|endoftext|>
Token ID50256
Feature activation+1.11e-01

Pos logit contributions
lly+0.262
 liqu+0.259
 Castro+0.256
bred+0.230
ful+0.225
Neg logit contributions
PDATE-0.323
atism-0.297
 mathemat-0.277
-0.276
ADRA-0.276
TokenImage
Token ID5159
Feature activation+6.65e-02

Pos logit contributions
PDATE+0.926
atism+0.864
ADRA+0.798
+0.798
 mathemat+0.783
Neg logit contributions
lly-0.948
 liqu-0.948
 Castro-0.934
bred-0.840
 Honour-0.831
Token copyright
Token ID6634
Feature activation+6.48e-02

Pos logit contributions
PDATE+0.545
atism+0.498
 mathemat+0.458
ADRA+0.457
+0.455
Neg logit contributions
lly-0.590
 liqu-0.585
 Castro-0.578
bred-0.528
ful-0.518
Token into
Token ID656
Feature activation+5.80e-02

Pos logit contributions
PDATE+1.761
atism+1.646
+1.537
ADRA+1.534
 mathemat+1.532
Neg logit contributions
lly-1.143
 liqu-1.134
 Castro-1.119
bred-0.987
 Honour-0.964
Token a
Token ID257
Feature activation+8.53e-02

Pos logit contributions
PDATE+0.542
atism+0.501
ADRA+0.466
 mathemat+0.465
+0.464
Neg logit contributions
lly-0.446
 liqu-0.440
 Castro-0.435
bred-0.391
ful-0.383
Token pint
Token ID35245
Feature activation+1.76e-01

Pos logit contributions
PDATE+0.802
atism+0.746
ADRA+0.695
 mathemat+0.694
+0.690
Neg logit contributions
lly-0.646
 liqu-0.635
 Castro-0.628
bred-0.566
 Honour-0.555
Token of
Token ID286
Feature activation-7.95e-02

Pos logit contributions
PDATE+1.745
atism+1.637
+1.524
ADRA+1.522
 mathemat+1.512
Neg logit contributions
lly-1.234
 liqu-1.218
 Castro-1.213
bred-1.073
 Honour-1.053
Token Ben
Token ID3932
Feature activation-1.47e-01

Pos logit contributions
lly+0.610
 liqu+0.604
 Castro+0.596
bred+0.535
ful+0.525
Neg logit contributions
PDATE-0.751
atism-0.693
 mathemat-0.645
ADRA-0.645
-0.641
Token and
Token ID290
Feature activation+1.45e+00

Pos logit contributions
lly+1.086
 liqu+1.076
 Castro+1.068
bred+0.949
ful+0.936
Neg logit contributions
PDATE-1.442
atism-1.317
ADRA-1.247
 mathemat-1.243
-1.219
Token Jerry
Token ID13075
Feature activation+1.05e-01

Pos logit contributions
PDATE+10.817
atism+10.273
 mathemat+9.940
+9.777
+9.751
Neg logit contributions
lly-10.284
 liqu-10.222
 Castro-9.665
bred-9.418
fully-9.220
Token
Token ID447
Feature activation+3.75e-02

Pos logit contributions
PDATE+1.032
atism+0.960
ADRA+0.895
 mathemat+0.894
+0.894
Neg logit contributions
lly-0.762
 liqu-0.750
 Castro-0.740
bred-0.663
 Honour-0.646
Token
Token ID247
Feature activation+9.45e-02

Pos logit contributions
PDATE+0.185
atism+0.157
ADRA+0.135
 mathemat+0.135
+0.133
Neg logit contributions
lly-0.457
 liqu-0.454
 Castro-0.450
bred-0.421
ful-0.416
Tokens
Token ID82
Feature activation-5.55e-02

Pos logit contributions
PDATE+0.977
atism+0.912
ADRA+0.853
 mathemat+0.850
+0.850
Neg logit contributions
lly-0.633
 liqu-0.625
 Castro-0.617
bred-0.543
 Honour-0.532
Token.
Token ID13
Feature activation-4.42e-01

Pos logit contributions
lly+0.380
 liqu+0.376
 Castro+0.370
bred+0.327
ful+0.321
Neg logit contributions
PDATE-0.571
atism-0.530
 mathemat-0.497
ADRA-0.497
-0.494
Tokenт
Token ID20375
Feature activation+2.01e-01

Pos logit contributions
PDATE+0.675
atism+0.622
 mathemat+0.574
ADRA+0.574
+0.572
Neg logit contributions
lly-0.632
 liqu-0.626
 Castro-0.619
bred-0.561
 Honour-0.549
Tokenь
Token ID45367
Feature activation+6.44e-02

Pos logit contributions
PDATE+1.743
atism+1.640
+1.502
 mathemat+1.500
ADRA+1.488
Neg logit contributions
lly-1.652
 liqu-1.628
 Castro-1.612
bred-1.474
 Honour-1.436
Token
Token ID12466
Feature activation+3.90e-01

Pos logit contributions
PDATE+0.650
atism+0.608
 mathemat+0.567
+0.567
ADRA+0.566
Neg logit contributions
lly-0.447
 liqu-0.440
 Castro-0.434
bred-0.388
 Honour-0.375
Token
Token ID111
Feature activation+1.80e-01

Pos logit contributions
PDATE+3.038
atism+2.804
+2.562
ADRA+2.555
 mathemat+2.539
Neg logit contributions
 liqu-3.532
lly-3.518
 Castro-3.464
bred-3.179
 Honour-3.115
Tokenи
Token ID18849
Feature activation+2.57e-01

Pos logit contributions
PDATE+1.561
atism+1.439
ADRA+1.326
+1.325
 mathemat+1.324
Neg logit contributions
lly-1.493
 liqu-1.491
 Castro-1.469
bred-1.341
 Honour-1.308
Token
Token ID140
Feature activation+1.44e+00

Pos logit contributions
PDATE+2.088
atism+1.927
+1.775
 mathemat+1.756
ADRA+1.755
Neg logit contributions
 liqu-2.237
lly-2.235
 Castro-2.208
bred-2.020
 Honour-1.984
Token
Token ID109
Feature activation+8.06e-02

Pos logit contributions
PDATE+6.987
atism+6.558
+6.072
 mathemat+5.433
+5.221
Neg logit contributions
 liqu-14.713
lly-14.397
 Castro-14.182
 Honour-13.420
bred-13.400
Tokenе
Token ID16843
Feature activation+1.01e-02

Pos logit contributions
PDATE+0.693
atism+0.638
ADRA+0.589
+0.588
 mathemat+0.587
Neg logit contributions
lly-0.679
 liqu-0.674
 Castro-0.666
bred-0.605
 Honour-0.592
Tokenл
Token ID30143
Feature activation-2.55e-01

Pos logit contributions
PDATE+0.057
atism+0.049
 mathemat+0.043
ADRA+0.043
+0.043
Neg logit contributions
lly-0.115
 liqu-0.114
 Castro-0.113
bred-0.106
 Honour-0.104
Tokenь
Token ID45367
Feature activation+2.90e-01

Pos logit contributions
lly+2.246
 liqu+2.220
 Castro+2.188
bred+1.987
ful+1.977
Neg logit contributions
PDATE-2.117
atism-1.929
 mathemat-1.784
ADRA-1.776
-1.762
Token
Token ID12466
Feature activation+3.81e-01

Pos logit contributions
PDATE+2.934
atism+2.787
+2.591
ADRA+2.584
 mathemat+2.580
Neg logit contributions
lly-1.942
 liqu-1.921
 Castro-1.882
bred-1.684
 Honour-1.619
Token "
Token ID366
Feature activation-3.22e-01

Pos logit contributions
lly+2.964
 liqu+2.958
 Castro+2.855
ful+2.439
bred+2.418
Neg logit contributions
PDATE-5.729
atism-5.342
ADRA-5.121
 mathemat-5.077
-5.056
Tokenvi
Token ID8903
Feature activation+8.34e-01

Pos logit contributions
lly+1.780
 liqu+1.738
 Castro+1.688
bred+1.474
ful+1.446
Neg logit contributions
PDATE-3.739
atism-3.486
ADRA-3.325
 mathemat-3.317
-3.278
Token:
Token ID25
Feature activation-1.33e-01

Pos logit contributions
PDATE+6.340
atism+6.093
+5.644
 mathemat+5.397
ADRA+5.303
Neg logit contributions
 liqu-6.775
lly-6.748
 Castro-6.710
 Honour-6.086
bred-6.056
Token",
Token ID1600
Feature activation-2.09e-01

Pos logit contributions
lly+0.833
 liqu+0.822
 Castro+0.809
bred+0.708
ful+0.691
Neg logit contributions
PDATE-1.443
atism-1.346
 mathemat-1.267
ADRA-1.267
-1.260
Token "
Token ID366
Feature activation-2.82e-01

Pos logit contributions
lly+1.274
 liqu+1.261
 Castro+1.237
bred+1.074
ful+1.056
Neg logit contributions
PDATE-2.335
atism-2.151
 mathemat-2.059
ADRA-2.048
-2.010
Tokenvim
Token ID31124
Feature activation+1.43e+00

Pos logit contributions
lly+1.426
 liqu+1.382
 Castro+1.345
bred+1.160
ful+1.130
Neg logit contributions
PDATE-3.462
atism-3.201
 mathemat-3.081
ADRA-3.066
-3.013
Token:"
Token ID11097
Feature activation+6.44e-02

Pos logit contributions
PDATE+7.271
atism+6.583
+6.082
 mathemat+5.721
 volunte+5.369
Neg logit contributions
 liqu-14.106
 Castro-14.044
lly-13.853
 Honour-13.154
chin-13.093
Token or
Token ID393
Feature activation-3.33e-01

Pos logit contributions
PDATE+0.594
atism+0.556
+0.513
ADRA+0.511
 mathemat+0.509
Neg logit contributions
lly-0.497
 liqu-0.490
 Castro-0.486
bred-0.437
 Honour-0.426
Token "
Token ID366
Feature activation+7.45e-02

Pos logit contributions
lly+1.979
 liqu+1.951
 Castro+1.915
bred+1.656
ful+1.638
Neg logit contributions
PDATE-3.812
atism-3.480
 mathemat-3.367
ADRA-3.336
-3.253
Tokenex
Token ID1069
Feature activation+1.01e+00

Pos logit contributions
PDATE+0.644
atism+0.609
+0.561
ADRA+0.551
 mathemat+0.548
Neg logit contributions
lly-0.604
 liqu-0.601
 Castro-0.597
bred-0.532
 Honour-0.531
Token:"
Token ID11097
Feature activation-4.00e-02

Pos logit contributions
atism+5.425
PDATE+5.204
+4.933
usha+4.415
 mathemat+4.253
Neg logit contributions
 liqu-9.937
 Castro-9.920
lly-9.681
 Chin-9.152
 Honour-9.121
Token accepted
Token ID6292
Feature activation+7.90e-01

Pos logit contributions
lly+2.080
 liqu+2.069
 Castro+2.039
bred+1.817
ful+1.779
Neg logit contributions
PDATE-2.908
atism-2.677
ADRA-2.517
 mathemat-2.516
-2.510
Token year
Token ID614
Feature activation+4.89e-01

Pos logit contributions
PDATE+5.425
atism+4.911
 mathemat+4.389
ADRA+4.372
 volunte+4.322
Neg logit contributions
lly-7.209
 Castro-7.144
 liqu-7.116
 Honour-6.558
chin-6.548
Token-
Token ID12
Feature activation+6.33e-01

Pos logit contributions
atism+4.507
PDATE+4.461
+4.218
usha+4.019
Buff+3.974
Neg logit contributions
 Castro-3.167
 liqu-3.143
lly-3.102
 Honour-2.754
bred-2.689
Tokenround
Token ID744
Feature activation+8.71e-02

Pos logit contributions
atism+5.727
+5.274
PDATE+5.187
Buff+5.012
usha+4.980
Neg logit contributions
 Castro-4.413
 liqu-4.398
lly-4.089
 Chin-3.888
 Honour-3.830
Token;
Token ID26
Feature activation+5.34e-01

Pos logit contributions
PDATE+0.889
atism+0.823
 mathemat+0.771
ADRA+0.769
+0.767
Neg logit contributions
lly-0.596
 liqu-0.591
 Castro-0.582
bred-0.513
ful-0.501
Token the
Token ID262
Feature activation+1.37e+00

Pos logit contributions
PDATE+4.021
atism+3.712
ADRA+3.403
+3.354
 mathemat+3.262
Neg logit contributions
lly-4.803
 liqu-4.716
 Castro-4.683
bred-4.297
chin-4.226
Token deadline
Token ID12508
Feature activation+3.05e-02

Pos logit contributions
atism+7.675
PDATE+7.626
ESA+6.964
+6.881
ADRA+6.859
Neg logit contributions
lly-11.446
 liqu-11.084
 Castro-11.038
chin-10.736
bred-10.502
Token for
Token ID329
Feature activation-1.42e-01

Pos logit contributions
PDATE+0.278
atism+0.262
+0.243
ADRA+0.237
 mathemat+0.236
Neg logit contributions
lly-0.234
 liqu-0.233
 Castro-0.228
bred-0.206
 Honour-0.202
Token consideration
Token ID9110
Feature activation-2.66e-02

Pos logit contributions
lly+0.807
 liqu+0.796
 Castro+0.781
bred+0.672
ful+0.657
Neg logit contributions
PDATE-1.634
atism-1.527
 mathemat-1.447
ADRA-1.445
-1.435
Token for
Token ID329
Feature activation+6.19e-01

Pos logit contributions
lly+0.200
 liqu+0.198
 Castro+0.195
bred+0.175
 Honour+0.171
Neg logit contributions
PDATE-0.249
atism-0.234
-0.216
ADRA-0.214
 mathemat-0.214
Token the
Token ID262
Feature activation+4.00e-01

Pos logit contributions
PDATE+4.297
atism+4.143
+3.771
ADRA+3.627
ESA+3.567
Neg logit contributions
lly-5.568
 liqu-5.504
 Castro-5.472
bred-5.008
chin-4.932
Token deadline
Token ID12508
Feature activation+3.05e-02

Pos logit contributions
atism+7.675
PDATE+7.626
ESA+6.964
+6.881
ADRA+6.859
Neg logit contributions
lly-11.446
 liqu-11.084
 Castro-11.038
chin-10.736
bred-10.502
Token for
Token ID329
Feature activation-1.42e-01

Pos logit contributions
PDATE+0.278
atism+0.262
+0.243
ADRA+0.237
 mathemat+0.236
Neg logit contributions
lly-0.234
 liqu-0.233
 Castro-0.228
bred-0.206
 Honour-0.202
Token consideration
Token ID9110
Feature activation-2.66e-02

Pos logit contributions
lly+0.807
 liqu+0.796
 Castro+0.781
bred+0.672
ful+0.657
Neg logit contributions
PDATE-1.634
atism-1.527
 mathemat-1.447
ADRA-1.445
-1.435
Token for
Token ID329
Feature activation+6.19e-01

Pos logit contributions
lly+0.200
 liqu+0.198
 Castro+0.195
bred+0.175
 Honour+0.171
Neg logit contributions
PDATE-0.249
atism-0.234
-0.216
ADRA-0.214
 mathemat-0.214
Token the
Token ID262
Feature activation+4.00e-01

Pos logit contributions
PDATE+4.297
atism+4.143
+3.771
ADRA+3.627
ESA+3.567
Neg logit contributions
lly-5.568
 liqu-5.504
 Castro-5.472
bred-5.008
chin-4.932
Token 2019
Token ID13130
Feature activation+1.34e+00

Pos logit contributions
PDATE+3.471
atism+3.211
ADRA+2.973
+2.951
 mathemat+2.928
Neg logit contributions
lly-3.206
 liqu-3.158
 Castro-3.134
bred-2.849
ful-2.763
Token award
Token ID5764
Feature activation+2.68e-01

Pos logit contributions
atism+6.526
ESA+5.949
usha+5.892
PDATE+5.880
+5.869
Neg logit contributions
lly-12.420
 liqu-11.981
 Castro-11.812
chin-11.546
bred-11.242
Token is
Token ID318
Feature activation-5.23e-01

Pos logit contributions
PDATE+2.308
atism+2.228
+2.048
ADRA+1.984
 mathemat+1.967
Neg logit contributions
lly-2.084
 liqu-2.060
 Castro-2.039
 Honour-1.834
bred-1.823
Token February
Token ID3945
Feature activation-8.66e-02

Pos logit contributions
lly+2.865
 liqu+2.862
 Castro+2.798
ful+2.371
 Honour+2.355
Neg logit contributions
PDATE-5.956
atism-5.616
 mathemat-5.339
ADRA-5.335
-5.282
Token 28
Token ID2579
Feature activation+2.58e-01

Pos logit contributions
lly+0.652
 liqu+0.645
 Castro+0.638
bred+0.570
ful+0.560
Neg logit contributions
PDATE-0.833
atism-0.768
 mathemat-0.720
ADRA-0.717
-0.711
Token,
Token ID11
Feature activation+5.60e-04

Pos logit contributions
PDATE+2.546
atism+2.449
+2.295
ADRA+2.235
Buff+2.188
Neg logit contributions
 liqu-1.700
lly-1.692
 Castro-1.641
bred-1.444
 Honour-1.427
Token in
Token ID287
Feature activation+5.00e-04

Pos logit contributions
PDATE+0.547
atism+0.507
 mathemat+0.472
+0.469
ADRA+0.469
Neg logit contributions
lly-0.441
 liqu-0.435
 Castro-0.430
bred-0.388
 Honour-0.378
Token February
Token ID3945
Feature activation-1.18e-01

Pos logit contributions
PDATE+0.005
atism+0.005
 mathemat+0.005
ADRA+0.005
+0.005
Neg logit contributions
lly-0.003
 liqu-0.003
 Castro-0.003
bred-0.003
ful-0.003
Token 2017
Token ID2177
Feature activation+5.47e-01

Pos logit contributions
lly+0.683
 liqu+0.673
 Castro+0.662
bred+0.571
ful+0.557
Neg logit contributions
PDATE-1.340
atism-1.252
 mathemat-1.183
ADRA-1.183
-1.176
Token
Token ID198
Feature activation+2.59e-01

Pos logit contributions
PDATE+4.892
atism+4.473
 mathemat+4.164
+4.135
ADRA+4.076
Neg logit contributions
lly-4.227
 liqu-4.187
 Castro-4.113
bred-3.753
 Honour-3.646
Token
Token ID198
Feature activation+3.03e-01

Pos logit contributions
atism+2.098
PDATE+2.036
+1.978
Buff+1.893
ESA+1.843
Neg logit contributions
 liqu-2.061
lly-2.037
 Castro-1.988
bred-1.787
 Honour-1.760
TokenAverage
Token ID26287
Feature activation+1.29e+00

Pos logit contributions
PDATE+3.195
atism+3.041
+2.867
ADRA+2.863
Buff+2.801
Neg logit contributions
 liqu-1.888
lly-1.867
 Castro-1.842
bred-1.591
 Honour-1.561
Token annual
Token ID5079
Feature activation+1.97e-01

Pos logit contributions
atism+6.418
PDATE+5.941
+5.529
Buff+5.514
ESA+5.282
Neg logit contributions
 liqu-12.144
 Castro-12.106
lly-12.041
chin-11.768
bred-11.370
Token value
Token ID1988
Feature activation+6.59e-01

Pos logit contributions
PDATE+1.368
atism+1.279
+1.162
ADRA+1.131
 mathemat+1.125
Neg logit contributions
 liqu-1.890
lly-1.889
 Castro-1.869
bred-1.728
chin-1.698
Token:
Token ID25
Feature activation-5.77e-02

Pos logit contributions
atism+4.832
+4.634
PDATE+4.604
Buff+4.390
ESA+4.282
Neg logit contributions
 liqu-5.413
lly-5.302
 Castro-5.200
bred-4.755
chin-4.691
Token $
Token ID720
Feature activation-1.67e-01

Pos logit contributions
lly+0.372
 liqu+0.368
 Castro+0.361
bred+0.319
ful+0.310
Neg logit contributions
PDATE-0.613
atism-0.572
 mathemat-0.537
ADRA-0.537
-0.534
Token8
Token ID23
Feature activation-4.10e-01

Pos logit contributions
lly+1.088
 liqu+1.071
 Castro+1.061
bred+0.933
ful+0.914
Neg logit contributions
PDATE-1.778
atism-1.645
 mathemat-1.560
ADRA-1.546
-1.529
Token dead
Token ID2636
Feature activation-3.25e-01

Pos logit contributions
lly+1.158
 liqu+1.149
 Castro+1.132
bred+1.013
ful+0.994
Neg logit contributions
PDATE-1.485
atism-1.369
ADRA-1.278
 mathemat-1.278
-1.268
Token and
Token ID290
Feature activation-1.86e-01

Pos logit contributions
lly+2.270
 liqu+2.231
 Castro+2.190
bred+1.946
ful+1.927
Neg logit contributions
PDATE-3.338
atism-3.078
 mathemat-2.907
ADRA-2.899
-2.864
Token '
Token ID705
Feature activation-4.06e-01

Pos logit contributions
lly+1.389
 liqu+1.377
 Castro+1.360
bred+1.214
ful+1.191
Neg logit contributions
PDATE-1.801
atism-1.660
ADRA-1.551
 mathemat-1.549
-1.540
TokenPet
Token ID25803
Feature activation+1.67e-01

Pos logit contributions
lly+3.136
 liqu+3.062
 Castro+3.030
bred+2.798
ful+2.758
Neg logit contributions
PDATE-3.890
atism-3.504
 mathemat-3.313
ADRA-3.266
-3.211
Token Sem
Token ID12449
Feature activation+5.29e-01

Pos logit contributions
PDATE+1.684
atism+1.586
ADRA+1.483
 mathemat+1.475
+1.469
Neg logit contributions
lly-1.135
 liqu-1.134
 Castro-1.112
bred-0.990
 Honour-0.962
Tokenat
Token ID265
Feature activation+1.28e+00

Pos logit contributions
PDATE+4.723
atism+4.643
ADRA+4.211
 mathemat+4.109
+4.098
Neg logit contributions
 liqu-3.934
lly-3.889
 Castro-3.812
 Honour-3.433
bred-3.415
Tokenary
Token ID560
Feature activation-8.96e-03

Pos logit contributions
atism+12.025
PDATE+11.060
usha+10.863
+10.642
ouf+10.564
Neg logit contributions
 liqu-7.268
lly-7.191
 Castro-6.963
 Honour-6.051
fully-5.952
Token'
Token ID6
Feature activation-1.48e-01

Pos logit contributions
lly+0.063
 liqu+0.062
 Castro+0.061
bred+0.054
ful+0.053
Neg logit contributions
PDATE-0.090
atism-0.084
 mathemat-0.078
ADRA-0.078
-0.078
Token is
Token ID318
Feature activation+1.30e-02

Pos logit contributions
lly+1.069
 liqu+1.058
 Castro+1.045
bred+0.930
ful+0.913
Neg logit contributions
PDATE-1.475
atism-1.361
ADRA-1.274
 mathemat-1.274
-1.265
Token real
Token ID1103
Feature activation-2.79e-01

Pos logit contributions
PDATE+0.122
atism+0.113
 mathemat+0.105
ADRA+0.105
+0.105
Neg logit contributions
lly-0.099
 liqu-0.098
 Castro-0.096
bred-0.086
ful-0.085
Token,"
Token ID553
Feature activation+5.08e-02

Pos logit contributions
lly+2.056
 liqu+2.020
 Castro+1.985
bred+1.780
ful+1.748
Neg logit contributions
PDATE-2.756
atism-2.540
ADRA-2.378
 mathemat-2.374
-2.356
Token include
Token ID2291
Feature activation-1.82e-03

Pos logit contributions
lly+1.517
 liqu+1.494
 Castro+1.477
bred+1.314
ful+1.297
Neg logit contributions
PDATE-1.997
atism-1.838
ADRA-1.718
 mathemat-1.715
-1.703
Token:
Token ID25
Feature activation-7.67e-02

Pos logit contributions
lly+0.013
 liqu+0.013
 Castro+0.013
bred+0.011
ful+0.011
Neg logit contributions
PDATE-0.018
atism-0.017
 mathemat-0.016
ADRA-0.016
-0.016
Token Uber
Token ID12024
Feature activation-1.74e-02

Pos logit contributions
lly+0.549
 liqu+0.543
 Castro+0.536
bred+0.477
ful+0.467
Neg logit contributions
PDATE-0.763
atism-0.707
 mathemat-0.662
ADRA-0.661
-0.657
Token,
Token ID11
Feature activation+5.72e-02

Pos logit contributions
lly+0.111
 liqu+0.110
 Castro+0.108
bred+0.095
ful+0.092
Neg logit contributions
PDATE-0.186
atism-0.173
ADRA-0.163
 mathemat-0.163
-0.162
Token Ben
Token ID3932
Feature activation-3.97e-01

Pos logit contributions
PDATE+0.585
atism+0.547
ADRA+0.513
 mathemat+0.512
+0.509
Neg logit contributions
lly-0.388
 liqu-0.381
 Castro-0.376
bred-0.335
ful-0.325
Token &
Token ID1222
Feature activation+1.26e+00

Pos logit contributions
lly+2.627
 Castro+2.606
 liqu+2.571
bred+2.286
ful+2.236
Neg logit contributions
PDATE-4.172
atism-3.815
ADRA-3.680
 mathemat-3.666
-3.541
Token Jerry
Token ID13075
Feature activation+8.58e-02

Pos logit contributions
PDATE+8.368
+7.834
 Gö+7.780
atism+7.710
Buff+7.683
Neg logit contributions
lly-10.290
 liqu-9.934
bred-9.903
fully-9.429
 Castro-9.268
Token
Token ID447
Feature activation-1.82e-01

Pos logit contributions
PDATE+0.833
atism+0.773
 mathemat+0.724
ADRA+0.724
+0.722
Neg logit contributions
lly-0.629
 liqu-0.619
 Castro-0.608
bred-0.549
ful-0.534
Token
Token ID247
Feature activation-1.31e-02

Pos logit contributions
lly+2.003
 liqu+1.990
 Castro+1.979
bred+1.833
ful+1.814
Neg logit contributions
PDATE-1.100
atism-0.962
 mathemat-0.868
ADRA-0.845
-0.839
Tokens
Token ID82
Feature activation-9.72e-02

Pos logit contributions
lly+0.087
 liqu+0.086
 Castro+0.085
bred+0.075
ful+0.073
Neg logit contributions
PDATE-0.135
atism-0.126
ADRA-0.119
 mathemat-0.118
-0.118
Token and
Token ID290
Feature activation+3.48e-02

Pos logit contributions
lly+0.625
 liqu+0.617
 Castro+0.608
bred+0.533
ful+0.521
Neg logit contributions
PDATE-1.038
atism-0.968
 mathemat-0.910
ADRA-0.908
-0.904
Token earning
Token ID13748
Feature activation+7.55e-02

Pos logit contributions
PDATE+1.842
atism+1.710
 mathemat+1.602
+1.594
ADRA+1.593
Neg logit contributions
lly-1.497
 liqu-1.474
 Castro-1.458
bred-1.322
ful-1.286
Token his
Token ID465
Feature activation+3.43e-01

Pos logit contributions
PDATE+0.753
atism+0.701
 mathemat+0.658
ADRA+0.655
+0.653
Neg logit contributions
lly-0.529
 liqu-0.524
 Castro-0.517
bred-0.460
ful-0.449
Token fifth
Token ID8150
Feature activation+4.93e-03

Pos logit contributions
PDATE+2.817
atism+2.638
 mathemat+2.453
ADRA+2.390
+2.373
Neg logit contributions
lly-2.906
 liqu-2.848
 Castro-2.832
bred-2.570
ful-2.515
Token ACC
Token ID15859
Feature activation+1.65e-01

Pos logit contributions
PDATE+0.048
atism+0.044
 mathemat+0.041
ADRA+0.041
+0.041
Neg logit contributions
lly-0.037
 liqu-0.036
 Castro-0.036
bred-0.032
ful-0.031
Token Rookie
Token ID36595
Feature activation+7.00e-02

Pos logit contributions
PDATE+1.455
atism+1.329
 mathemat+1.242
ADRA+1.240
+1.237
Neg logit contributions
lly-1.349
 liqu-1.333
 Castro-1.317
bred-1.189
ful-1.160
Token of
Token ID286
Feature activation+1.25e+00

Pos logit contributions
PDATE+0.681
atism+0.628
+0.588
ADRA+0.587
 mathemat+0.587
Neg logit contributions
lly-0.511
 liqu-0.505
 Castro-0.497
bred-0.447
ful-0.431
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
atism+6.022
+5.650
PDATE+5.489
Buff+5.391
Package+5.084
Neg logit contributions
 liqu-12.237
lly-11.766
 Castro-11.451
bred-10.722
chin-10.504
Token Week
Token ID6119
Feature activation-5.69e-02

Pos logit contributions
atism+4.293
PDATE+4.145
+3.828
Buff+3.749
Package+3.511
Neg logit contributions
 liqu-11.626
lly-11.372
 Castro-10.720
bred-10.427
chin-10.187
Token,
Token ID11
Feature activation+2.13e-01

Pos logit contributions
lly+0.435
 liqu+0.430
 Castro+0.424
bred+0.381
ful+0.372
Neg logit contributions
PDATE-0.536
atism-0.496
 mathemat-0.463
ADRA-0.462
-0.459
Token first
Token ID717
Feature activation-1.70e-01

Pos logit contributions
PDATE+2.009
atism+1.858
 mathemat+1.751
ADRA+1.737
+1.732
Neg logit contributions
lly-1.583
 liqu-1.561
 Castro-1.545
bred-1.408
ful-1.355
Token ACC
Token ID15859
Feature activation-3.00e-02

Pos logit contributions
lly+1.344
 liqu+1.322
 Castro+1.304
bred+1.178
ful+1.164
Neg logit contributions
PDATE-1.591
atism-1.461
ADRA-1.362
 mathemat-1.352
-1.342
Token in
Token ID287
Feature activation-1.07e-01

Pos logit contributions
PDATE+0.762
atism+0.704
 mathemat+0.655
+0.652
ADRA+0.651
Neg logit contributions
lly-0.622
 liqu-0.613
 Castro-0.606
bred-0.548
 Honour-0.534
Token March
Token ID2805
Feature activation-2.96e-01

Pos logit contributions
lly+0.672
 liqu+0.666
 Castro+0.657
bred+0.573
ful+0.560
Neg logit contributions
PDATE-1.158
atism-1.075
ADRA-1.017
 mathemat-1.014
-1.006
Token 2016
Token ID1584
Feature activation+3.41e-01

Pos logit contributions
lly+1.721
 liqu+1.701
 Castro+1.672
bred+1.438
ful+1.420
Neg logit contributions
PDATE-3.387
atism-3.123
ADRA-2.989
 mathemat-2.976
-2.937
Token
Token ID198
Feature activation+3.37e-02

Pos logit contributions
PDATE+3.217
atism+2.958
 mathemat+2.765
+2.742
ADRA+2.723
Neg logit contributions
lly-2.565
 liqu-2.537
 Castro-2.491
bred-2.256
 Honour-2.194
Token
Token ID198
Feature activation+1.72e-01

Pos logit contributions
PDATE+0.303
atism+0.295
+0.276
ADRA+0.267
Buff+0.263
Neg logit contributions
 liqu-0.252
lly-0.252
 Castro-0.245
bred-0.219
 Honour-0.215
TokenAverage
Token ID26287
Feature activation+1.24e+00

Pos logit contributions
PDATE+1.870
atism+1.772
ADRA+1.667
+1.667
 mathemat+1.640
Neg logit contributions
 liqu-1.040
lly-1.035
 Castro-1.016
bred-0.875
 Honour-0.855
Token annual
Token ID5079
Feature activation+1.52e-01

Pos logit contributions
atism+6.365
PDATE+5.876
+5.485
Buff+5.431
ESA+5.220
Neg logit contributions
 liqu-11.702
 Castro-11.668
lly-11.581
chin-11.269
bred-10.895
Token value
Token ID1988
Feature activation+5.94e-01

Pos logit contributions
PDATE+1.080
atism+1.005
+0.914
ADRA+0.894
 mathemat+0.889
Neg logit contributions
lly-1.447
 liqu-1.445
 Castro-1.429
bred-1.318
 Honour-1.296
Token:
Token ID25
Feature activation-1.09e-01

Pos logit contributions
atism+4.472
PDATE+4.326
+4.289
Buff+4.060
ESA+3.939
Neg logit contributions
 liqu-4.824
lly-4.746
 Castro-4.633
bred-4.240
chin-4.164
Token $
Token ID720
Feature activation-5.07e-02

Pos logit contributions
lly+0.685
 liqu+0.676
 Castro+0.666
bred+0.581
ful+0.569
Neg logit contributions
PDATE-1.179
atism-1.099
 mathemat-1.036
ADRA-1.034
-1.027
Token3
Token ID18
Feature activation-3.85e-01

Pos logit contributions
lly+0.346
 liqu+0.342
 Castro+0.336
bred+0.298
ful+0.291
Neg logit contributions
PDATE-0.519
atism-0.485
ADRA-0.453
-0.453
 mathemat-0.452
Token freshman
Token ID18621
Feature activation+1.11e-01

Pos logit contributions
PDATE+1.498
atism+1.383
 mathemat+1.296
+1.287
ADRA+1.279
Neg logit contributions
lly-1.240
 liqu-1.223
 Castro-1.210
bred-1.082
ful-1.061
Token to
Token ID284
Feature activation-4.95e-02

Pos logit contributions
PDATE+1.106
atism+1.022
 mathemat+0.962
+0.959
ADRA+0.957
Neg logit contributions
lly-0.781
 liqu-0.772
 Castro-0.761
bred-0.678
ful-0.660
Token win
Token ID1592
Feature activation+2.61e-01

Pos logit contributions
lly+0.377
 liqu+0.373
 Castro+0.368
bred+0.330
ful+0.324
Neg logit contributions
PDATE-0.470
atism-0.435
 mathemat-0.405
ADRA-0.405
-0.402
Token ACC
Token ID15859
Feature activation+1.50e-02

Pos logit contributions
PDATE+2.295
atism+2.114
 mathemat+1.966
ADRA+1.954
+1.941
Neg logit contributions
lly-2.117
 liqu-2.094
 Castro-2.069
bred-1.878
ful-1.828
Token Player
Token ID7853
Feature activation+2.40e-01

Pos logit contributions
PDATE+0.165
atism+0.154
 mathemat+0.146
ADRA+0.145
+0.145
Neg logit contributions
lly-0.092
 liqu-0.090
 Castro-0.089
bred-0.077
ful-0.075
Token of
Token ID286
Feature activation+1.22e+00

Pos logit contributions
PDATE+2.279
atism+2.105
+1.963
 mathemat+1.957
ADRA+1.954
Neg logit contributions
lly-1.794
 liqu-1.781
 Castro-1.743
bred-1.568
ful-1.507
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
atism+6.063
+5.667
PDATE+5.374
Buff+5.265
Package+5.063
Neg logit contributions
 liqu-11.898
lly-11.562
 Castro-11.173
bred-10.452
chin-10.243
Token Year
Token ID6280
Feature activation+6.70e-02

Pos logit contributions
atism+5.214
PDATE+5.118
+4.875
Buff+4.672
Package+4.480
Neg logit contributions
 liqu-9.865
lly-9.834
 Castro-9.134
bred-8.893
chin-8.548
Token.[
Token ID3693
Feature activation-4.66e-01

Pos logit contributions
PDATE+0.747
atism+0.699
 mathemat+0.662
ADRA+0.660
+0.657
Neg logit contributions
lly-0.393
 liqu-0.387
 Castro-0.379
bred-0.331
ful-0.319
Token117
Token ID17657
Feature activation-1.46e-01

Pos logit contributions
lly+2.517
 liqu+2.496
 Castro+2.384
bred+2.091
ful+2.075
Neg logit contributions
PDATE-5.425
atism-5.082
 mathemat-4.823
ADRA-4.765
-4.737
Token]
Token ID60
Feature activation+2.12e-01

Pos logit contributions
lly+0.601
 liqu+0.590
 Castro+0.575
bred+0.463
ful+0.443
Neg logit contributions
PDATE-1.901
atism-1.794
ADRA-1.707
 mathemat-1.706
-1.700
Token a
Token ID257
Feature activation-7.39e-02

Pos logit contributions
lly+0.864
 liqu+0.860
 Castro+0.846
bred+0.753
ful+0.742
Neg logit contributions
PDATE-1.131
atism-1.043
 mathemat-0.975
ADRA-0.974
-0.971
Token breathing
Token ID12704
Feature activation+2.56e-01

Pos logit contributions
lly+0.577
 liqu+0.575
 Castro+0.566
bred+0.508
ful+0.500
Neg logit contributions
PDATE-0.687
atism-0.631
 mathemat-0.589
ADRA-0.588
-0.585
Token problem
Token ID1917
Feature activation+1.71e-01

Pos logit contributions
PDATE+1.871
atism+1.719
ADRA+1.570
 mathemat+1.555
+1.554
Neg logit contributions
lly-2.448
 liqu-2.408
 Castro-2.403
bred-2.228
 Honour-2.177
Token.
Token ID13
Feature activation+3.70e-02

Pos logit contributions
PDATE+1.735
atism+1.623
 mathemat+1.512
ADRA+1.511
+1.510
Neg logit contributions
lly-1.164
 liqu-1.150
 Castro-1.131
bred-1.007
 Honour-0.976
Token Man
Token ID1869
Feature activation-4.87e-02

Pos logit contributions
PDATE+0.339
atism+0.313
ADRA+0.292
+0.290
 mathemat+0.289
Neg logit contributions
lly-0.293
 liqu-0.290
 Castro-0.285
bred-0.258
ful-0.252
Tokenj
Token ID73
Feature activation+1.21e+00

Pos logit contributions
lly+0.356
 liqu+0.353
 Castro+0.348
bred+0.310
ful+0.304
Neg logit contributions
PDATE-0.475
atism-0.441
 mathemat-0.412
ADRA-0.411
-0.409
Tokenula
Token ID4712
Feature activation-2.08e-01

Pos logit contributions
atism+5.986
usha+5.511
ouf+4.875
+4.843
ESA+4.715
Neg logit contributions
 Castro-12.356
 liqu-12.283
lly-11.457
 Chin-11.124
 Honour-11.122
Token's
Token ID338
Feature activation-2.08e-02

Pos logit contributions
lly+1.470
 liqu+1.453
 Castro+1.434
bred+1.268
ful+1.249
Neg logit contributions
PDATE-2.111
atism-1.947
 mathemat-1.823
ADRA-1.822
-1.814
Token husband
Token ID5229
Feature activation-1.32e-02

Pos logit contributions
lly+0.163
 liqu+0.161
 Castro+0.159
bred+0.143
ful+0.141
Neg logit contributions
PDATE-0.191
atism-0.178
 mathemat-0.165
ADRA-0.165
-0.163
Token Nat
Token ID14393
Feature activation+3.49e-01

Pos logit contributions
lly+0.094
 liqu+0.093
 Castro+0.091
bred+0.082
ful+0.079
Neg logit contributions
PDATE-0.130
atism-0.121
 mathemat-0.114
ADRA-0.114
-0.113
Tokenv
Token ID85
Feature activation+3.03e-01

Pos logit contributions
PDATE+3.229
atism+3.110
ADRA+2.857
+2.855
 mathemat+2.770
Neg logit contributions
 liqu-2.580
lly-2.529
 Castro-2.507
bred-2.238
 Honour-2.198
Tokenkowski
Token ID26216
Feature activation+9.29e-02

Pos logit contributions
PDATE+0.627
atism+0.521
+0.400
ADRA+0.389
 mathemat+0.385
Neg logit contributions
lly-2.433
 liqu-2.429
 Castro-2.408
bred-2.265
 Honour-2.241
Token during
Token ID1141
Feature activation+3.89e-01

Pos logit contributions
PDATE+0.947
atism+0.886
 mathemat+0.828
+0.826
ADRA+0.826
Neg logit contributions
lly-0.630
 liqu-0.624
 Castro-0.614
bred-0.544
 Honour-0.529
Token 7
Token ID767
Feature activation-7.66e-03

Pos logit contributions
PDATE+3.376
atism+3.149
 mathemat+2.900
+2.897
ADRA+2.867
Neg logit contributions
lly-3.161
 liqu-3.095
 Castro-3.041
bred-2.736
 Honour-2.709
Token-
Token ID12
Feature activation+3.47e-01

Pos logit contributions
lly+0.048
 liqu+0.047
 Castro+0.046
bred+0.041
 Honour+0.040
Neg logit contributions
PDATE-0.083
atism-0.077
ADRA-0.073
-0.073
 mathemat-0.073
Tokenon
Token ID261
Feature activation+5.58e-01

Pos logit contributions
PDATE+2.653
atism+2.548
+2.401
ADRA+2.287
 mathemat+2.284
Neg logit contributions
 liqu-3.082
lly-3.026
 Castro-3.020
 Honour-2.703
bred-2.692
Token-
Token ID12
Feature activation+1.20e+00

Pos logit contributions
atism+5.047
PDATE+4.950
+4.879
Buff+4.649
ADRA+4.547
Neg logit contributions
 liqu-3.729
lly-3.695
 Castro-3.560
bred-3.123
 Honour-3.104
Token7
Token ID22
Feature activation-1.24e-01

Pos logit contributions
atism+8.258
+8.132
Buff+7.912
PDATE+7.871
Package+7.444
Neg logit contributions
 liqu-10.049
lly-9.976
 Castro-9.555
 Honour-8.714
 Chin-8.566
Token drills
Token ID26721
Feature activation-1.90e-02

Pos logit contributions
lly+0.936
 liqu+0.927
 Castro+0.915
bred+0.822
ful+0.809
Neg logit contributions
PDATE-1.187
atism-1.093
ADRA-1.021
 mathemat-1.020
-1.012
Token and
Token ID290
Feature activation+2.26e-01

Pos logit contributions
lly+0.126
 liqu+0.124
 Castro+0.122
bred+0.108
 Honour+0.105
Neg logit contributions
PDATE-0.198
atism-0.185
 mathemat-0.173
ADRA-0.173
-0.172
Token then
Token ID788
Feature activation+2.35e-01

Pos logit contributions
PDATE+2.034
atism+1.915
 mathemat+1.781
+1.770
ADRA+1.760
Neg logit contributions
lly-1.752
 liqu-1.722
 Castro-1.691
bred-1.552
 Honour-1.507
Token intercept
Token ID15788
Feature activation-2.74e-01

Pos logit contributions
PDATE+2.105
atism+1.984
 mathemat+1.846
+1.830
ADRA+1.823
Neg logit contributions
lly-1.818
 liqu-1.791
 Castro-1.761
bred-1.609
 Honour-1.571
Token Sheen
Token ID50189
Feature activation+4.89e-01

Pos logit contributions
lly+3.210
 Castro+3.182
 liqu+3.122
bred+2.789
 Chin+2.787
Neg logit contributions
PDATE-4.522
atism-4.057
 mathemat-3.934
ADRA-3.889
 volunte-3.790
Token 2009
Token ID3717
Feature activation-1.77e-01

Pos logit contributions
PDATE+4.200
atism+3.930
+3.601
 mathemat+3.568
ADRA+3.544
Neg logit contributions
 liqu-3.926
lly-3.898
 Castro-3.800
bred-3.428
chin-3.352
Token Alec
Token ID38285
Feature activation+1.67e-01

Pos logit contributions
lly+1.213
 liqu+1.194
 Castro+1.184
bred+1.045
ful+1.023
Neg logit contributions
PDATE-1.816
atism-1.683
 mathemat-1.582
ADRA-1.579
-1.566
Token Baldwin
Token ID26402
Feature activation+4.17e-01

Pos logit contributions
PDATE+1.433
atism+1.367
+1.276
ADRA+1.235
 mathemat+1.218
Neg logit contributions
 liqu-1.332
lly-1.329
 Castro-1.278
bred-1.170
ful-1.154
Token
Token ID198
Feature activation+1.90e-01

Pos logit contributions
PDATE+3.425
atism+3.406
+3.212
Buff+3.041
ESA+2.975
Neg logit contributions
 liqu-3.268
lly-3.232
 Castro-3.113
bred-2.819
 Honour-2.750
Token
Token ID198
Feature activation+1.18e+00

Pos logit contributions
atism+1.551
PDATE+1.522
+1.468
Buff+1.398
ESA+1.369
Neg logit contributions
 liqu-1.506
lly-1.477
 Castro-1.447
bred-1.293
 Honour-1.278
Token2009
Token ID10531
Feature activation-3.40e-01

Pos logit contributions
atism+4.854
Buff+4.721
+4.686
Package+4.515
ESA+4.259
Neg logit contributions
 liqu-13.685
 Castro-12.808
lly-12.747
 Honour-11.628
 Labrador-11.624
Token Steve
Token ID6542
Feature activation-2.26e-01

Pos logit contributions
lly+2.378
 Castro+2.348
 liqu+2.313
bred+2.047
 Chin+2.025
Neg logit contributions
PDATE-3.487
atism-3.203
 mathemat-3.042
ADRA-3.013
-2.963
Token Care
Token ID7276
Feature activation-6.38e-01

Pos logit contributions
lly+1.720
 liqu+1.694
 Castro+1.685
bred+1.505
ful+1.481
Neg logit contributions
PDATE-2.165
atism-1.981
 mathemat-1.868
ADRA-1.856
-1.834
Tokenll
Token ID297
Feature activation+3.87e-01

Pos logit contributions
lly+2.954
 liqu+2.663
 Castro+2.565
ful+2.310
bred+2.259
Neg logit contributions
PDATE-8.557
 mathemat-7.732
atism-7.652
ADRA-7.612
-7.513
Token
Token ID198
Feature activation+2.62e-01

Pos logit contributions
PDATE+3.277
atism+3.185
+2.996
Buff+2.842
usha+2.801
Neg logit contributions
lly-2.992
 liqu-2.985
 Castro-2.873
bred-2.584
ful-2.532
Tokenfor
Token ID1640
Feature activation+4.98e-01

Pos logit contributions
PDATE+2.292
atism+2.271
+2.083
ADRA+1.968
Buff+1.957
Neg logit contributions
 liqu-2.562
lly-2.520
 Castro-2.514
bred-2.246
 Honour-2.228
Token his
Token ID465
Feature activation+1.82e-01

Pos logit contributions
PDATE+4.227
atism+3.943
 mathemat+3.680
ADRA+3.675
+3.606
Neg logit contributions
lly-3.948
 liqu-3.919
 Castro-3.857
bred-3.494
ful-3.381
Token fourth
Token ID5544
Feature activation-1.45e-01

Pos logit contributions
PDATE+1.504
atism+1.404
 mathemat+1.298
ADRA+1.274
+1.269
Neg logit contributions
lly-1.561
 liqu-1.530
 Castro-1.517
bred-1.377
ful-1.352
Token ACC
Token ID15859
Feature activation+6.26e-02

Pos logit contributions
lly+1.061
 liqu+1.050
 Castro+1.036
bred+0.925
ful+0.908
Neg logit contributions
PDATE-1.433
atism-1.323
ADRA-1.238
 mathemat-1.234
-1.224
Token Rookie
Token ID36595
Feature activation+1.87e-02

Pos logit contributions
PDATE+0.530
atism+0.484
 mathemat+0.449
ADRA+0.448
+0.448
Neg logit contributions
lly-0.535
 liqu-0.528
 Castro-0.522
bred-0.474
ful-0.463
Token of
Token ID286
Feature activation+1.17e+00

Pos logit contributions
PDATE+0.182
atism+0.168
ADRA+0.157
+0.157
 mathemat+0.157
Neg logit contributions
lly-0.136
 liqu-0.135
 Castro-0.132
bred-0.119
ful-0.115
Token the
Token ID262
Feature activation+1.05e+00

Pos logit contributions
atism+6.005
+5.603
PDATE+5.557
Buff+5.251
Package+5.021
Neg logit contributions
 liqu-11.358
lly-10.992
 Castro-10.552
bred-9.972
chin-9.775
Token Week
Token ID6119
Feature activation-1.82e-01

Pos logit contributions
PDATE+5.468
atism+5.054
+4.433
ADRA+4.350
Buff+4.170
Neg logit contributions
 liqu-10.981
lly-10.829
 Castro-10.208
bred-9.822
chin-9.631
Token Award
Token ID11289
Feature activation-1.66e-01

Pos logit contributions
lly+1.333
 liqu+1.318
 Castro+1.300
bred+1.160
ful+1.139
Neg logit contributions
PDATE-1.788
atism-1.656
 mathemat-1.546
ADRA-1.545
-1.536
Token and
Token ID290
Feature activation+1.17e-01

Pos logit contributions
lly+1.118
 liqu+1.103
 Castro+1.087
bred+0.960
ful+0.943
Neg logit contributions
PDATE-1.715
atism-1.596
 mathemat-1.497
ADRA-1.497
-1.487
Token first
Token ID717
Feature activation-2.81e-02

Pos logit contributions
PDATE+1.067
atism+0.988
 mathemat+0.925
ADRA+0.918
+0.911
Neg logit contributions
lly-0.909
 liqu-0.900
 Castro-0.888
bred-0.801
ful-0.782
Token in
Token ID287
Feature activation+2.07e-02

Pos logit contributions
lly+0.589
 liqu+0.583
 Castro+0.575
bred+0.514
ful+0.505
Neg logit contributions
PDATE-0.766
atism-0.708
ADRA-0.661
 mathemat-0.661
-0.657
Token March
Token ID2805
Feature activation-3.76e-01

Pos logit contributions
PDATE+0.221
atism+0.206
 mathemat+0.193
ADRA+0.193
+0.192
Neg logit contributions
lly-0.134
 liqu-0.132
 Castro-0.130
bred-0.114
ful-0.111
Token 2016
Token ID1584
Feature activation+3.24e-01

Pos logit contributions
lly+2.059
 liqu+2.041
 Castro+1.995
bred+1.692
ful+1.679
Neg logit contributions
PDATE-4.436
atism-4.084
ADRA-3.936
 mathemat-3.892
-3.860
Token
Token ID198
Feature activation+3.34e-02

Pos logit contributions
PDATE+3.053
atism+2.811
 mathemat+2.627
+2.603
ADRA+2.583
Neg logit contributions
lly-2.432
 liqu-2.411
 Castro-2.365
bred-2.143
 Honour-2.083
Token
Token ID198
Feature activation+2.02e-01

Pos logit contributions
PDATE+0.300
atism+0.292
+0.274
ADRA+0.264
Buff+0.260
Neg logit contributions
 liqu-0.250
lly-0.250
 Castro-0.244
bred-0.218
 Honour-0.213
TokenAverage
Token ID26287
Feature activation+1.17e+00

Pos logit contributions
PDATE+2.188
atism+2.074
ADRA+1.952
+1.951
 mathemat+1.918
Neg logit contributions
 liqu-1.227
lly-1.219
 Castro-1.200
bred-1.032
 Honour-1.007
Token annual
Token ID5079
Feature activation+1.42e-01

Pos logit contributions
atism+6.252
PDATE+5.870
+5.406
Buff+5.369
ESA+5.123
Neg logit contributions
 liqu-10.996
 Castro-10.949
lly-10.859
chin-10.454
bred-10.165
Token value
Token ID1988
Feature activation+5.10e-01

Pos logit contributions
PDATE+1.012
atism+0.941
+0.855
ADRA+0.837
 mathemat+0.835
Neg logit contributions
lly-1.346
 liqu-1.344
 Castro-1.329
bred-1.224
 Honour-1.205
Token:
Token ID25
Feature activation-1.68e-01

Pos logit contributions
atism+3.973
PDATE+3.909
+3.788
Buff+3.593
ESA+3.485
Neg logit contributions
 liqu-4.083
lly-4.033
 Castro-3.935
bred-3.590
 Honour-3.521
Token $
Token ID720
Feature activation+1.39e-02

Pos logit contributions
lly+1.032
 liqu+1.018
 Castro+1.008
bred+0.868
ful+0.855
Neg logit contributions
PDATE-1.852
atism-1.720
 mathemat-1.634
ADRA-1.625
-1.605
Token20
Token ID1238
Feature activation-2.81e-01

Pos logit contributions
PDATE+0.137
atism+0.129
+0.121
ADRA+0.120
 mathemat+0.119
Neg logit contributions
lly-0.098
 liqu-0.097
 Castro-0.095
bred-0.084
 Honour-0.082
Token a
Token ID257
Feature activation+1.47e-01

Pos logit contributions
lly+0.935
 liqu+0.924
 Castro+0.914
bred+0.812
ful+0.801
Neg logit contributions
PDATE-1.218
atism-1.123
ADRA-1.048
 mathemat-1.039
-1.037
Token battery
Token ID6555
Feature activation+1.03e-01

Pos logit contributions
PDATE+1.310
atism+1.208
 mathemat+1.131
+1.116
ADRA+1.115
Neg logit contributions
lly-1.183
 liqu-1.164
 Castro-1.150
bred-1.045
 Honour-1.022
Token operated
Token ID12228
Feature activation+9.30e-02

Pos logit contributions
PDATE+1.011
atism+0.935
 mathemat+0.876
ADRA+0.872
+0.870
Neg logit contributions
lly-0.753
 liqu-0.743
 Castro-0.733
bred-0.656
 Honour-0.640
Token portable
Token ID17726
Feature activation+2.28e-01

Pos logit contributions
PDATE+0.877
atism+0.810
 mathemat+0.756
ADRA+0.752
+0.750
Neg logit contributions
lly-0.711
 liqu-0.702
 Castro-0.693
bred-0.624
ful-0.610
Token AM
Token ID3001
Feature activation+4.38e-01

Pos logit contributions
PDATE+2.151
atism+1.985
 mathemat+1.862
+1.850
ADRA+1.837
Neg logit contributions
lly-1.720
 liqu-1.694
 Castro-1.675
bred-1.515
 Honour-1.474
Token/
Token ID14
Feature activation+1.16e+00

Pos logit contributions
PDATE+4.102
atism+3.744
+3.548
 mathemat+3.537
ADRA+3.525
Neg logit contributions
lly-3.320
 liqu-3.289
 Castro-3.233
bred-2.925
 Honour-2.853
TokenSW
Token ID17887
Feature activation+4.07e-01

Pos logit contributions
PDATE+5.199
atism+4.562
+4.257
ESA+4.099
ADRA+3.950
Neg logit contributions
 liqu-13.169
lly-13.039
 Castro-12.946
 Honour-12.480
bred-11.949
Token receiver
Token ID9733
Feature activation+4.37e-01

Pos logit contributions
PDATE+3.669
atism+3.333
+3.138
ADRA+3.120
 mathemat+3.119
Neg logit contributions
lly-3.263
 liqu-3.215
 Castro-3.172
bred-2.896
 Honour-2.858
Token with
Token ID351
Feature activation+7.04e-02

Pos logit contributions
PDATE+4.094
atism+3.766
 mathemat+3.568
+3.560
ADRA+3.487
Neg logit contributions
lly-3.233
 liqu-3.181
 Castro-3.127
bred-2.824
 Honour-2.776
Token an
Token ID281
Feature activation+2.29e-01

Pos logit contributions
PDATE+0.687
atism+0.636
 mathemat+0.594
ADRA+0.594
+0.590
Neg logit contributions
lly-0.517
 liqu-0.511
 Castro-0.504
bred-0.450
ful-0.441
Token internal
Token ID5387
Feature activation+3.37e-01

Pos logit contributions
PDATE+2.125
atism+1.964
 mathemat+1.827
ADRA+1.820
+1.818
Neg logit contributions
lly-1.775
 liqu-1.755
 Castro-1.728
bred-1.567
ful-1.525
Token their
Token ID511
Feature activation+1.35e-01

Pos logit contributions
lly+0.993
 liqu+0.981
 Castro+0.968
bred+0.864
ful+0.847
Neg logit contributions
PDATE-1.329
atism-1.232
ADRA-1.149
 mathemat-1.148
-1.143
Token own
Token ID898
Feature activation+2.73e-01

Pos logit contributions
PDATE+1.252
atism+1.174
+1.085
 mathemat+1.082
ADRA+1.077
Neg logit contributions
lly-1.029
 liqu-1.024
 Castro-1.013
bred-0.903
ful-0.885
Token and
Token ID290
Feature activation+2.68e-01

Pos logit contributions
PDATE+2.291
atism+2.133
+1.986
Buff+1.958
 mathemat+1.956
Neg logit contributions
lly-2.182
 liqu-2.170
 Castro-2.145
bred-2.003
ful-1.926
Token do
Token ID466
Feature activation+6.08e-01

Pos logit contributions
PDATE+1.933
atism+1.807
+1.612
 mathemat+1.610
ADRA+1.608
Neg logit contributions
 liqu-2.547
lly-2.540
 Castro-2.491
 Honour-2.293
bred-2.293
Token not
Token ID407
Feature activation+1.99e-01

Pos logit contributions
atism+4.060
PDATE+3.937
+3.859
Buff+3.570
usha+3.390
Neg logit contributions
 liqu-5.543
lly-5.318
 Castro-5.231
bred-4.866
 Honour-4.799
Token represent
Token ID2380
Feature activation+1.16e+00

Pos logit contributions
PDATE+1.749
atism+1.655
 mathemat+1.518
ADRA+1.515
+1.506
Neg logit contributions
lly-1.596
 liqu-1.596
 Castro-1.556
bred-1.407
 Honour-1.377
Token the
Token ID262
Feature activation+8.51e-01

Pos logit contributions
PDATE+6.165
atism+6.144
Buff+5.473
ADRA+5.382
+5.266
Neg logit contributions
lly-11.348
 liqu-11.264
 Castro-10.711
chuk-10.304
chin-10.286
Token views
Token ID5009
Feature activation+8.13e-01

Pos logit contributions
atism+7.369
PDATE+7.147
Buff+6.650
ADRA+6.222
 Tanz+6.107
Neg logit contributions
lly-6.141
 liqu-5.908
 Castro-5.702
ful-5.317
bred-5.283
Token of
Token ID286
Feature activation+7.09e-01

Pos logit contributions
atism+6.018
+5.418
PDATE+5.371
Buff+5.153
usha+4.969
Neg logit contributions
 liqu-6.656
lly-6.632
 Castro-6.400
 Honour-5.929
chin-5.699
Token Town
Token ID8329
Feature activation+9.65e-02

Pos logit contributions
PDATE+6.302
atism+6.061
Buff+5.707
 Tanz+5.559
ADRA+5.519
Neg logit contributions
lly-5.063
 liqu-4.863
 Castro-4.495
 Honour-4.477
ful-4.333
Tokenhall
Token ID18323
Feature activation+1.30e-02

Pos logit contributions
PDATE+1.145
atism+1.073
 mathemat+1.016
ADRA+1.016
+1.009
Neg logit contributions
lly-0.514
 liqu-0.502
 Castro-0.497
bred-0.419
ful-0.412
Token0
Token ID15
Feature activation-1.06e-01

Pos logit contributions
lly+1.723
 liqu+1.702
 Castro+1.684
bred+1.515
ful+1.498
Neg logit contributions
PDATE-2.148
atism-1.963
 mathemat-1.851
ADRA-1.829
-1.800
Tokens
Token ID82
Feature activation-6.03e-01

Pos logit contributions
lly+0.658
 liqu+0.649
 Castro+0.639
bred+0.558
ful+0.544
Neg logit contributions
PDATE-1.151
atism-1.075
 mathemat-1.011
ADRA-1.011
-1.007
Token)}
Token ID38165
Feature activation-5.04e-01

Pos logit contributions
lly+4.030
 Castro+3.887
 liqu+3.879
bred+3.434
ful+3.430
Neg logit contributions
PDATE-6.536
atism-5.881
 mathemat-5.849
 volunte-5.620
ADRA-5.507
Token c
Token ID269
Feature activation-1.73e-01

Pos logit contributions
 liqu+3.187
lly+3.173
 Castro+3.149
 Honour+2.704
bred+2.702
Neg logit contributions
PDATE-5.499
atism-5.026
 mathemat-4.754
ADRA-4.733
-4.721
Tokenxd
Token ID24954
Feature activation-4.46e-01

Pos logit contributions
lly+1.489
 liqu+1.467
 Castro+1.452
bred+1.320
ful+1.304
Neg logit contributions
PDATE-1.497
atism-1.350
 mathemat-1.255
ADRA-1.251
-1.239
Token4
Token ID19
Feature activation-1.24e+00

Pos logit contributions
lly+2.721
 Castro+2.667
 liqu+2.657
bred+2.275
ful+2.230
Neg logit contributions
PDATE-4.973
atism-4.588
 mathemat-4.385
ADRA-4.306
-4.267
Token {
Token ID1391
Feature activation-3.76e-01

Pos logit contributions
 Castro+5.016
lly+4.858
 liqu+4.787
 Honour+3.752
 Chin+3.684
Neg logit contributions
PDATE-15.637
atism-14.447
 mathemat-13.850
ADRA-13.606
 volunte-13.497
Token(
Token ID7
Feature activation-1.38e-01

Pos logit contributions
lly+2.145
 liqu+2.092
 Castro+2.068
bred+1.830
ful+1.799
Neg logit contributions
PDATE-4.422
atism-4.019
 mathemat-3.953
 volunte-3.831
ADRA-3.824
Token341
Token ID33660
Feature activation-3.85e-01

Pos logit contributions
lly+1.039
 liqu+1.027
 Castro+1.014
bred+0.910
ful+0.896
Neg logit contributions
PDATE-1.328
atism-1.220
 mathemat-1.145
ADRA-1.138
-1.125
Tokens
Token ID82
Feature activation-2.26e-01

Pos logit contributions
lly+2.228
 liqu+2.185
 Castro+2.140
bred+1.880
ful+1.856
Neg logit contributions
PDATE-4.490
atism-4.103
 mathemat-3.974
ADRA-3.938
 volunte-3.825
Token)}
Token ID38165
Feature activation-7.13e-01

Pos logit contributions
lly+1.663
 liqu+1.635
 Castro+1.619
bred+1.448
ful+1.425
Neg logit contributions
PDATE-2.242
atism-2.053
 mathemat-1.953
ADRA-1.920
-1.895
Token of
Token ID286
Feature activation-4.41e-01

Pos logit contributions
PDATE+0.769
atism+0.714
 mathemat+0.668
ADRA+0.665
+0.663
Neg logit contributions
lly-0.559
 liqu-0.552
 Castro-0.545
bred-0.486
 Honour-0.476
Token a
Token ID257
Feature activation-4.46e-01

Pos logit contributions
lly+3.205
 liqu+3.203
 Castro+3.182
bred+2.815
ful+2.760
Neg logit contributions
PDATE-4.357
atism-3.992
ADRA-3.759
 mathemat-3.758
-3.709
Token public
Token ID1171
Feature activation-2.19e-01

Pos logit contributions
 liqu+3.521
lly+3.511
 Castro+3.466
bred+3.097
 Chin+3.047
Neg logit contributions
PDATE-4.166
atism-3.778
ADRA-3.554
 mathemat-3.505
-3.498
Token street
Token ID4675
Feature activation-8.97e-02

Pos logit contributions
lly+1.796
 liqu+1.777
 Castro+1.745
bred+1.576
ful+1.555
Neg logit contributions
PDATE-2.004
atism-1.809
ADRA-1.692
 mathemat-1.681
-1.672
Token shall
Token ID2236
Feature activation-7.41e-01

Pos logit contributions
lly+0.640
 liqu+0.634
 Castro+0.624
bred+0.556
ful+0.546
Neg logit contributions
PDATE-0.898
atism-0.830
 mathemat-0.777
ADRA-0.777
-0.771
Token maintain
Token ID5529
Feature activation-1.19e+00

Pos logit contributions
lly+5.368
 liqu+5.343
 Castro+5.056
ful+4.685
bred+4.487
Neg logit contributions
PDATE-7.730
atism-6.663
ADRA-6.573
-6.389
 mathemat-6.381
Token any
Token ID597
Feature activation-7.77e-01

Pos logit contributions
lly+7.750
 liqu+7.579
 Castro+7.410
ful+6.821
bred+6.653
Neg logit contributions
PDATE-12.252
atism-11.008
ADRA-10.654
 mathemat-10.387
-10.299
Token sidewalk
Token ID24058
Feature activation-3.26e-01

Pos logit contributions
lly+5.755
 liqu+5.685
 Castro+5.557
bred+4.978
ful+4.965
Neg logit contributions
PDATE-7.562
atism-6.846
ADRA-6.580
 mathemat-6.392
-6.338
Token in
Token ID287
Feature activation-3.64e-01

Pos logit contributions
lly+2.379
 liqu+2.362
 Castro+2.304
bred+2.067
ful+2.059
Neg logit contributions
PDATE-3.250
atism-2.970
ADRA-2.812
 mathemat-2.776
-2.747
Token such
Token ID884
Feature activation-8.46e-02

Pos logit contributions
lly+2.797
 liqu+2.768
 Castro+2.732
bred+2.476
ful+2.439
Neg logit contributions
PDATE-3.450
atism-3.149
ADRA-2.941
 mathemat-2.936
-2.903
Token condition
Token ID4006
Feature activation+5.96e-02

Pos logit contributions
lly+0.705
 liqu+0.698
 Castro+0.690
bred+0.625
ful+0.614
Neg logit contributions
PDATE-0.742
atism-0.681
 mathemat-0.630
ADRA-0.630
-0.626
Token://
Token ID1378
Feature activation-2.12e-01

Pos logit contributions
atism+2.408
PDATE+2.360
+2.264
Buff+2.096
ESA+2.057
Neg logit contributions
 liqu-3.499
lly-3.400
 Castro-3.389
bred-3.108
 Honour-3.052
Tokenwww
Token ID2503
Feature activation+2.58e-03

Pos logit contributions
lly+1.726
 liqu+1.696
 Castro+1.673
bred+1.528
ful+1.502
Neg logit contributions
PDATE-1.924
atism-1.746
 mathemat-1.633
ADRA-1.627
-1.605
Token.
Token ID13
Feature activation-3.79e-01

Pos logit contributions
PDATE+0.026
atism+0.025
+0.024
ADRA+0.023
 mathemat+0.023
Neg logit contributions
 liqu-0.017
lly-0.017
 Castro-0.016
bred-0.014
 Honour-0.014
Tokenac
Token ID330
Feature activation+1.05e-01

Pos logit contributions
lly+2.712
 liqu+2.613
 Castro+2.580
bred+2.374
ful+2.323
Neg logit contributions
PDATE-3.872
atism-3.480
 mathemat-3.328
ADRA-3.307
-3.243
Tokenep
Token ID538
Feature activation-1.71e-01

Pos logit contributions
PDATE+1.014
atism+0.968
+0.893
 mathemat+0.892
ADRA+0.892
Neg logit contributions
 liqu-0.747
lly-0.744
 Castro-0.735
bred-0.644
 Honour-0.639
Tokenil
Token ID346
Feature activation-1.18e+00

Pos logit contributions
lly+1.156
 liqu+1.142
 Castro+1.125
bred+0.995
ful+0.973
Neg logit contributions
PDATE-1.762
atism-1.639
 mathemat-1.537
ADRA-1.536
-1.528
Tokenots
Token ID1747
Feature activation-1.59e-01

Pos logit contributions
lly+5.763
 Castro+5.572
 liqu+5.506
ful+4.874
bred+4.710
Neg logit contributions
PDATE-14.425
 mathemat-12.407
ADRA-12.209
 volunte-12.100
-12.060
Token.
Token ID13
Feature activation+3.26e-01

Pos logit contributions
lly+1.001
 liqu+0.990
 Castro+0.975
bred+0.850
ful+0.829
Neg logit contributions
PDATE-1.709
atism-1.600
 mathemat-1.502
ADRA-1.501
-1.496
Tokencom
Token ID785
Feature activation+1.76e-01

Pos logit contributions
PDATE+2.438
atism+2.362
+2.122
 mathemat+2.096
ADRA+2.090
Neg logit contributions
 liqu-2.934
lly-2.912
 Castro-2.907
 Honour-2.616
bred-2.600
Token/
Token ID14
Feature activation-3.29e-02

Pos logit contributions
PDATE+1.716
atism+1.594
+1.487
 mathemat+1.483
ADRA+1.474
Neg logit contributions
 liqu-1.273
lly-1.271
 Castro-1.253
bred-1.118
ful-1.083
Tokenusa
Token ID22064
Feature activation+3.80e-01

Pos logit contributions
lly+0.243
 liqu+0.241
 Castro+0.238
bred+0.212
ful+0.207
Neg logit contributions
PDATE-0.318
atism-0.296
ADRA-0.275
 mathemat-0.275
-0.275
Token\
Token ID59
Feature activation-9.06e-02

Pos logit contributions
lly+1.245
 liqu+1.214
 Castro+1.192
bred+1.001
ful+0.973
Neg logit contributions
PDATE-3.157
atism-2.967
 mathemat-2.823
ADRA-2.816
-2.790
Token[
Token ID58
Feature activation+1.64e-01

Pos logit contributions
lly+0.678
 liqu+0.670
 Castro+0.660
bred+0.592
ful+0.581
Neg logit contributions
PDATE-0.872
atism-0.806
 mathemat-0.753
ADRA-0.752
-0.746
Token+
Token ID10
Feature activation-7.01e-01

Pos logit contributions
PDATE+1.618
atism+1.502
 mathemat+1.402
ADRA+1.402
+1.402
Neg logit contributions
lly-1.171
 liqu-1.160
 Castro-1.142
bred-1.014
 Honour-1.000
Token\
Token ID59
Feature activation-3.84e-01

Pos logit contributions
lly+4.096
 liqu+3.955
 Castro+3.919
bred+3.446
ful+3.426
Neg logit contributions
PDATE-7.702
atism-7.152
 mathemat-6.832
ADRA-6.809
 volunte-6.686
TokenS
Token ID50
Feature activation-2.41e-01

Pos logit contributions
lly+2.539
 liqu+2.467
 Castro+2.447
bred+2.160
ful+2.136
Neg logit contributions
PDATE-4.044
atism-3.735
 mathemat-3.545
ADRA-3.522
-3.474
Token+
Token ID10
Feature activation-1.17e+00

Pos logit contributions
lly+1.838
 liqu+1.812
 Castro+1.796
bred+1.607
ful+1.591
Neg logit contributions
PDATE-2.278
atism-2.104
 mathemat-1.966
ADRA-1.951
-1.935
Token\
Token ID59
Feature activation-6.13e-01

Pos logit contributions
lly+7.097
 Castro+6.753
 liqu+6.753
ful+6.100
bred+5.978
Neg logit contributions
PDATE-11.776
atism-10.882
 mathemat-10.432
ADRA-10.391
 volunte-10.188
Tokens
Token ID82
Feature activation-2.63e-01

Pos logit contributions
lly+3.962
 Castro+3.805
 liqu+3.797
bred+3.356
ful+3.322
Neg logit contributions
PDATE-6.478
atism-5.956
 mathemat-5.703
ADRA-5.631
 volunte-5.575
Token+
Token ID10
Feature activation-1.01e+00

Pos logit contributions
lly+1.990
 liqu+1.960
 Castro+1.947
bred+1.740
ful+1.719
Neg logit contributions
PDATE-2.496
atism-2.306
 mathemat-2.160
ADRA-2.139
-2.121
Token"(
Token ID18109
Feature activation-5.28e-01

Pos logit contributions
lly+6.005
 liqu+5.711
 Castro+5.631
ful+5.054
bred+5.052
Neg logit contributions
PDATE-10.615
atism-9.803
ADRA-9.459
 mathemat-9.448
 volunte-9.191
Token\
Token ID59
Feature activation-1.21e-01

Pos logit contributions
lly+3.156
 liqu+3.080
 Castro+3.038
bred+2.672
ful+2.639
Neg logit contributions
PDATE-5.838
atism-5.399
 mathemat-5.136
ADRA-5.122
-5.032
Token streets
Token ID6483
Feature activation-1.39e-01

Pos logit contributions
lly+3.300
 liqu+3.225
 Castro+3.150
ful+2.736
bred+2.716
Neg logit contributions
PDATE-7.192
atism-6.551
ADRA-6.371
 mathemat-6.341
-6.166
Token of
Token ID286
Feature activation+9.27e-02

Pos logit contributions
lly+0.977
 liqu+0.966
 Castro+0.952
bred+0.846
ful+0.828
Neg logit contributions
PDATE-1.409
atism-1.307
 mathemat-1.225
ADRA-1.224
-1.216
Token the
Token ID262
Feature activation-2.69e-01

Pos logit contributions
PDATE+0.769
atism+0.716
+0.660
ADRA+0.652
 mathemat+0.643
Neg logit contributions
lly-0.795
 liqu-0.790
 Castro-0.776
bred-0.707
 Honour-0.693
Token city
Token ID1748
Feature activation-3.24e-01

Pos logit contributions
lly+1.252
 liqu+1.229
 Castro+1.212
bred+1.006
ful+0.970
Neg logit contributions
PDATE-3.395
atism-3.155
 mathemat-3.028
ADRA-3.027
-2.984
Token,
Token ID11
Feature activation+1.17e-01

Pos logit contributions
lly+1.860
 liqu+1.823
 Castro+1.801
bred+1.557
ful+1.530
Neg logit contributions
PDATE-3.757
atism-3.459
 mathemat-3.327
ADRA-3.294
-3.233
Token or
Token ID393
Feature activation-1.17e+00

Pos logit contributions
PDATE+1.026
atism+0.973
+0.904
ADRA+0.881
 mathemat+0.873
Neg logit contributions
lly-0.920
 liqu-0.909
 Castro-0.898
bred-0.811
 Honour-0.793
Token parked
Token ID19584
Feature activation-1.49e-01

Pos logit contributions
lly+4.393
 liqu+4.282
 Castro+4.141
bred+3.144
ful+3.060
Neg logit contributions
PDATE-15.674
ADRA-14.616
 mathemat-14.568
atism-14.403
-13.936
Token in
Token ID287
Feature activation+1.43e-01

Pos logit contributions
lly+1.107
 liqu+1.096
 Castro+1.081
bred+0.966
ful+0.950
Neg logit contributions
PDATE-1.448
atism-1.335
 mathemat-1.252
ADRA-1.249
-1.236
Token front
Token ID2166
Feature activation-3.69e-01

Pos logit contributions
PDATE+1.071
atism+0.993
+0.903
 mathemat+0.881
ADRA+0.880
Neg logit contributions
lly-1.327
 liqu-1.313
 Castro-1.308
bred-1.190
 Honour-1.173
Token of
Token ID286
Feature activation-1.45e-01

Pos logit contributions
lly+2.455
 liqu+2.415
 Castro+2.387
bred+2.111
ful+2.097
Neg logit contributions
PDATE-3.987
atism-3.605
 mathemat-3.504
ADRA-3.450
 volunte-3.347
Token buildings
Token ID6832
Feature activation+9.46e-02

Pos logit contributions
lly+0.825
 liqu+0.812
 Castro+0.798
bred+0.688
ful+0.669
Neg logit contributions
PDATE-1.653
atism-1.550
 mathemat-1.463
ADRA-1.462
-1.456
Tokenbitcoin
Token ID35395
Feature activation+1.32e-01

Pos logit contributions
PDATE+0.424
atism+0.403
+0.369
 mathemat+0.364
ADRA+0.363
Neg logit contributions
lly-0.425
 liqu-0.423
 Castro-0.420
bred-0.376
 Honour-0.375
Token/
Token ID14
Feature activation+3.93e-02

Pos logit contributions
PDATE+1.299
atism+1.215
 mathemat+1.129
+1.128
ADRA+1.125
Neg logit contributions
lly-0.935
 liqu-0.930
 Castro-0.915
bred-0.818
 Honour-0.796
Tokenh
Token ID71
Feature activation-9.51e-01

Pos logit contributions
PDATE+0.419
atism+0.394
 mathemat+0.369
+0.369
ADRA+0.368
Neg logit contributions
lly-0.249
 liqu-0.246
 Castro-0.243
bred-0.212
 Honour-0.207
Tokenong
Token ID506
Feature activation-2.25e-01

Pos logit contributions
lly+3.735
 liqu+3.185
 Castro+2.996
ful+2.807
bred+2.682
Neg logit contributions
PDATE-13.559
 mathemat-11.939
-11.705
ADRA-11.693
atism-11.563
Token-
Token ID12
Feature activation-3.60e-01

Pos logit contributions
lly+1.254
 liqu+1.230
 Castro+1.209
bred+1.040
ful+1.012
Neg logit contributions
PDATE-2.622
atism-2.438
 mathemat-2.320
ADRA-2.313
-2.287
Tokenk
Token ID74
Feature activation-1.17e+00

Pos logit contributions
lly+2.600
 liqu+2.541
 Castro+2.500
bred+2.261
ful+2.215
Neg logit contributions
PDATE-3.613
atism-3.289
ADRA-3.101
 mathemat-3.092
-3.056
Tokenong
Token ID506
Feature activation-2.71e-01

Pos logit contributions
lly+4.037
 liqu+3.375
 Castro+3.046
ful+3.029
chin+2.828
Neg logit contributions
PDATE-17.513
 mathemat-15.342
-15.271
-15.150
 millenn-15.069
Token/
Token ID14
Feature activation-2.71e-01

Pos logit contributions
lly+1.831
 liqu+1.807
 Castro+1.780
bred+1.563
ful+1.547
Neg logit contributions
PDATE-2.804
atism-2.595
ADRA-2.437
 mathemat-2.434
-2.415
Tokena
Token ID64
Feature activation-1.76e-01

Pos logit contributions
lly+2.052
 liqu+2.031
 Castro+2.005
bred+1.804
ful+1.785
Neg logit contributions
PDATE-2.609
atism-2.374
 mathemat-2.235
ADRA-2.234
-2.192
Token-
Token ID12
Feature activation-2.20e-01

Pos logit contributions
lly+1.150
 liqu+1.137
 Castro+1.117
bred+0.983
ful+0.972
Neg logit contributions
PDATE-1.869
atism-1.736
ADRA-1.631
 mathemat-1.631
-1.618
Tokenflex
Token ID32880
Feature activation+1.33e-01

Pos logit contributions
lly+1.663
 liqu+1.637
 Castro+1.612
bred+1.461
ful+1.441
Neg logit contributions
PDATE-2.122
atism-1.942
ADRA-1.821
 mathemat-1.820
-1.789
Token frame
Token ID5739
Feature activation-2.50e-01

Pos logit contributions
lly+1.503
 liqu+1.503
 Castro+1.466
bred+1.323
ful+1.294
Neg logit contributions
PDATE-2.015
atism-1.846
ADRA-1.741
 mathemat-1.736
-1.708
Token unique
Token ID3748
Feature activation+3.32e-01

Pos logit contributions
lly+1.724
 liqu+1.713
 Castro+1.675
bred+1.495
ful+1.468
Neg logit contributions
PDATE-2.575
atism-2.362
ADRA-2.238
 mathemat-2.228
-2.194
Token to
Token ID284
Feature activation-1.82e-01

Pos logit contributions
PDATE+3.225
atism+3.043
+2.820
 mathemat+2.795
ADRA+2.775
Neg logit contributions
 Castro-2.300
lly-2.290
 liqu-2.281
bred-2.014
 Honour-2.011
Token Ford
Token ID8092
Feature activation-7.49e-02

Pos logit contributions
lly+1.228
 liqu+1.222
 Castro+1.200
bred+1.063
ful+1.040
Neg logit contributions
PDATE-1.896
atism-1.754
 mathemat-1.651
ADRA-1.650
-1.634
Token F
Token ID376
Feature activation-5.58e-01

Pos logit contributions
lly+0.527
 liqu+0.522
 Castro+0.514
bred+0.457
ful+0.449
Neg logit contributions
PDATE-0.758
atism-0.699
ADRA-0.658
 mathemat-0.657
-0.651
Token-
Token ID12
Feature activation-1.17e+00

Pos logit contributions
lly+2.740
 liqu+2.712
 Castro+2.636
bred+2.215
ful+2.203
Neg logit contributions
PDATE-6.841
atism-6.243
 mathemat-6.082
ADRA-5.944
 volunte-5.891
Token150
Token ID8628
Feature activation-9.96e-02

Pos logit contributions
lly+5.240
 liqu+5.213
 Castro+5.102
bred+4.617
ful+4.428
Neg logit contributions
PDATE-13.847
 mathemat-12.662
atism-12.600
ADRA-12.169
 volunte-12.028
Token Rapt
Token ID21301
Feature activation+1.89e-01

Pos logit contributions
lly+0.665
 liqu+0.658
 Castro+0.648
bred+0.573
ful+0.561
Neg logit contributions
PDATE-1.039
atism-0.966
 mathemat-0.908
ADRA-0.906
-0.900
Tokenor
Token ID273
Feature activation-1.81e-01

Pos logit contributions
PDATE+1.627
atism+1.544
+1.415
ADRA+1.405
 mathemat+1.381
Neg logit contributions
lly-1.541
 liqu-1.536
 Castro-1.519
bred-1.350
 Honour-1.334
Token,
Token ID11
Feature activation-1.17e-02

Pos logit contributions
lly+1.260
 liqu+1.248
 Castro+1.228
bred+1.092
ful+1.069
Neg logit contributions
PDATE-1.832
atism-1.696
ADRA-1.593
 mathemat-1.592
-1.579
Token and
Token ID290
Feature activation+5.32e-02

Pos logit contributions
lly+0.087
 liqu+0.086
 Castro+0.085
bred+0.076
ful+0.075
Neg logit contributions
PDATE-0.112
atism-0.104
 mathemat-0.097
ADRA-0.096
-0.096
Token before
Token ID878
Feature activation-4.08e-01

Pos logit contributions
PDATE+0.809
atism+0.754
ADRA+0.705
 mathemat+0.704
+0.702
Neg logit contributions
lly-0.536
 liqu-0.531
 Castro-0.522
bred-0.462
 Honour-0.449
Token {
Token ID1391
Feature activation-6.69e-01

Pos logit contributions
lly+2.741
 liqu+2.714
 Castro+2.648
bred+2.348
ful+2.334
Neg logit contributions
PDATE-4.212
atism-3.953
ADRA-3.695
 mathemat-3.686
-3.664
Tokenvers
Token ID690
Feature activation+1.12e-01

Pos logit contributions
lly+3.854
 liqu+3.704
 Castro+3.599
bred+3.247
ful+3.231
Neg logit contributions
PDATE-7.493
atism-6.936
ADRA-6.680
 mathemat-6.645
-6.519
Token}
Token ID92
Feature activation-3.30e-01

Pos logit contributions
PDATE+1.239
atism+1.162
ADRA+1.094
 mathemat+1.093
+1.091
Neg logit contributions
lly-0.671
 liqu-0.662
 Castro-0.652
bred-0.566
 Honour-0.552
Token vim
Token ID43907
Feature activation+3.79e-02

Pos logit contributions
lly+2.306
 liqu+2.290
 Castro+2.251
ful+1.981
bred+1.980
Neg logit contributions
PDATE-3.351
atism-3.094
ADRA-2.914
 mathemat-2.905
-2.895
Token={
Token ID34758
Feature activation-1.15e+00

Pos logit contributions
PDATE+0.369
atism+0.342
ADRA+0.318
 mathemat+0.318
+0.317
Neg logit contributions
lly-0.278
 liqu-0.275
 Castro-0.272
bred-0.242
 Honour-0.237
Tokenvers
Token ID690
Feature activation-1.58e-02

Pos logit contributions
lly+7.722
 liqu+7.294
 Castro+7.058
bred+6.856
ful+6.830
Neg logit contributions
PDATE-11.103
atism-9.942
ADRA-9.761
 mathemat-9.711
-9.339
Token}:
Token ID38362
Feature activation-1.85e-01

Pos logit contributions
lly+0.103
 liqu+0.102
 Castro+0.100
bred+0.088
 Honour+0.086
Neg logit contributions
PDATE-0.167
atism-0.156
ADRA-0.146
 mathemat-0.146
-0.146
Token version
Token ID2196
Feature activation-6.88e-02

Pos logit contributions
lly+1.355
 liqu+1.342
 Castro+1.320
bred+1.179
ful+1.161
Neg logit contributions
PDATE-1.811
atism-1.678
ADRA-1.569
 mathemat-1.569
-1.560
Token {
Token ID1391
Feature activation-5.44e-01

Pos logit contributions
lly+0.457
 liqu+0.452
 Castro+0.445
bred+0.392
ful+0.384
Neg logit contributions
PDATE-0.719
atism-0.669
 mathemat-0.628
ADRA-0.628
-0.624
Tokenvers
Token ID690
Feature activation+7.36e-02

Pos logit contributions
lly+2.608
 liqu+2.487
 Castro+2.415
bred+2.133
ful+2.083
Neg logit contributions
PDATE-6.679
atism-6.210
ADRA-5.982
 mathemat-5.967
-5.886
Token,
Token ID11
Feature activation-1.40e-01

Pos logit contributions
lly+1.930
 liqu+1.891
 Castro+1.868
bred+1.616
ful+1.586
Neg logit contributions
PDATE-3.910
atism-3.590
 mathemat-3.460
ADRA-3.415
-3.355
Token your
Token ID534
Feature activation-2.29e-01

Pos logit contributions
lly+0.963
 liqu+0.950
 Castro+0.937
bred+0.830
ful+0.815
Neg logit contributions
PDATE-1.453
atism-1.341
 mathemat-1.268
ADRA-1.262
-1.248
Token character
Token ID2095
Feature activation-2.50e-01

Pos logit contributions
lly+1.587
 liqu+1.569
 Castro+1.545
bred+1.368
ful+1.342
Neg logit contributions
PDATE-2.356
atism-2.164
 mathemat-2.044
ADRA-2.042
-2.016
Token now
Token ID783
Feature activation-8.00e-02

Pos logit contributions
lly+1.929
 liqu+1.908
 Castro+1.884
bred+1.693
ful+1.661
Neg logit contributions
PDATE-2.339
atism-2.159
 mathemat-2.009
ADRA-2.008
-1.996
Token receives
Token ID11583
Feature activation-3.35e-01

Pos logit contributions
lly+0.611
 liqu+0.605
 Castro+0.598
bred+0.538
 Honour+0.525
Neg logit contributions
PDATE-0.751
atism-0.696
ADRA-0.646
 mathemat-0.645
-0.643
Token secret
Token ID3200
Feature activation-1.14e+00

Pos logit contributions
lly+2.151
 liqu+2.138
 Castro+2.093
bred+1.819
ful+1.809
Neg logit contributions
PDATE-3.655
atism-3.348
 mathemat-3.203
ADRA-3.191
-3.112
Token voices
Token ID10839
Feature activation+6.48e-02

Pos logit contributions
lly+5.242
 liqu+5.024
 Castro+4.691
ful+4.021
 Chin+3.948
Neg logit contributions
PDATE-14.724
atism-13.262
ADRA-13.120
 mathemat-13.086
 teasp-12.746
Token in
Token ID287
Feature activation+1.67e-01

Pos logit contributions
PDATE+0.567
atism+0.539
+0.504
ADRA+0.488
 mathemat+0.482
Neg logit contributions
lly-0.512
 liqu-0.506
 Castro-0.498
bred-0.452
 Honour-0.444
Token his
Token ID465
Feature activation+1.99e-01

Pos logit contributions
PDATE+1.468
atism+1.404
+1.300
ADRA+1.273
 mathemat+1.260
Neg logit contributions
lly-1.308
 liqu-1.294
 Castro-1.280
bred-1.145
 Honour-1.130
Token or
Token ID393
Feature activation+1.43e-01

Pos logit contributions
PDATE+1.668
atism+1.612
+1.488
ADRA+1.441
 mathemat+1.437
Neg logit contributions
lly-1.604
 liqu-1.583
 Castro-1.570
bred-1.424
ful-1.379
Token her
Token ID607
Feature activation+1.89e-01

Pos logit contributions
PDATE+1.142
atism+1.100
+0.999
ADRA+0.979
 mathemat+0.971
Neg logit contributions
lly-1.213
 liqu-1.200
 Castro-1.191
bred-1.088
 Honour-1.069
Token posted
Token ID4481
Feature activation-4.73e-02

Pos logit contributions
lly+1.421
 liqu+1.408
 Castro+1.382
bred+1.219
ful+1.200
Neg logit contributions
PDATE-2.173
atism-2.011
ADRA-1.897
 mathemat-1.897
-1.876
Token by
Token ID416
Feature activation-1.62e-01

Pos logit contributions
lly+0.394
 liqu+0.391
 Castro+0.385
bred+0.351
 Honour+0.344
Neg logit contributions
PDATE-0.402
atism-0.377
-0.346
ADRA-0.341
 mathemat-0.340
Token James
Token ID3700
Feature activation-2.83e-01

Pos logit contributions
lly+1.224
 liqu+1.213
 Castro+1.199
bred+1.072
ful+1.052
Neg logit contributions
PDATE-1.559
atism-1.436
ADRA-1.342
 mathemat-1.339
-1.330
Token Harrison
Token ID17281
Feature activation-7.82e-01

Pos logit contributions
lly+2.201
 liqu+2.171
 Castro+2.167
bred+1.936
ful+1.900
Neg logit contributions
PDATE-2.669
atism-2.434
 mathemat-2.283
ADRA-2.270
-2.239
Token (@
Token ID4275
Feature activation-6.08e-01

Pos logit contributions
lly+4.326
 Castro+4.317
 liqu+4.219
bred+3.621
ful+3.567
Neg logit contributions
PDATE-8.733
atism-8.260
 mathemat-7.846
ADRA-7.838
 volunte-7.601
Tokenj
Token ID73
Feature activation-1.14e+00

Pos logit contributions
lly+4.549
 liqu+4.377
 Castro+4.239
bred+4.026
ful+3.966
Neg logit contributions
PDATE-5.939
atism-5.250
 mathemat-4.977
ADRA-4.948
 volunte-4.846
Tokenhh
Token ID12337
Feature activation-6.86e-01

Pos logit contributions
lly+7.723
 liqu+7.123
bred+6.718
 Castro+6.564
ful+6.548
Neg logit contributions
PDATE-11.909
atism-10.106
ADRA-10.061
-9.909
 mathemat-9.841
Tokenarrison
Token ID22472
Feature activation-4.72e-01

Pos logit contributions
lly+4.385
 liqu+4.191
 Castro+4.078
bred+3.767
ful+3.690
Neg logit contributions
PDATE-7.579
atism-6.730
 mathemat-6.570
ADRA-6.459
-6.373
Token92
Token ID5892
Feature activation-5.69e-01

Pos logit contributions
lly+3.319
 liqu+3.227
 Castro+3.187
bred+2.930
ful+2.850
Neg logit contributions
PDATE-4.773
atism-4.372
 mathemat-4.159
ADRA-4.078
-4.051
Token)
Token ID8
Feature activation-4.39e-01

Pos logit contributions
lly+3.904
 liqu+3.762
 Castro+3.722
bred+3.381
ful+3.288
Neg logit contributions
PDATE-5.766
atism-5.400
 mathemat-5.095
ADRA-5.057
-5.001
Token on
Token ID319
Feature activation-3.50e-01

Pos logit contributions
lly+2.440
 liqu+2.387
 Castro+2.343
ful+1.995
bred+1.963
Neg logit contributions
PDATE-5.279
atism-4.832
 mathemat-4.703
ADRA-4.684
 volunte-4.548
Token shows
Token ID2523
Feature activation-6.76e-02

Pos logit contributions
lly+0.910
 liqu+0.899
 Castro+0.887
bred+0.791
ful+0.774
Neg logit contributions
PDATE-1.242
atism-1.152
 mathemat-1.077
ADRA-1.076
-1.071
Token in
Token ID287
Feature activation+1.16e-01

Pos logit contributions
lly+0.467
 liqu+0.462
 Castro+0.457
bred+0.403
ful+0.396
Neg logit contributions
PDATE-0.691
atism-0.641
ADRA-0.600
 mathemat-0.599
-0.596
Token order
Token ID1502
Feature activation-2.78e-01

Pos logit contributions
PDATE+1.020
atism+0.939
 mathemat+0.871
ADRA+0.869
+0.867
Neg logit contributions
lly-0.951
 liqu-0.939
 Castro-0.926
bred-0.837
ful-0.825
Token to
Token ID284
Feature activation-2.69e-01

Pos logit contributions
lly+1.700
 liqu+1.678
 Castro+1.647
bred+1.430
ful+1.408
Neg logit contributions
PDATE-3.048
atism-2.848
 mathemat-2.682
ADRA-2.680
-2.661
Token win
Token ID1592
Feature activation+2.68e-01

Pos logit contributions
lly+2.138
 liqu+2.132
 Castro+2.078
bred+1.856
ful+1.852
Neg logit contributions
PDATE-2.537
atism-2.267
ADRA-2.138
-2.115
 mathemat-2.113
Token.[
Token ID3693
Feature activation-1.12e+00

Pos logit contributions
PDATE+2.476
atism+2.287
 mathemat+2.158
ADRA+2.131
+2.120
Neg logit contributions
lly-2.043
 Castro-1.985
 liqu-1.983
bred-1.796
ful-1.729
Token156
Token ID21599
Feature activation-3.12e-01

Pos logit contributions
lly+5.946
 Castro+5.550
 liqu+5.384
bred+4.906
ful+4.806
Neg logit contributions
PDATE-12.701
atism-11.825
 mathemat-11.161
ADRA-11.123
 volunte-11.093
Token]
Token ID60
Feature activation+8.93e-02

Pos logit contributions
lly+1.200
 liqu+1.164
 Castro+1.137
bred+0.904
ful+0.871
Neg logit contributions
PDATE-4.124
atism-3.907
 mathemat-3.722
ADRA-3.712
-3.691
Token\n
Token ID198
Feature activation-1.36e-01

Pos logit contributions
PDATE+0.871
atism+0.811
+0.761
ADRA+0.759
 mathemat+0.749
Neg logit contributions
lly-0.640
 liqu-0.637
 Castro-0.616
bred-0.559
ful-0.542
Token\n
Token ID198
Feature activation-2.45e-01

Pos logit contributions
lly+0.960
 liqu+0.950
 Castro+0.936
bred+0.832
ful+0.814
Neg logit contributions
PDATE-1.362
atism-1.266
ADRA-1.183
 mathemat-1.182
-1.179
TokenSound
Token ID21369
Feature activation-1.51e-01

Pos logit contributions
lly+2.073
 liqu+2.029
 Castro+2.009
bred+1.841
ful+1.812
Neg logit contributions
PDATE-2.178
atism-1.955
 mathemat-1.852
ADRA-1.804
 volunte-1.785
Token character
Token ID2095
Feature activation-6.66e-01

Pos logit contributions
lly+1.803
 liqu+1.761
 Castro+1.754
bred+1.546
ful+1.527
Neg logit contributions
PDATE-2.885
atism-2.626
 mathemat-2.527
ADRA-2.507
-2.444
Token Erin
Token ID28894
Feature activation-4.39e-01

Pos logit contributions
lly+4.494
 liqu+4.255
 Castro+4.201
ful+3.813
bred+3.760
Neg logit contributions
PDATE-6.884
atism-6.291
ADRA-6.081
 mathemat-5.983
-5.825
Token is
Token ID318
Feature activation-1.97e-01

Pos logit contributions
lly+2.721
 liqu+2.655
 Castro+2.635
ful+2.287
bred+2.266
Neg logit contributions
PDATE-5.041
atism-4.499
 mathemat-4.391
ADRA-4.383
-4.210
Token a
Token ID257
Feature activation-6.42e-01

Pos logit contributions
lly+1.290
 liqu+1.274
 Castro+1.256
bred+1.108
ful+1.084
Neg logit contributions
PDATE-2.122
atism-1.947
 mathemat-1.854
ADRA-1.848
-1.813
Token professor
Token ID6240
Feature activation-3.60e-02

Pos logit contributions
lly+3.297
 liqu+3.254
 Castro+3.088
ful+2.805
bred+2.765
Neg logit contributions
PDATE-7.766
atism-7.134
ADRA-7.133
 mathemat-6.872
-6.729
Token at
Token ID379
Feature activation-1.11e+00

Pos logit contributions
lly+0.276
 liqu+0.273
 Castro+0.269
bred+0.242
ful+0.238
Neg logit contributions
PDATE-0.341
atism-0.314
 mathemat-0.293
ADRA-0.293
-0.290
Token Columbia
Token ID9309
Feature activation+1.46e-01

Pos logit contributions
lly+5.333
 Castro+5.262
 liqu+4.978
bred+4.671
ful+4.430
Neg logit contributions
PDATE-12.808
ADRA-11.544
atism-11.484
 mathemat-11.143
-10.944
Token University
Token ID2059
Feature activation+1.48e-01

Pos logit contributions
PDATE+1.278
atism+1.227
+1.123
 mathemat+1.097
ADRA+1.093
Neg logit contributions
lly-1.144
 liqu-1.127
 Castro-1.106
bred-1.002
 Honour-0.973
Token,
Token ID11
Feature activation+1.40e-01

Pos logit contributions
PDATE+1.467
atism+1.427
+1.347
ADRA+1.311
Buff+1.291
Neg logit contributions
 liqu-0.966
lly-0.963
 Castro-0.937
bred-0.816
 Honour-0.813
Token where
Token ID810
Feature activation+2.23e-02

Pos logit contributions
PDATE+1.307
atism+1.221
+1.131
 mathemat+1.129
ADRA+1.126
Neg logit contributions
lly-1.063
 liqu-1.054
 Castro-1.038
bred-0.929
 Honour-0.914
Token some
Token ID617
Feature activation-4.53e-01

Pos logit contributions
PDATE+0.162
atism+0.147
 mathemat+0.133
+0.132
ADRA+0.132
Neg logit contributions
lly-0.217
 liqu-0.216
 Castro-0.214
bred-0.197
 Honour-0.194
Token =
Token ID796
Feature activation+4.25e-02

Pos logit contributions
PDATE+1.428
atism+1.365
+1.269
ADRA+1.249
 mathemat+1.210
Neg logit contributions
 liqu-1.249
lly-1.245
 Castro-1.220
bred-1.092
 Honour-1.079
Token [
Token ID685
Feature activation-3.36e-01

Pos logit contributions
PDATE+0.381
atism+0.350
 mathemat+0.326
ADRA+0.323
+0.323
Neg logit contributions
lly-0.347
 liqu-0.342
 Castro-0.338
bred-0.307
ful-0.300
Tokensystem
Token ID10057
Feature activation-2.31e-01

Pos logit contributions
lly+2.094
 liqu+2.038
 Castro+1.991
bred+1.796
ful+1.760
Neg logit contributions
PDATE-3.766
atism-3.378
 mathemat-3.291
ADRA-3.274
 volunte-3.184
Token.
Token ID13
Feature activation-2.61e-01

Pos logit contributions
lly+1.414
 liqu+1.393
 Castro+1.370
bred+1.195
ful+1.167
Neg logit contributions
PDATE-2.552
atism-2.377
 mathemat-2.246
ADRA-2.242
-2.226
Tokenwindows
Token ID28457
Feature activation-2.79e-01

Pos logit contributions
lly+1.967
 liqu+1.942
 Castro+1.911
bred+1.725
ful+1.696
Neg logit contributions
PDATE-2.514
atism-2.305
ADRA-2.163
 mathemat-2.162
-2.137
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
lly+1.675
 liqu+1.644
 Castro+1.618
bred+1.411
ful+1.383
Neg logit contributions
PDATE-3.135
atism-2.894
 mathemat-2.762
ADRA-2.748
-2.709
Tokenforms
Token ID23914
Feature activation-5.11e-01

Pos logit contributions
lly+7.121
 liqu+6.825
 Castro+6.587
ful+6.460
bred+6.170
Neg logit contributions
PDATE-11.651
ADRA-10.028
atism-10.012
 mathemat-9.883
-9.710
Token.
Token ID13
Feature activation-8.86e-01

Pos logit contributions
lly+2.841
 liqu+2.731
 Castro+2.692
bred+2.359
ful+2.351
Neg logit contributions
PDATE-6.227
atism-5.527
 mathemat-5.508
ADRA-5.401
 volunte-5.337
Tokentext
Token ID5239
Feature activation-1.80e-01

Pos logit contributions
lly+5.265
 liqu+5.066
 Castro+4.787
ful+4.570
fully+4.407
Neg logit contributions
PDATE-9.757
atism-8.718
ADRA-8.560
 mathemat-8.502
-8.289
Tokenrend
Token ID10920
Feature activation-1.49e-01

Pos logit contributions
lly+1.614
 liqu+1.597
 Castro+1.577
bred+1.444
ful+1.425
Neg logit contributions
PDATE-1.487
atism-1.340
 mathemat-1.244
ADRA-1.241
-1.227
Tokenerer
Token ID11882
Feature activation-1.10e+00

Pos logit contributions
lly+0.778
 liqu+0.761
 Castro+0.747
bred+0.636
ful+0.620
Neg logit contributions
PDATE-1.795
atism-1.663
 mathemat-1.596
ADRA-1.583
-1.570
Token.
Token ID13
Feature activation-1.11e+00

Pos logit contributions
lly+1.675
 liqu+1.644
 Castro+1.618
bred+1.411
ful+1.383
Neg logit contributions
PDATE-3.135
atism-2.894
 mathemat-2.762
ADRA-2.748
-2.709
Tokenforms
Token ID23914
Feature activation-5.11e-01

Pos logit contributions
lly+7.121
 liqu+6.825
 Castro+6.587
ful+6.460
bred+6.170
Neg logit contributions
PDATE-11.651
ADRA-10.028
atism-10.012
 mathemat-9.883
-9.710
Token.
Token ID13
Feature activation-8.86e-01

Pos logit contributions
lly+2.841
 liqu+2.731
 Castro+2.692
bred+2.359
ful+2.351
Neg logit contributions
PDATE-6.227
atism-5.527
 mathemat-5.508
ADRA-5.401
 volunte-5.337
Tokentext
Token ID5239
Feature activation-1.80e-01

Pos logit contributions
lly+5.265
 liqu+5.066
 Castro+4.787
ful+4.570
fully+4.407
Neg logit contributions
PDATE-9.757
atism-8.718
ADRA-8.560
 mathemat-8.502
-8.289
Tokenrend
Token ID10920
Feature activation-1.49e-01

Pos logit contributions
lly+1.614
 liqu+1.597
 Castro+1.577
bred+1.444
ful+1.425
Neg logit contributions
PDATE-1.487
atism-1.340
 mathemat-1.244
ADRA-1.241
-1.227
Tokenerer
Token ID11882
Feature activation-1.10e+00

Pos logit contributions
lly+0.778
 liqu+0.761
 Castro+0.747
bred+0.636
ful+0.620
Neg logit contributions
PDATE-1.795
atism-1.663
 mathemat-1.596
ADRA-1.583
-1.570
Token]
Token ID60
Feature activation-2.69e-01

Pos logit contributions
lly+2.902
 liqu+2.694
 Castro+2.647
ful+2.118
bred+1.941
Neg logit contributions
PDATE-16.528
 mathemat-15.177
-14.892
-14.830
aditional-14.740
Token::
Token ID3712
Feature activation-7.05e-01

Pos logit contributions
lly+1.385
 liqu+1.338
 Castro+1.324
bred+1.130
ful+1.112
Neg logit contributions
PDATE-3.292
atism-3.030
 mathemat-2.933
ADRA-2.899
-2.843
TokenMeasure
Token ID47384
Feature activation-6.25e-02

Pos logit contributions
lly+5.997
 liqu+5.778
 Castro+5.764
ful+5.315
bred+5.224
Neg logit contributions
PDATE-5.952
atism-5.386
 mathemat-5.095
 volunte-4.985
ADRA-4.959
TokenText
Token ID8206
Feature activation+3.28e-01

Pos logit contributions
lly+0.556
 liqu+0.552
 Castro+0.545
bred+0.498
ful+0.489
Neg logit contributions
PDATE-0.513
atism-0.468
ADRA-0.430
 mathemat-0.430
-0.427
Token($
Token ID16763
Feature activation+4.79e-01

Pos logit contributions
PDATE+2.853
atism+2.643
ADRA+2.438
+2.435
 mathemat+2.411
Neg logit contributions
lly-2.633
 liqu-2.631
 Castro-2.626
bred-2.353
 Honour-2.317
Token is
Token ID318
Feature activation+5.69e-02

Pos logit contributions
lly+0.896
 liqu+0.886
 Castro+0.873
bred+0.776
ful+0.760
Neg logit contributions
PDATE-1.285
atism-1.193
 mathemat-1.117
ADRA-1.116
-1.110
Token history
Token ID2106
Feature activation-1.35e-01

Pos logit contributions
PDATE+0.545
atism+0.504
 mathemat+0.471
ADRA+0.471
+0.467
Neg logit contributions
lly-0.425
 liqu-0.420
 Castro-0.414
bred-0.371
ful-0.363
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
lly+0.838
 liqu+0.826
 Castro+0.813
bred+0.708
ful+0.696
Neg logit contributions
PDATE-1.479
atism-1.384
 mathemat-1.305
ADRA-1.304
-1.296
Token<|endoftext|>
Token ID50256
Feature activation+9.98e-02

Pos logit contributions
lly+2.325
 Castro+2.314
 liqu+2.300
bred+2.027
ful+1.992
Neg logit contributions
PDATE-3.439
atism-3.167
 mathemat-3.030
ADRA-2.984
-2.935
TokenAbout
Token ID8585
Feature activation-4.81e-01

Pos logit contributions
PDATE+0.836
atism+0.778
ADRA+0.718
+0.718
 mathemat+0.706
Neg logit contributions
lly-0.852
 liqu-0.850
 Castro-0.839
bred-0.755
 Honour-0.746
Token This
Token ID770
Feature activation-1.09e+00

Pos logit contributions
lly+3.259
 liqu+3.225
 Castro+3.205
bred+2.799
 Honour+2.784
Neg logit contributions
PDATE-4.956
atism-4.549
ADRA-4.311
 mathemat-4.278
-4.271
Token Game
Token ID3776
Feature activation-8.76e-02

Pos logit contributions
lly+5.043
 liqu+4.705
 Castro+4.585
 Sims+4.005
 Honour+3.950
Neg logit contributions
PDATE-13.171
ADRA-11.847
atism-11.771
 volunte-11.543
 mathemat-11.519
Token\n
Token ID198
Feature activation-3.18e-01

Pos logit contributions
lly+0.630
 liqu+0.624
 Castro+0.614
bred+0.548
ful+0.536
Neg logit contributions
PDATE-0.866
atism-0.804
 mathemat-0.751
ADRA-0.750
-0.747
Token\n
Token ID198
Feature activation-2.31e-01

Pos logit contributions
lly+2.135
 liqu+2.093
 Castro+2.078
bred+1.840
ful+1.818
Neg logit contributions
PDATE-3.413
atism-3.089
 mathemat-3.000
ADRA-2.941
 volunte-2.888
TokenChoose
Token ID31851
Feature activation+2.77e-01

Pos logit contributions
lly+1.919
 liqu+1.889
 Castro+1.866
bred+1.705
ful+1.679
Neg logit contributions
PDATE-2.044
atism-1.862
 mathemat-1.740
ADRA-1.718
-1.697
Token from
Token ID422
Feature activation+7.28e-03

Pos logit contributions
PDATE+2.593
atism+2.427
ADRA+2.256
 mathemat+2.251
+2.234
Neg logit contributions
lly-2.079
 liqu-2.052
 Castro-2.019
bred-1.809
 Honour-1.771
Token.
Token ID13
Feature activation-1.72e-01

Pos logit contributions
lly+1.780
 liqu+1.763
 Castro+1.706
bred+1.489
ful+1.459
Neg logit contributions
PDATE-3.521
atism-3.321
 mathemat-3.140
ADRA-3.115
-3.101
Token5
Token ID20
Feature activation+3.43e-01

Pos logit contributions
lly+1.060
 liqu+1.045
 Castro+1.030
bred+0.898
ful+0.880
Neg logit contributions
PDATE-1.886
atism-1.759
 mathemat-1.664
ADRA-1.651
-1.639
Token-
Token ID12
Feature activation+2.31e-01

Pos logit contributions
PDATE+3.695
atism+3.414
+3.232
 mathemat+3.208
ADRA+3.194
Neg logit contributions
lly-2.115
 Castro-2.078
 liqu-2.060
bred-1.785
 Honour-1.752
Tokenliter
Token ID17201
Feature activation-3.93e-01

Pos logit contributions
PDATE+1.979
atism+1.815
+1.677
 mathemat+1.672
ADRA+1.668
Neg logit contributions
lly-1.944
 Castro-1.918
 liqu-1.918
bred-1.714
 Honour-1.694
Token Eco
Token ID38719
Feature activation-3.29e-01

Pos logit contributions
 liqu+4.371
lly+4.361
 Castro+4.314
bred+4.033
ful+3.976
Neg logit contributions
PDATE-2.377
atism-2.038
 mathemat-1.820
ADRA-1.811
-1.782
TokenBoost
Token ID45686
Feature activation-1.06e+00

Pos logit contributions
lly+2.577
 liqu+2.537
 Castro+2.514
bred+2.284
ful+2.254
Neg logit contributions
PDATE-3.082
atism-2.804
 mathemat-2.688
ADRA-2.616
 volunte-2.560
Token®
Token ID7461
Feature activation-6.96e-01

Pos logit contributions
 liqu+7.902
lly+7.810
 Castro+7.576
bred+7.222
ful+7.062
Neg logit contributions
PDATE-9.454
atism-8.715
ADRA-8.173
 mathemat-8.138
 volunte-7.842
Token engine
Token ID3113
Feature activation-3.00e-01

Pos logit contributions
 liqu+4.524
lly+4.457
 Castro+4.338
bred+3.957
ful+3.856
Neg logit contributions
PDATE-7.241
atism-6.695
ADRA-6.337
 mathemat-6.287
-6.186
Token that
Token ID326
Feature activation-4.14e-01

Pos logit contributions
lly+1.961
 liqu+1.944
 Castro+1.907
bred+1.685
ful+1.656
Neg logit contributions
PDATE-3.185
atism-2.953
ADRA-2.792
 mathemat-2.778
-2.754
Token produces
Token ID11073
Feature activation-1.60e-01

Pos logit contributions
 liqu+3.138
lly+3.135
 Castro+3.052
bred+2.696
ful+2.684
Neg logit contributions
PDATE-4.035
atism-3.643
ADRA-3.440
 mathemat-3.424
-3.403
Token more
Token ID517
Feature activation-2.81e-02

Pos logit contributions
lly+1.144
 liqu+1.141
 Castro+1.118
bred+0.994
ful+0.975
Neg logit contributions
PDATE-1.601
atism-1.473
 mathemat-1.386
ADRA-1.384
-1.377
Token}
Token ID92
Feature activation-5.27e-02

Pos logit contributions
PDATE+1.292
atism+1.211
ADRA+1.138
 mathemat+1.135
+1.135
Neg logit contributions
lly-0.739
 liqu-0.734
 Castro-0.721
bred-0.628
 Honour-0.616
Token or
Token ID393
Feature activation-7.38e-01

Pos logit contributions
lly+0.385
 liqu+0.381
 Castro+0.375
bred+0.335
ful+0.329
Neg logit contributions
PDATE-0.517
atism-0.478
 mathemat-0.447
ADRA-0.447
-0.444
Token later
Token ID1568
Feature activation-3.50e-02

Pos logit contributions
lly+4.910
 liqu+4.903
 Castro+4.716
ful+4.182
bred+4.166
Neg logit contributions
PDATE-7.439
atism-6.924
-6.535
ADRA-6.527
 mathemat-6.489
Token vim
Token ID43907
Feature activation-1.76e-01

Pos logit contributions
lly+0.250
 liqu+0.248
 Castro+0.244
bred+0.218
ful+0.213
Neg logit contributions
PDATE-0.347
atism-0.322
 mathemat-0.301
ADRA-0.301
-0.299
Token<
Token ID27
Feature activation-2.07e-01

Pos logit contributions
lly+1.031
 liqu+1.012
 Castro+0.993
bred+0.864
ful+0.850
Neg logit contributions
PDATE-1.982
atism-1.850
 mathemat-1.751
ADRA-1.748
-1.736
Token{
Token ID90
Feature activation-1.05e+00

Pos logit contributions
lly+1.380
 liqu+1.348
 Castro+1.332
bred+1.189
ful+1.163
Neg logit contributions
PDATE-2.178
atism-2.018
 mathemat-1.907
ADRA-1.898
-1.878
Tokenvers
Token ID690
Feature activation-8.08e-02

Pos logit contributions
lly+7.141
 liqu+6.735
 Castro+6.589
ful+6.275
bred+6.260
Neg logit contributions
PDATE-10.229
atism-9.239
 mathemat-9.017
ADRA-8.951
-8.586
Token}:
Token ID38362
Feature activation-3.70e-01

Pos logit contributions
lly+0.510
 liqu+0.503
 Castro+0.495
bred+0.433
ful+0.423
Neg logit contributions
PDATE-0.873
atism-0.814
 mathemat-0.766
ADRA-0.766
-0.762
Token version
Token ID2196
Feature activation+7.89e-02

Pos logit contributions
lly+2.664
 liqu+2.641
 Castro+2.594
bred+2.326
ful+2.305
Neg logit contributions
PDATE-3.628
atism-3.364
ADRA-3.152
 mathemat-3.151
-3.135
Token before
Token ID878
Feature activation-4.08e-01

Pos logit contributions
PDATE+0.809
atism+0.754
ADRA+0.705
 mathemat+0.704
+0.702
Neg logit contributions
lly-0.536
 liqu-0.531
 Castro-0.522
bred-0.462
 Honour-0.449
Token {
Token ID1391
Feature activation-6.69e-01

Pos logit contributions
lly+2.741
 liqu+2.714
 Castro+2.648
bred+2.348
ful+2.334
Neg logit contributions
PDATE-4.212
atism-3.953
ADRA-3.695
 mathemat-3.686
-3.664
Token created
Token ID2727
Feature activation+9.82e-02

Pos logit contributions
lly+0.722
 liqu+0.718
 Castro+0.712
bred+0.637
ful+0.626
Neg logit contributions
PDATE-0.872
atism-0.797
ADRA-0.745
 mathemat-0.742
-0.740
Token such
Token ID884
Feature activation-7.80e-03

Pos logit contributions
PDATE+0.903
atism+0.840
 mathemat+0.780
ADRA+0.774
+0.774
Neg logit contributions
lly-0.767
 liqu-0.757
 Castro-0.746
bred-0.675
ful-0.659
Token music
Token ID2647
Feature activation+2.44e-01

Pos logit contributions
lly+0.060
 liqu+0.059
 Castro+0.058
bred+0.052
ful+0.051
Neg logit contributions
PDATE-0.073
atism-0.068
 mathemat-0.063
ADRA-0.063
-0.063
Token (
Token ID357
Feature activation-2.81e-01

Pos logit contributions
PDATE+2.447
atism+2.327
 mathemat+2.146
+2.142
ADRA+2.122
Neg logit contributions
lly-1.656
 liqu-1.638
 Castro-1.617
bred-1.435
 Honour-1.390
TokenNe
Token ID8199
Feature activation+1.60e-01

Pos logit contributions
lly+2.319
 liqu+2.274
 Castro+2.268
bred+2.068
ful+2.035
Neg logit contributions
PDATE-2.511
atism-2.264
 mathemat-2.122
ADRA-2.107
-2.078
Tokenh
Token ID71
Feature activation-1.05e+00

Pos logit contributions
PDATE+1.737
atism+1.682
+1.573
ADRA+1.552
 mathemat+1.542
Neg logit contributions
 liqu-0.924
lly-0.913
 Castro-0.906
bred-0.775
 Honour-0.751
Tokenring
Token ID1806
Feature activation-3.89e-01

Pos logit contributions
lly+7.113
 liqu+6.599
 Castro+6.469
bred+6.267
ful+6.194
Neg logit contributions
PDATE-11.915
 mathemat-10.178
-10.161
-10.143
 teasp-9.881
Token 99
Token ID7388
Feature activation+1.41e-01

Pos logit contributions
lly+2.838
 liqu+2.803
 Castro+2.767
bred+2.458
ful+2.424
Neg logit contributions
PDATE-3.820
atism-3.500
ADRA-3.294
 mathemat-3.289
-3.263
Token).
Token ID737
Feature activation+4.75e-02

Pos logit contributions
PDATE+1.252
atism+1.163
+1.079
ADRA+1.077
 mathemat+1.065
Neg logit contributions
lly-1.132
 liqu-1.128
 Castro-1.106
bred-1.003
 Honour-0.979
Token\n
Token ID198
Feature activation-6.22e-03

Pos logit contributions
PDATE+0.470
atism+0.437
ADRA+0.408
 mathemat+0.407
+0.407
Neg logit contributions
lly-0.340
 liqu-0.336
 Castro-0.330
bred-0.295
ful-0.288
Token\n
Token ID198
Feature activation-3.61e-02

Pos logit contributions
lly+0.045
 liqu+0.044
 Castro+0.044
bred+0.039
ful+0.038
Neg logit contributions
PDATE-0.062
atism-0.057
ADRA-0.053
 mathemat-0.053
-0.053
Token Photo
Token ID5555
Feature activation+5.22e-01

Pos logit contributions
lly+0.166
 liqu+0.164
 Castro+0.160
bred+0.148
ful+0.144
Neg logit contributions
PDATE-0.153
atism-0.142
-0.131
ADRA-0.129
 mathemat-0.128
Token:
Token ID25
Feature activation-3.64e-01

Pos logit contributions
atism+3.813
+3.749
PDATE+3.629
Buff+3.557
ESA+3.435
Neg logit contributions
 liqu-4.329
lly-4.292
 Castro-4.138
bred-3.774
 Honour-3.737
Token Erin
Token ID28894
Feature activation-2.64e-01

Pos logit contributions
lly+2.840
 Castro+2.802
 liqu+2.801
bred+2.480
 Chin+2.460
Neg logit contributions
PDATE-3.388
atism-3.111
 mathemat-2.909
ADRA-2.899
-2.872
Token Bre
Token ID3719
Feature activation-3.23e-02

Pos logit contributions
lly+1.930
 Castro+1.910
 liqu+1.900
bred+1.696
ful+1.662
Neg logit contributions
PDATE-2.637
atism-2.394
 mathemat-2.282
ADRA-2.258
-2.211
Tokenth
Token ID400
Feature activation-2.49e-01

Pos logit contributions
lly+0.281
 liqu+0.279
 Castro+0.276
bred+0.250
ful+0.246
Neg logit contributions
PDATE-0.267
atism-0.247
-0.226
ADRA-0.226
 mathemat-0.225
Tokenauer
Token ID16261
Feature activation-1.05e+00

Pos logit contributions
lly+1.680
 Castro+1.639
 liqu+1.639
bred+1.456
ful+1.421
Neg logit contributions
PDATE-2.650
atism-2.392
 mathemat-2.311
ADRA-2.273
-2.230
Token,
Token ID11
Feature activation-3.31e-01

Pos logit contributions
 Castro+4.645
lly+4.459
 liqu+4.265
 Chin+3.743
ful+3.666
Neg logit contributions
PDATE-13.864
 mathemat-12.630
 volunte-12.301
atism-12.176
 teasp-12.132
Token The
Token ID383
Feature activation+2.68e-02

Pos logit contributions
lly+2.280
 Castro+2.265
 liqu+2.246
bred+1.978
 Honour+1.957
Neg logit contributions
PDATE-3.431
atism-3.132
 mathemat-3.018
ADRA-2.950
 volunte-2.896
Token Chronicle
Token ID14160
Feature activation-4.69e-01

Pos logit contributions
PDATE+0.171
atism+0.155
+0.139
ADRA+0.138
 mathemat+0.134
Neg logit contributions
lly-0.281
 liqu-0.279
 Castro-0.274
bred-0.255
ful-0.250
Token 11
Token ID1367
Feature activation-8.11e-02

Pos logit contributions
lly+3.010
 liqu+2.994
 Castro+2.979
bred+2.576
ful+2.570
Neg logit contributions
PDATE-5.068
atism-4.628
 mathemat-4.453
ADRA-4.411
 volunte-4.292
Token of
Token ID286
Feature activation-9.25e-02

Pos logit contributions
lly+0.576
 liqu+0.571
 Castro+0.562
bred+0.500
 Honour+0.488
Neg logit contributions
PDATE-0.801
atism-0.749
-0.699
ADRA-0.696
 mathemat-0.694
Token the
Token ID262
Feature activation-3.21e-03

Pos logit contributions
lly+1.928
 liqu+1.916
 Castro+1.900
bred+1.687
ful+1.666
Neg logit contributions
PDATE-2.354
atism-2.159
 mathemat-2.017
ADRA-2.011
-1.987
Token exception
Token ID6631
Feature activation+1.11e-01

Pos logit contributions
lly+0.027
 liqu+0.026
 Castro+0.026
bred+0.024
ful+0.023
Neg logit contributions
PDATE-0.028
atism-0.026
 mathemat-0.024
ADRA-0.024
-0.024
Token of
Token ID286
Feature activation+2.99e-01

Pos logit contributions
PDATE+1.084
atism+1.009
ADRA+0.940
 mathemat+0.939
+0.938
Neg logit contributions
lly-0.797
 liqu-0.791
 Castro-0.776
bred-0.696
 Honour-0.677
Token season
Token ID1622
Feature activation+3.13e-01

Pos logit contributions
PDATE+2.697
atism+2.517
+2.326
 mathemat+2.318
ADRA+2.314
Neg logit contributions
lly-2.342
 liqu-2.313
 Castro-2.267
bred-2.070
ful-2.025
Token 14
Token ID1478
Feature activation+1.49e-01

Pos logit contributions
PDATE+2.672
atism+2.525
+2.304
 mathemat+2.297
ADRA+2.295
Neg logit contributions
lly-2.524
 liqu-2.501
 Castro-2.462
bred-2.241
ful-2.178
Token).[
Token ID42669
Feature activation-1.05e+00

Pos logit contributions
PDATE+1.444
atism+1.341
 mathemat+1.250
ADRA+1.250
+1.247
Neg logit contributions
lly-1.087
 liqu-1.078
 Castro-1.061
bred-0.951
ful-0.926
Token157
Token ID18458
Feature activation-4.22e-01

Pos logit contributions
lly+4.902
 liqu+4.740
 Castro+4.724
bred+4.113
ful+3.923
Neg logit contributions
PDATE-12.513
atism-11.622
 mathemat-11.112
ADRA-11.065
 volunte-10.993
Token]
Token ID60
Feature activation-1.72e-02

Pos logit contributions
lly+1.571
 liqu+1.519
 Castro+1.488
bred+1.177
ful+1.134
Neg logit contributions
PDATE-5.627
atism-5.336
 mathemat-5.091
ADRA-5.068
-5.028
Token
Token ID198
Feature activation-6.74e-02

Pos logit contributions
lly+0.124
 liqu+0.123
 Castro+0.120
bred+0.108
ful+0.105
Neg logit contributions
PDATE-0.170
atism-0.158
ADRA-0.148
-0.147
 mathemat-0.147
Token
Token ID198
Feature activation-2.08e-01

Pos logit contributions
lly+0.488
 liqu+0.485
 Castro+0.476
bred+0.423
 Honour+0.413
Neg logit contributions
PDATE-0.647
atism-0.613
-0.572
ADRA-0.565
 mathemat-0.558
TokenTh
Token ID817
Feature activation-4.11e-01

Pos logit contributions
lly+1.778
 liqu+1.743
 Castro+1.727
bred+1.582
ful+1.558
Neg logit contributions
PDATE-1.825
atism-1.639
 mathemat-1.550
ADRA-1.515
 volunte-1.488
Token--+
Token ID44785
Feature activation+7.07e-02

Pos logit contributions
atism+3.172
PDATE+2.971
+2.544
ADRA+2.543
Buff+2.338
Neg logit contributions
 liqu-10.064
lly-9.806
 Castro-9.675
bred-9.091
 Sims-8.983
Token |
Token ID930
Feature activation-2.01e-02

Pos logit contributions
PDATE+0.716
atism+0.663
ADRA+0.621
 mathemat+0.620
+0.616
Neg logit contributions
lly-0.494
 liqu-0.487
 Castro-0.479
bred-0.429
ful-0.417
Token...
Token ID2644
Feature activation+5.52e-02

Pos logit contributions
lly+0.154
 liqu+0.152
 Castro+0.150
bred+0.135
ful+0.132
Neg logit contributions
PDATE-0.190
atism-0.176
 mathemat-0.164
ADRA-0.164
-0.163
Token |
Token ID930
Feature activation+6.94e-01

Pos logit contributions
PDATE+0.554
atism+0.521
+0.488
ADRA+0.485
 mathemat+0.479
Neg logit contributions
lly-0.380
 liqu-0.376
 Castro-0.368
bred-0.327
ful-0.318
Token +
Token ID1343
Feature activation+7.29e-01

Pos logit contributions
PDATE+6.196
atism+5.714
ADRA+5.293
+5.265
 mathemat+5.204
Neg logit contributions
lly-5.323
 liqu-5.205
 Castro-5.065
bred-4.802
ful-4.541
Token--------------------
Token ID19351
Feature activation+8.67e-01

Pos logit contributions
PDATE+3.239
atism+3.133
+2.955
Buff+2.727
ADRA+2.694
Neg logit contributions
 Castro-8.198
 liqu-8.158
lly-8.011
 Chin-7.400
bred-7.391
Token--+
Token ID44785
Feature activation+3.91e-01

Pos logit contributions
atism+3.219
PDATE+3.047
ADRA+2.572
+2.559
Buff+2.356
Neg logit contributions
 liqu-10.727
lly-10.381
 Castro-10.264
bred-9.644
 Sims-9.579
Token
Token ID198
Feature activation+1.85e-01

Pos logit contributions
PDATE+3.713
atism+3.424
ADRA+3.216
+3.172
 mathemat+3.166
Neg logit contributions
lly-2.924
 liqu-2.886
 Castro-2.798
bred-2.581
ful-2.483
Token
Token ID198
Feature activation+1.18e-02

Pos logit contributions
atism+1.538
PDATE+1.534
+1.450
Buff+1.387
ADRA+1.368
Neg logit contributions
 liqu-1.441
lly-1.426
 Castro-1.393
bred-1.253
 Honour-1.225
TokenUh
Token ID34653
Feature activation+3.72e-02

Pos logit contributions
PDATE+0.103
atism+0.094
 mathemat+0.087
ADRA+0.087
+0.087
Neg logit contributions
lly-0.099
 liqu-0.098
 Castro-0.096
bred-0.087
ful-0.086
Token oh
Token ID11752
Feature activation-1.70e-01

Pos logit contributions
PDATE+0.389
atism+0.363
ADRA+0.341
 mathemat+0.340
+0.339
Neg logit contributions
lly-0.242
 liqu-0.240
 Castro-0.237
bred-0.208
 Honour-0.203
Token
Token ID198
Feature activation+3.34e-01

Pos logit contributions
atism+2.420
PDATE+2.310
+2.287
Buff+2.186
ESA+2.132
Neg logit contributions
 liqu-2.460
lly-2.425
 Castro-2.371
bred-2.121
 Honour-2.089
Tokenon
Token ID261
Feature activation+2.33e-01

Pos logit contributions
PDATE+2.717
atism+2.551
+2.360
ADRA+2.356
 mathemat+2.290
Neg logit contributions
 liqu-2.885
lly-2.883
 Castro-2.855
bred-2.558
 Honour-2.530
Token the
Token ID262
Feature activation+4.76e-01

Pos logit contributions
PDATE+1.981
atism+1.808
ADRA+1.679
+1.669
 mathemat+1.664
Neg logit contributions
lly-1.965
 liqu-1.961
 Castro-1.942
bred-1.763
 Honour-1.724
Token address
Token ID2209
Feature activation+2.78e-01

Pos logit contributions
PDATE+3.870
atism+3.530
+3.280
 mathemat+3.278
ADRA+3.257
Neg logit contributions
 liqu-4.052
lly-4.015
 Castro-3.998
bred-3.708
chin-3.562
Token https
Token ID3740
Feature activation-9.23e-02

Pos logit contributions
PDATE+2.638
atism+2.454
ADRA+2.292
 mathemat+2.284
+2.278
Neg logit contributions
lly-2.067
 liqu-2.039
 Castro-2.007
bred-1.823
ful-1.757
Token://
Token ID1378
Feature activation+8.79e-01

Pos logit contributions
lly+0.782
 liqu+0.772
 Castro+0.764
bred+0.694
ful+0.684
Neg logit contributions
PDATE-0.808
atism-0.731
 mathemat-0.685
ADRA-0.680
-0.670
Tokenold
Token ID727
Feature activation+8.40e-02

Pos logit contributions
atism+4.740
+4.285
PDATE+4.210
usha+3.779
Buff+3.743
Neg logit contributions
 liqu-9.115
 Castro-9.049
lly-8.869
 Chin-8.319
 Honour-8.234
Token.
Token ID13
Feature activation+4.87e-01

Pos logit contributions
PDATE+0.839
atism+0.813
+0.766
ADRA+0.744
 mathemat+0.730
Neg logit contributions
 liqu-0.549
lly-0.548
 Castro-0.535
bred-0.470
 Honour-0.458
Tokennice
Token ID44460
Feature activation+4.18e-01

Pos logit contributions
atism+3.047
PDATE+2.824
+2.674
usha+2.526
Buff+2.462
Neg logit contributions
 liqu-4.878
 Castro-4.859
lly-4.724
 Chin-4.410
 Honour-4.398
Tokenhash
Token ID17831
Feature activation-3.93e-01

Pos logit contributions
atism+1.716
PDATE+1.630
+1.447
ADRA+1.283
usha+1.250
Neg logit contributions
 liqu-5.076
 Castro-5.068
lly-4.983
 Honour-4.658
 Chin-4.655
Token.
Token ID13
Feature activation+1.11e-01

Pos logit contributions
lly+2.234
 liqu+2.162
 Castro+2.153
bred+1.874
ful+1.860
Neg logit contributions
PDATE-4.694
atism-4.229
 mathemat-4.170
ADRA-4.078
 volunte-4.030
Token had
Token ID550
Feature activation+1.66e-01

Pos logit contributions
lly+1.255
 liqu+1.243
 Castro+1.228
bred+1.097
ful+1.076
Neg logit contributions
PDATE-1.626
atism-1.494
ADRA-1.402
 mathemat-1.394
-1.385
Token a
Token ID257
Feature activation-1.94e-01

Pos logit contributions
PDATE+1.488
atism+1.411
 mathemat+1.321
+1.297
ADRA+1.282
Neg logit contributions
lly-1.282
 liqu-1.250
 Castro-1.238
bred-1.123
 Honour-1.114
Token collection
Token ID4947
Feature activation-3.12e-01

Pos logit contributions
lly+1.552
 liqu+1.546
 Castro+1.524
bred+1.371
ful+1.347
Neg logit contributions
PDATE-1.767
atism-1.617
ADRA-1.506
 mathemat-1.501
-1.494
Token of
Token ID286
Feature activation-2.76e-01

Pos logit contributions
lly+2.075
 liqu+2.072
 Castro+2.045
bred+1.790
ful+1.790
Neg logit contributions
PDATE-3.252
atism-3.005
ADRA-2.835
 mathemat-2.805
-2.787
Token autom
Token ID3557
Feature activation-4.72e-02

Pos logit contributions
 liqu+2.156
lly+2.141
 Castro+2.127
bred+1.890
ful+1.861
Neg logit contributions
PDATE-2.578
atism-2.353
ADRA-2.204
-2.173
 mathemat-2.167
Tokenat
Token ID265
Feature activation+8.80e-01

Pos logit contributions
lly+0.465
 liqu+0.461
 Castro+0.456
bred+0.420
 Honour+0.413
Neg logit contributions
PDATE-0.338
atism-0.311
 mathemat-0.279
ADRA-0.278
-0.277
Tokenons
Token ID684
Feature activation+2.06e-01

Pos logit contributions
atism+5.276
PDATE+5.230
+4.595
 mathemat+4.542
ufact+4.234
Neg logit contributions
lly-8.565
 Castro-8.561
 liqu-8.519
 Honour-7.809
 Chin-7.744
Token that
Token ID326
Feature activation-1.38e-02

Pos logit contributions
PDATE+1.942
atism+1.821
 mathemat+1.714
+1.700
ADRA+1.669
Neg logit contributions
lly-1.535
 liqu-1.503
 Castro-1.482
bred-1.329
 Honour-1.314
Token was
Token ID373
Feature activation+3.09e-01

Pos logit contributions
lly+0.105
 liqu+0.104
 Castro+0.103
bred+0.092
ful+0.091
Neg logit contributions
PDATE-0.130
atism-0.120
 mathemat-0.112
ADRA-0.111
-0.111
Token eventually
Token ID4191
Feature activation+7.42e-02

Pos logit contributions
PDATE+2.744
atism+2.606
 mathemat+2.450
+2.406
ADRA+2.382
Neg logit contributions
lly-2.403
 liqu-2.324
 Castro-2.317
bred-2.101
 Honour-2.064
Token thrown
Token ID8754
Feature activation+1.02e-01

Pos logit contributions
PDATE+0.684
atism+0.646
 mathemat+0.610
+0.600
ADRA+0.593
Neg logit contributions
lly-0.565
 liqu-0.547
 Castro-0.545
bred-0.493
 Honour-0.483
Token Kenya
Token ID21506
Feature activation+1.27e-01

Pos logit contributions
PDATE+1.568
atism+1.467
 mathemat+1.358
+1.351
ADRA+1.345
Neg logit contributions
lly-1.371
 liqu-1.351
 Castro-1.322
bred-1.210
ful-1.188
Token African
Token ID5510
Feature activation+5.70e-02

Pos logit contributions
PDATE+1.153
atism+1.078
+1.001
 mathemat+0.995
ADRA+0.992
Neg logit contributions
lly-0.988
 liqu-0.980
 Castro-0.959
bred-0.870
ful-0.851
Token Democratic
Token ID4390
Feature activation-7.96e-02

Pos logit contributions
PDATE+0.474
atism+0.439
+0.402
 mathemat+0.402
ADRA+0.400
Neg logit contributions
lly-0.494
 liqu-0.489
 Castro-0.480
bred-0.439
ful-0.432
Token Union
Token ID4479
Feature activation+2.77e-01

Pos logit contributions
lly+0.597
 liqu+0.590
 Castro+0.583
bred+0.522
ful+0.512
Neg logit contributions
PDATE-0.764
atism-0.706
 mathemat-0.659
ADRA-0.658
-0.654
Token (
Token ID357
Feature activation+5.54e-01

Pos logit contributions
PDATE+2.389
atism+2.219
ADRA+2.041
 mathemat+2.040
+2.036
Neg logit contributions
lly-2.288
 liqu-2.276
 Castro-2.218
bred-2.048
ful-1.991
TokenK
Token ID42
Feature activation+8.27e-01

Pos logit contributions
PDATE+3.843
atism+3.453
ADRA+3.354
+3.244
 mathemat+3.164
Neg logit contributions
lly-5.369
 liqu-5.368
 Castro-5.297
bred-4.900
 Honour-4.839
TokenAD
Token ID2885
Feature activation+2.88e-01

Pos logit contributions
PDATE+6.129
ADRA+5.835
atism+5.827
+5.412
ESA+5.371
Neg logit contributions
 liqu-7.202
 Castro-7.053
lly-7.024
bred-6.472
 Honour-6.437
TokenU
Token ID52
Feature activation+3.00e-01

Pos logit contributions
PDATE+1.974
atism+1.762
ADRA+1.604
 mathemat+1.595
+1.584
Neg logit contributions
lly-2.949
 liqu-2.921
 Castro-2.891
bred-2.673
 Honour-2.633
Token),
Token ID828
Feature activation+2.54e-01

Pos logit contributions
PDATE+2.537
atism+2.330
ADRA+2.179
+2.151
 mathemat+2.130
Neg logit contributions
lly-2.527
 liqu-2.515
 Castro-2.468
bred-2.253
 Honour-2.215
Token which
Token ID543
Feature activation-2.95e-01

Pos logit contributions
PDATE+2.029
atism+1.897
 mathemat+1.734
+1.718
ADRA+1.710
Neg logit contributions
lly-2.264
 liqu-2.218
 Castro-2.178
bred-2.006
 Honour-1.973
Token was
Token ID373
Feature activation-2.98e-01

Pos logit contributions
lly+2.121
 liqu+2.097
 Castro+2.083
bred+1.834
ful+1.803
Neg logit contributions
PDATE-2.954
atism-2.692
ADRA-2.541
 mathemat-2.526
-2.523
Token "
Token ID366
Feature activation-2.82e-01

Pos logit contributions
lly+1.274
 liqu+1.261
 Castro+1.237
bred+1.074
ful+1.056
Neg logit contributions
PDATE-2.335
atism-2.151
 mathemat-2.059
ADRA-2.048
-2.010
Tokenvim
Token ID31124
Feature activation+1.43e+00

Pos logit contributions
lly+1.426
 liqu+1.382
 Castro+1.345
bred+1.160
ful+1.130
Neg logit contributions
PDATE-3.462
atism-3.201
 mathemat-3.081
ADRA-3.066
-3.013
Token:"
Token ID11097
Feature activation+6.44e-02

Pos logit contributions
PDATE+7.271
atism+6.583
+6.082
 mathemat+5.721
 volunte+5.369
Neg logit contributions
 liqu-14.106
 Castro-14.044
lly-13.853
 Honour-13.154
chin-13.093
Token or
Token ID393
Feature activation-3.33e-01

Pos logit contributions
PDATE+0.594
atism+0.556
+0.513
ADRA+0.511
 mathemat+0.509
Neg logit contributions
lly-0.497
 liqu-0.490
 Castro-0.486
bred-0.437
 Honour-0.426
Token "
Token ID366
Feature activation+7.45e-02

Pos logit contributions
lly+1.979
 liqu+1.951
 Castro+1.915
bred+1.656
ful+1.638
Neg logit contributions
PDATE-3.812
atism-3.480
 mathemat-3.367
ADRA-3.336
-3.253
Tokenex
Token ID1069
Feature activation+1.01e+00

Pos logit contributions
PDATE+0.644
atism+0.609
+0.561
ADRA+0.551
 mathemat+0.548
Neg logit contributions
lly-0.604
 liqu-0.601
 Castro-0.597
bred-0.532
 Honour-0.531
Token:"
Token ID11097
Feature activation-4.00e-02

Pos logit contributions
atism+5.425
PDATE+5.204
+4.933
usha+4.415
 mathemat+4.253
Neg logit contributions
 liqu-9.937
 Castro-9.920
lly-9.681
 Chin-9.152
 Honour-9.121
Token [
Token ID685
Feature activation+1.92e-01

Pos logit contributions
lly+0.313
 liqu+0.310
 Castro+0.305
bred+0.275
ful+0.270
Neg logit contributions
PDATE-0.371
atism-0.342
 mathemat-0.319
ADRA-0.318
-0.316
Tokenwhite
Token ID11186
Feature activation-5.36e-01

Pos logit contributions
PDATE+1.525
atism+1.409
+1.283
 mathemat+1.276
ADRA+1.269
Neg logit contributions
lly-1.721
 liqu-1.717
 Castro-1.706
bred-1.547
 Honour-1.533
Token]
Token ID60
Feature activation+8.92e-02

Pos logit contributions
lly+1.857
 liqu+1.821
 Castro+1.737
ful+1.338
bred+1.323
Neg logit contributions
PDATE-7.251
atism-6.861
 mathemat-6.569
ADRA-6.567
-6.483
Token optional
Token ID11902
Feature activation-8.91e-02

Pos logit contributions
PDATE+0.850
atism+0.786
 mathemat+0.732
ADRA+0.732
+0.728
Neg logit contributions
lly-0.673
 liqu-0.666
 Castro-0.658
bred-0.590
ful-0.577
Token it
Token ID340
Feature activation-2.05e-02

Pos logit contributions
lly+0.451
 liqu+0.448
 Castro+0.441
bred+0.392
ful+0.387
Neg logit contributions
PDATE-0.587
atism-0.542
 mathemat-0.506
ADRA-0.505
-0.503
Token will
Token ID481
Feature activation-6.51e-02

Pos logit contributions
lly+0.152
 liqu+0.151
 Castro+0.149
bred+0.134
ful+0.131
Neg logit contributions
PDATE-0.197
atism-0.183
 mathemat-0.171
ADRA-0.170
-0.169
Token panic
Token ID13619
Feature activation+3.69e-01

Pos logit contributions
lly+0.514
 liqu+0.508
 Castro+0.502
bred+0.452
ful+0.444
Neg logit contributions
PDATE-0.600
atism-0.553
 mathemat-0.514
ADRA-0.513
-0.510
Token and
Token ID290
Feature activation+8.21e-02

Pos logit contributions
PDATE+3.603
atism+3.274
ADRA+3.080
 mathemat+3.058
+3.044
Neg logit contributions
lly-2.637
 liqu-2.626
 Castro-2.597
bred-2.359
 Honour-2.292
Token reboot
Token ID20149
Feature activation+3.10e-01

Pos logit contributions
PDATE+0.741
atism+0.686
 mathemat+0.637
ADRA+0.634
+0.630
Neg logit contributions
lly-0.655
 liqu-0.647
 Castro-0.640
bred-0.582
ful-0.567
Token endlessly
Token ID33903
Feature activation+3.63e-01

Pos logit contributions
PDATE+3.085
atism+2.859
ADRA+2.672
 mathemat+2.642
+2.640
Neg logit contributions
lly-2.144
 liqu-2.131
 Castro-2.104
bred-1.910
 Honour-1.834
Token.
Token ID13
Feature activation-7.60e-02

Pos logit contributions
PDATE+3.645
atism+3.388
+3.158
ADRA+3.151
 mathemat+3.147
Neg logit contributions
 liqu-2.446
lly-2.445
 Castro-2.400
bred-2.160
 Honour-2.107
Token That
Token ID1320
Feature activation+3.19e-01

Pos logit contributions
lly+0.522
 liqu+0.516
 Castro+0.510
bred+0.447
ful+0.441
Neg logit contributions
PDATE-0.781
atism-0.724
 mathemat-0.680
ADRA-0.678
-0.674
Token means
Token ID1724
Feature activation+2.93e-01

Pos logit contributions
PDATE+2.977
atism+2.799
 mathemat+2.604
+2.571
ADRA+2.566
Neg logit contributions
lly-2.364
 liqu-2.344
 Castro-2.325
bred-2.120
 Honour-2.056
Token that
Token ID326
Feature activation+3.69e-01

Pos logit contributions
PDATE+2.703
atism+2.526
 mathemat+2.333
ADRA+2.325
+2.321
Neg logit contributions
lly-2.231
 liqu-2.194
 Castro-2.163
bred-1.982
 Honour-1.922
Token if
Token ID611
Feature activation+1.64e-01

Pos logit contributions
PDATE+3.375
atism+3.163
 mathemat+2.913
ADRA+2.904
+2.893
Neg logit contributions
lly-2.805
 liqu-2.753
 Castro-2.722
bred-2.507
 Honour-2.429
Token )
Token ID1267
Feature activation-1.57e-01

Pos logit contributions
PDATE+3.136
atism+2.888
ADRA+2.743
+2.648
 mathemat+2.631
Neg logit contributions
lly-2.973
 liqu-2.936
 Castro-2.881
bred-2.638
 Honour-2.577
Token ->
Token ID4613
Feature activation+3.98e-01

Pos logit contributions
lly+1.062
 liqu+1.046
 Castro+1.034
bred+0.899
ful+0.892
Neg logit contributions
PDATE-1.642
atism-1.511
 mathemat-1.430
ADRA-1.417
-1.412
Token Own
Token ID11744
Feature activation+2.67e-01

Pos logit contributions
PDATE+3.175
atism+2.977
+2.730
 mathemat+2.716
ADRA+2.702
Neg logit contributions
lly-3.463
 liqu-3.383
 Castro-3.349
bred-3.106
ful-2.981
Tokened
Token ID276
Feature activation+3.41e-01

Pos logit contributions
PDATE+3.586
atism+3.435
+3.275
ADRA+3.258
 mathemat+3.219
Neg logit contributions
lly-0.894
 liqu-0.878
 Castro-0.863
bred-0.640
 Honour-0.614
Token <
Token ID1279
Feature activation+2.03e-01

Pos logit contributions
PDATE+3.367
atism+3.123
ADRA+2.937
+2.918
 mathemat+2.882
Neg logit contributions
lly-2.418
 liqu-2.385
 Castro-2.348
bred-2.111
 Honour-2.042
Token T
Token ID309
Feature activation+2.43e-01

Pos logit contributions
PDATE+1.683
atism+1.555
ADRA+1.424
+1.421
 mathemat+1.416
Neg logit contributions
lly-1.760
 liqu-1.737
 Castro-1.715
bred-1.577
ful-1.533
Token >
Token ID1875
Feature activation+1.49e-01

Pos logit contributions
PDATE+2.392
atism+2.234
ADRA+2.111
+2.063
 mathemat+2.056
Neg logit contributions
lly-1.712
 liqu-1.692
 Castro-1.661
bred-1.505
 Honour-1.451
Token ;
Token ID2162
Feature activation-7.91e-02

Pos logit contributions
PDATE+1.655
atism+1.557
ADRA+1.476
+1.464
 mathemat+1.458
Neg logit contributions
lly-0.871
 liqu-0.856
 Castro-0.844
bred-0.742
 Honour-0.709
Token }
Token ID1782
Feature activation+1.70e-01

Pos logit contributions
lly+0.465
 liqu+0.460
 Castro+0.452
bred+0.390
ful+0.381
Neg logit contributions
PDATE-0.888
atism-0.831
 mathemat-0.784
ADRA-0.783
-0.779
Token impl
Token ID4114
Feature activation+4.23e-01

Pos logit contributions
PDATE+1.640
atism+1.530
 mathemat+1.428
ADRA+1.426
+1.414
Neg logit contributions
lly-1.244
 liqu-1.217
 Castro-1.205
bred-1.093
ful-1.055
Token <
Token ID1279
Feature activation+6.16e-01

Pos logit contributions
PDATE+3.852
atism+3.646
 mathemat+3.380
ADRA+3.360
+3.359
Neg logit contributions
lly-3.163
 liqu-3.115
 Castro-3.033
bred-2.792
 Honour-2.647
Token daughter
Token ID4957
Feature activation+7.96e-02

Pos logit contributions
PDATE+2.210
atism+2.055
 mathemat+1.909
+1.877
ADRA+1.875
Neg logit contributions
lly-2.214
 liqu-2.179
 Castro-2.141
bred-1.949
ful-1.912
Token.
Token ID13
Feature activation+2.46e-01

Pos logit contributions
PDATE+0.831
atism+0.777
 mathemat+0.731
ADRA+0.727
+0.725
Neg logit contributions
lly-0.518
 liqu-0.513
 Castro-0.499
bred-0.446
 Honour-0.430
Token My
Token ID2011
Feature activation+1.72e-01

Pos logit contributions
PDATE+2.265
atism+2.126
+1.985
ADRA+1.975
 mathemat+1.922
Neg logit contributions
lly-1.882
 liqu-1.871
 Castro-1.782
bred-1.648
ful-1.583
Token husband
Token ID5229
Feature activation+7.50e-02

Pos logit contributions
PDATE+1.443
atism+1.341
 mathemat+1.242
+1.224
ADRA+1.222
Neg logit contributions
lly-1.450
 liqu-1.434
 Castro-1.412
bred-1.296
ful-1.266
Token was
Token ID373
Feature activation-9.81e-02

Pos logit contributions
PDATE+0.701
atism+0.659
 mathemat+0.616
+0.607
ADRA+0.605
Neg logit contributions
lly-0.559
 liqu-0.557
 Castro-0.546
bred-0.494
 Honour-0.482
Token still
Token ID991
Feature activation+1.81e-01

Pos logit contributions
lly+0.744
 liqu+0.735
 Castro+0.725
bred+0.651
ful+0.638
Neg logit contributions
PDATE-0.931
atism-0.862
 mathemat-0.803
ADRA-0.801
-0.797
Token abusive
Token ID19546
Feature activation-6.89e-02

Pos logit contributions
PDATE+1.622
atism+1.524
 mathemat+1.415
+1.399
ADRA+1.382
Neg logit contributions
lly-1.414
 liqu-1.393
 Castro-1.368
bred-1.247
ful-1.202
Token -
Token ID532
Feature activation-1.09e-01

Pos logit contributions
lly+0.450
 liqu+0.444
 Castro+0.437
bred+0.386
 Honour+0.375
Neg logit contributions
PDATE-0.726
atism-0.677
 mathemat-0.635
ADRA-0.634
-0.632
Token I
Token ID314
Feature activation-6.10e-02

Pos logit contributions
lly+0.801
 liqu+0.793
 Castro+0.786
bred+0.696
ful+0.687
Neg logit contributions
PDATE-1.077
atism-0.992
ADRA-0.931
 mathemat-0.926
-0.923
Token almost
Token ID2048
Feature activation-2.93e-01

Pos logit contributions
lly+0.463
 liqu+0.458
 Castro+0.452
bred+0.406
ful+0.398
Neg logit contributions
PDATE-0.576
atism-0.536
 mathemat-0.498
ADRA-0.497
-0.494
Token had
Token ID550
Feature activation+7.81e-03

Pos logit contributions
lly+2.207
 liqu+2.184
 Castro+2.144
bred+1.919
ful+1.882
Neg logit contributions
PDATE-2.870
atism-2.600
ADRA-2.459
 mathemat-2.431
-2.431
Token she
Token ID673
Feature activation-4.28e-01

Pos logit contributions
PDATE+2.217
atism+2.066
 mathemat+1.908
ADRA+1.908
+1.904
Neg logit contributions
lly-1.815
 liqu-1.799
 Castro-1.762
bred-1.604
ful-1.546
Token te
Token ID573
Feature activation-2.16e-01

Pos logit contributions
lly+2.678
 liqu+2.636
 Castro+2.586
bred+2.235
ful+2.223
Neg logit contributions
PDATE-4.774
atism-4.346
ADRA-4.168
 mathemat-4.118
-4.074
Tokenases
Token ID1386
Feature activation-4.27e-01

Pos logit contributions
 liqu+1.544
 Castro+1.532
lly+1.531
bred+1.359
 Honour+1.345
Neg logit contributions
PDATE-2.072
atism-1.978
ADRA-1.821
 mathemat-1.808
-1.804
Token.
Token ID13
Feature activation+1.24e-01

Pos logit contributions
lly+2.641
 liqu+2.565
 Castro+2.542
ful+2.193
bred+2.140
Neg logit contributions
PDATE-4.763
atism-4.391
ADRA-4.177
 mathemat-4.165
-4.128
Token
Token ID198
Feature activation+2.55e-01

Pos logit contributions
PDATE+1.210
atism+1.132
ADRA+1.052
+1.050
 mathemat+1.043
Neg logit contributions
lly-0.897
 liqu-0.889
 Castro-0.868
bred-0.787
ful-0.756
Token
Token ID198
Feature activation+4.14e-01

Pos logit contributions
atism+2.065
PDATE+2.006
+1.947
Buff+1.862
ESA+1.815
Neg logit contributions
 liqu-2.023
lly-2.001
 Castro-1.959
bred-1.757
 Honour-1.727
TokenI
Token ID40
Feature activation+3.75e-01

Pos logit contributions
PDATE+3.209
atism+3.092
+2.852
ADRA+2.846
Buff+2.756
Neg logit contributions
 liqu-3.608
lly-3.566
 Castro-3.558
bred-3.234
 Honour-3.165
Token tried
Token ID3088
Feature activation-3.32e-03

Pos logit contributions
PDATE+3.273
atism+3.247
 mathemat+2.942
+2.914
ADRA+2.890
Neg logit contributions
 Castro-2.749
lly-2.718
 liqu-2.688
bred-2.479
 Honour-2.395
Token to
Token ID284
Feature activation+1.16e-01

Pos logit contributions
lly+0.023
 liqu+0.023
 Castro+0.023
bred+0.020
 Honour+0.020
Neg logit contributions
PDATE-0.033
atism-0.031
ADRA-0.029
 mathemat-0.029
-0.029
Token hide
Token ID7808
Feature activation+7.84e-02

Pos logit contributions
PDATE+1.014
atism+0.970
 mathemat+0.888
+0.887
ADRA+0.881
Neg logit contributions
lly-0.899
 Castro-0.895
 liqu-0.889
bred-0.807
 Honour-0.785
Token my
Token ID616
Feature activation-8.64e-03

Pos logit contributions
PDATE+0.733
atism+0.678
 mathemat+0.630
ADRA+0.630
+0.626
Neg logit contributions
lly-0.604
 liqu-0.598
 Castro-0.590
bred-0.532
ful-0.520
Token:
Token ID25
Feature activation+3.88e-01

Pos logit contributions
PDATE+4.628
atism+4.301
+4.097
ADRA+3.975
 mathemat+3.953
Neg logit contributions
lly-3.951
 liqu-3.918
 Castro-3.864
bred-3.576
 Honour-3.443
Token Half
Token ID13139
Feature activation+2.65e-01

Pos logit contributions
PDATE+3.562
atism+3.308
+3.107
ADRA+3.072
 mathemat+3.050
Neg logit contributions
lly-2.996
 liqu-2.944
 Castro-2.898
bred-2.642
ful-2.533
Token Shell
Token ID17537
Feature activation-1.05e-01

Pos logit contributions
PDATE+2.776
atism+2.586
+2.493
ADRA+2.446
 mathemat+2.406
Neg logit contributions
lly-1.646
 liqu-1.637
 Castro-1.607
bred-1.404
 Honour-1.384
Token still
Token ID991
Feature activation+3.15e-01

Pos logit contributions
lly+0.732
 liqu+0.722
 Castro+0.712
bred+0.637
ful+0.627
Neg logit contributions
PDATE-1.074
atism-0.993
 mathemat-0.935
ADRA-0.931
-0.922
Token in
Token ID287
Feature activation+2.32e-01

Pos logit contributions
PDATE+2.834
atism+2.657
+2.484
ADRA+2.438
 mathemat+2.435
Neg logit contributions
lly-2.406
 liqu-2.373
 Castro-2.338
bred-2.157
 Honour-2.102
Token full
Token ID1336
Feature activation+5.29e-01

Pos logit contributions
PDATE+2.027
atism+1.868
+1.729
ADRA+1.726
 mathemat+1.723
Neg logit contributions
lly-1.901
 liqu-1.876
 Castro-1.849
bred-1.689
 Honour-1.644
Token swing
Token ID9628
Feature activation+1.81e-01

Pos logit contributions
PDATE+4.253
atism+3.962
+3.710
 mathemat+3.663
ADRA+3.587
Neg logit contributions
lly-4.366
 Castro-4.252
 liqu-4.241
bred-3.873
 Honour-3.821
Token,
Token ID11
Feature activation-8.92e-02

Pos logit contributions
PDATE+1.874
atism+1.749
+1.642
ADRA+1.632
 mathemat+1.627
Neg logit contributions
lly-1.187
 liqu-1.175
 Castro-1.156
bred-1.026
 Honour-1.004
Token it
Token ID340
Feature activation+7.15e-02

Pos logit contributions
lly+0.671
 liqu+0.666
 Castro+0.661
bred+0.587
ful+0.581
Neg logit contributions
PDATE-0.858
atism-0.789
 mathemat-0.739
ADRA-0.739
-0.729
Token
Token ID447
Feature activation-3.70e-01

Pos logit contributions
PDATE+0.679
atism+0.641
+0.595
 mathemat+0.592
ADRA+0.587
Neg logit contributions
lly-0.527
 liqu-0.523
 Castro-0.516
bred-0.463
 Honour-0.452
Token
Token ID247
Feature activation-1.30e-02

Pos logit contributions
lly+4.210
 liqu+4.154
 Castro+4.143
bred+3.849
ful+3.812
Neg logit contributions
PDATE-2.173
atism-1.856
 mathemat-1.747
ADRA-1.627
-1.564
Token Studies
Token ID10422
Feature activation-5.52e-02

Pos logit contributions
lly+0.325
 liqu+0.321
 Castro+0.317
bred+0.285
ful+0.281
Neg logit contributions
PDATE-0.392
atism-0.361
 mathemat-0.337
ADRA-0.337
-0.334
Token at
Token ID379
Feature activation+2.72e-01

Pos logit contributions
lly+0.416
 liqu+0.411
 Castro+0.406
bred+0.364
ful+0.358
Neg logit contributions
PDATE-0.529
atism-0.488
ADRA-0.456
 mathemat-0.456
-0.453
Token the
Token ID262
Feature activation+1.44e-01

Pos logit contributions
PDATE+2.438
atism+2.297
 mathemat+2.091
+2.090
ADRA+2.078
Neg logit contributions
lly-2.150
 liqu-2.126
 Castro-2.048
bred-1.865
ful-1.843
Token Cand
Token ID15518
Feature activation-4.00e-01

Pos logit contributions
PDATE+1.350
atism+1.262
 mathemat+1.169
+1.162
ADRA+1.159
Neg logit contributions
lly-1.090
 liqu-1.080
 Castro-1.055
bred-0.952
ful-0.935
Tokenler
Token ID1754
Feature activation+1.80e-01

Pos logit contributions
lly+2.819
 Castro+2.742
 liqu+2.719
bred+2.456
ful+2.429
Neg logit contributions
PDATE-4.114
atism-3.671
 mathemat-3.558
ADRA-3.525
-3.497
Token School
Token ID3961
Feature activation-7.42e-02

Pos logit contributions
PDATE+1.596
atism+1.491
+1.369
 mathemat+1.366
ADRA+1.359
Neg logit contributions
lly-1.446
 liqu-1.442
 Castro-1.406
bred-1.282
 Honour-1.250
Token of
Token ID286
Feature activation-4.89e-01

Pos logit contributions
lly+0.522
 liqu+0.515
 Castro+0.509
bred+0.452
ful+0.445
Neg logit contributions
PDATE-0.748
atism-0.692
ADRA-0.650
 mathemat-0.649
-0.645
Token The
Token ID383
Feature activation-4.46e-01

Pos logit contributions
lly+3.446
 Castro+3.398
 liqu+3.358
bred+3.025
 Chin+2.989
Neg logit contributions
PDATE-4.903
atism-4.452
 mathemat-4.304
ADRA-4.251
-4.231
Tokenology
Token ID1435
Feature activation-3.40e-01

Pos logit contributions
lly+3.799
 Castro+3.712
 liqu+3.655
bred+3.359
ful+3.316
Neg logit contributions
PDATE-3.976
atism-3.454
 mathemat-3.305
ADRA-3.296
-3.261
Token of
Token ID286
Feature activation-2.62e-01

Pos logit contributions
lly+2.346
 liqu+2.300
 Castro+2.300
bred+2.034
ful+2.031
Neg logit contributions
PDATE-3.477
atism-3.163
ADRA-3.044
 mathemat-2.994
-2.970
Token Em
Token ID2295
Feature activation-2.41e-01

Pos logit contributions
lly+1.810
 Castro+1.781
 liqu+1.776
bred+1.573
 Honour+1.554
Neg logit contributions
PDATE-2.689
atism-2.464
 mathemat-2.353
ADRA-2.353
-2.325
Token on
Token ID319
Feature activation-5.55e-02

Pos logit contributions
lly+0.030
 liqu+0.029
 Castro+0.029
bred+0.026
ful+0.025
Neg logit contributions
PDATE-0.040
atism-0.038
 mathemat-0.035
ADRA-0.035
-0.035
Token big
Token ID1263
Feature activation-7.17e-02

Pos logit contributions
lly+0.442
 liqu+0.436
 Castro+0.431
bred+0.390
ful+0.382
Neg logit contributions
PDATE-0.506
atism-0.467
 mathemat-0.434
ADRA-0.433
-0.430
Token projects
Token ID4493
Feature activation-1.26e-01

Pos logit contributions
lly+0.553
 liqu+0.546
 Castro+0.542
bred+0.487
ful+0.477
Neg logit contributions
PDATE-0.669
atism-0.618
 mathemat-0.575
ADRA-0.574
-0.572
Token such
Token ID884
Feature activation-1.35e-02

Pos logit contributions
lly+0.914
 liqu+0.903
 Castro+0.891
bred+0.794
ful+0.779
Neg logit contributions
PDATE-1.239
atism-1.145
ADRA-1.071
 mathemat-1.070
-1.063
Token as
Token ID355
Feature activation-1.15e-01

Pos logit contributions
lly+0.098
 liqu+0.097
 Castro+0.096
bred+0.086
ful+0.084
Neg logit contributions
PDATE-0.130
atism-0.122
 mathemat-0.113
-0.113
ADRA-0.113
Token the
Token ID262
Feature activation-1.16e-01

Pos logit contributions
lly+0.885
 liqu+0.873
 Castro+0.863
bred+0.776
ful+0.761
Neg logit contributions
PDATE-1.088
atism-1.005
 mathemat-0.936
ADRA-0.935
-0.930
Token Kee
Token ID19799
Feature activation-1.23e-01

Pos logit contributions
lly+0.855
 liqu+0.841
 Castro+0.832
bred+0.745
ful+0.731
Neg logit contributions
PDATE-1.131
atism-1.050
 mathemat-0.981
ADRA-0.979
-0.973
Tokeny
Token ID88
Feature activation-5.53e-02

Pos logit contributions
lly+0.837
 liqu+0.828
 Castro+0.816
bred+0.722
ful+0.706
Neg logit contributions
PDATE-1.262
atism-1.174
 mathemat-1.100
ADRA-1.100
-1.094
Tokenask
Token ID2093
Feature activation+4.24e-02

Pos logit contributions
lly+0.396
 liqu+0.392
 Castro+0.387
bred+0.345
ful+0.337
Neg logit contributions
PDATE-0.547
atism-0.508
ADRA-0.474
 mathemat-0.474
-0.472
Token dam
Token ID1801
Feature activation+2.03e-01

Pos logit contributions
PDATE+0.414
atism+0.387
ADRA+0.360
 mathemat+0.359
+0.358
Neg logit contributions
lly-0.307
 liqu-0.304
 Castro-0.301
bred-0.269
ful-0.261
Token or
Token ID393
Feature activation-2.29e-01

Pos logit contributions
PDATE+1.941
atism+1.810
ADRA+1.684
+1.683
 mathemat+1.675
Neg logit contributions
lly-1.468
 Castro-1.449
 liqu-1.438
bred-1.297
 Honour-1.259
Token Lee
Token ID5741
Feature activation+1.48e-01

Pos logit contributions
lly+2.373
 Castro+2.351
 liqu+2.324
bred+2.054
ful+2.045
Neg logit contributions
PDATE-3.191
atism-2.927
 mathemat-2.767
ADRA-2.735
-2.695
Token and
Token ID290
Feature activation+1.78e-01

Pos logit contributions
PDATE+1.448
atism+1.354
+1.261
ADRA+1.257
 mathemat+1.256
Neg logit contributions
 liqu-1.051
lly-1.051
 Castro-1.019
bred-0.916
 Honour-0.896
Token Laura
Token ID16753
Feature activation-2.14e-01

Pos logit contributions
PDATE+1.671
atism+1.565
+1.454
 mathemat+1.454
ADRA+1.441
Neg logit contributions
lly-1.331
 liqu-1.321
 Castro-1.283
bred-1.170
ful-1.141
Token Ling
Token ID25116
Feature activation+1.78e-01

Pos logit contributions
lly+1.611
 liqu+1.588
 Castro+1.582
bred+1.409
ful+1.381
Neg logit contributions
PDATE-2.069
atism-1.899
 mathemat-1.786
ADRA-1.777
-1.759
Token who
Token ID508
Feature activation+6.48e-02

Pos logit contributions
PDATE+1.712
atism+1.601
+1.485
ADRA+1.482
 mathemat+1.480
Neg logit contributions
 liqu-1.283
lly-1.282
 Castro-1.246
bred-1.120
 Honour-1.100
Token sn
Token ID3013
Feature activation-1.86e-02

Pos logit contributions
PDATE+0.592
atism+0.549
 mathemat+0.509
ADRA+0.507
+0.506
Neg logit contributions
lly-0.512
 liqu-0.507
 Castro-0.500
bred-0.451
ful-0.442
Tokenuck
Token ID1347
Feature activation-3.88e-03

Pos logit contributions
lly+0.164
 liqu+0.162
 Castro+0.160
bred+0.146
ful+0.144
Neg logit contributions
PDATE-0.155
atism-0.141
 mathemat-0.130
ADRA-0.130
-0.129
Token into
Token ID656
Feature activation-9.50e-02

Pos logit contributions
lly+0.028
 liqu+0.028
 Castro+0.027
bred+0.025
 Honour+0.024
Neg logit contributions
PDATE-0.038
atism-0.035
 mathemat-0.033
ADRA-0.033
-0.033
Token North
Token ID2258
Feature activation+7.93e-02

Pos logit contributions
lly+0.682
 liqu+0.675
 Castro+0.666
bred+0.593
ful+0.580
Neg logit contributions
PDATE-0.944
atism-0.874
 mathemat-0.818
ADRA-0.817
-0.813
Token Korea
Token ID4969
Feature activation+2.36e-01

Pos logit contributions
PDATE+0.757
atism+0.705
+0.653
 mathemat+0.653
ADRA+0.652
Neg logit contributions
lly-0.587
 liqu-0.586
 Castro-0.571
bred-0.513
ful-0.502
Token via
Token ID2884
Feature activation+2.71e-01

Pos logit contributions
PDATE+2.217
atism+2.085
+1.939
 mathemat+1.937
ADRA+1.905
Neg logit contributions
lly-1.720
 liqu-1.704
 Castro-1.665
bred-1.520
 Honour-1.471
Token
Token ID247
Feature activation+1.15e-01

Pos logit contributions
lly+0.622
 liqu+0.616
 Castro+0.611
bred+0.567
ful+0.562
Neg logit contributions
PDATE-0.345
atism-0.301
 mathemat-0.274
ADRA-0.265
-0.263
Tokent
Token ID83
Feature activation-1.47e-01

Pos logit contributions
PDATE+1.026
atism+0.971
+0.896
ADRA+0.892
 mathemat+0.876
Neg logit contributions
 liqu-0.906
lly-0.903
 Castro-0.894
bred-0.802
 Honour-0.790
Token figure
Token ID3785
Feature activation-2.87e-01

Pos logit contributions
lly+1.155
 liqu+1.143
 Castro+1.125
bred+1.008
ful+0.997
Neg logit contributions
PDATE-1.377
atism-1.253
ADRA-1.173
 mathemat-1.169
-1.163
Token any
Token ID597
Feature activation+1.39e-01

Pos logit contributions
lly+2.319
 liqu+2.291
 Castro+2.279
bred+2.032
ful+2.003
Neg logit contributions
PDATE-2.624
atism-2.392
ADRA-2.241
 mathemat-2.227
-2.200
Token of
Token ID286
Feature activation+3.55e-01

Pos logit contributions
PDATE+1.317
atism+1.229
 mathemat+1.139
ADRA+1.138
+1.138
Neg logit contributions
lly-1.037
 liqu-1.028
 Castro-1.011
bred-0.909
ful-0.888
Token this
Token ID428
Feature activation-1.56e-02

Pos logit contributions
PDATE+3.319
atism+3.153
+2.914
 mathemat+2.870
ADRA+2.858
Neg logit contributions
lly-2.606
 liqu-2.588
 Castro-2.503
bred-2.298
 Honour-2.229
Token out
Token ID503
Feature activation-2.20e-01

Pos logit contributions
lly+0.131
 liqu+0.130
 Castro+0.128
bred+0.116
ful+0.114
Neg logit contributions
PDATE-0.135
atism-0.124
 mathemat-0.115
ADRA-0.114
-0.114
Token.
Token ID13
Feature activation+8.31e-02

Pos logit contributions
lly+1.375
 liqu+1.350
 Castro+1.333
bred+1.163
ful+1.139
Neg logit contributions
PDATE-2.424
atism-2.238
ADRA-2.135
 mathemat-2.120
-2.100
Token
Token ID564
Feature activation-6.70e-02

Pos logit contributions
PDATE+0.812
atism+0.760
+0.710
ADRA+0.708
 mathemat+0.698
Neg logit contributions
lly-0.598
 liqu-0.594
 Castro-0.573
bred-0.518
ful-0.505
Token
Token ID250
Feature activation-6.19e-03

Pos logit contributions
lly+0.783
 liqu+0.777
 Castro+0.771
bred+0.720
ful+0.712
Neg logit contributions
PDATE-0.362
atism-0.313
 mathemat-0.274
ADRA-0.272
-0.269
Token(
Token ID7
Feature activation-2.14e-01

Pos logit contributions
lly+0.050
 liqu+0.049
 Castro+0.049
bred+0.044
ful+0.043
Neg logit contributions
PDATE-0.056
atism-0.052
 mathemat-0.048
ADRA-0.048
-0.048
Token open
Token ID1280
Feature activation+1.54e-01

Pos logit contributions
PDATE+1.779
atism+1.637
 mathemat+1.534
ADRA+1.524
+1.522
Neg logit contributions
lly-1.597
 liqu-1.564
 Castro-1.555
bred-1.413
 Honour-1.372
Token enrollment
Token ID20753
Feature activation+2.95e-01

Pos logit contributions
PDATE+1.052
atism+0.940
ADRA+0.853
+0.846
 mathemat+0.838
Neg logit contributions
lly-1.563
 liqu-1.543
 Castro-1.537
bred-1.418
 Honour-1.402
Token period
Token ID2278
Feature activation+1.95e-01

Pos logit contributions
PDATE+2.374
atism+2.179
 mathemat+2.024
ADRA+2.024
+2.019
Neg logit contributions
lly-2.554
 liqu-2.493
 Castro-2.478
bred-2.306
 Honour-2.247
Token and
Token ID290
Feature activation-2.43e-02

Pos logit contributions
PDATE+1.900
atism+1.766
 mathemat+1.649
+1.639
ADRA+1.638
Neg logit contributions
lly-1.410
 liqu-1.389
 Castro-1.366
bred-1.228
 Honour-1.199
Token tighten
Token ID31833
Feature activation-2.38e-01

Pos logit contributions
lly+0.189
 liqu+0.187
 Castro+0.185
bred+0.166
ful+0.163
Neg logit contributions
PDATE-0.226
atism-0.208
 mathemat-0.194
ADRA-0.194
-0.193
Token enrollment
Token ID20753
Feature activation+3.03e-02

Pos logit contributions
lly+1.779
 liqu+1.767
 Castro+1.725
bred+1.540
ful+1.531
Neg logit contributions
PDATE-2.301
atism-2.128
 mathemat-1.994
ADRA-1.966
-1.966
Token eligibility
Token ID20069
Feature activation-1.38e-01

Pos logit contributions
PDATE+0.294
atism+0.273
 mathemat+0.256
ADRA+0.255
+0.255
Neg logit contributions
lly-0.219
 liqu-0.215
 Castro-0.213
bred-0.192
 Honour-0.187
Token "
Token ID366
Feature activation-7.96e-02

Pos logit contributions
lly+0.900
 liqu+0.891
 Castro+0.875
bred+0.765
ful+0.755
Neg logit contributions
PDATE-1.468
atism-1.365
 mathemat-1.284
ADRA-1.284
-1.273
Tokento
Token ID1462
Feature activation+8.88e-02

Pos logit contributions
lly+0.585
 liqu+0.576
 Castro+0.568
bred+0.511
ful+0.503
Neg logit contributions
PDATE-0.783
atism-0.718
 mathemat-0.676
ADRA-0.676
-0.667
Token deter
Token ID2206
Feature activation+3.31e-01

Pos logit contributions
PDATE+0.815
atism+0.750
 mathemat+0.697
ADRA+0.696
+0.692
Neg logit contributions
lly-0.704
 liqu-0.697
 Castro-0.688
bred-0.620
ful-0.609
Token people
Token ID661
Feature activation+4.64e-01

Pos logit contributions
PDATE+2.833
atism+2.610
ADRA+2.441
 mathemat+2.417
+2.400
Neg logit contributions
lly-2.730
 liqu-2.695
 Castro-2.685
bred-2.433
 Honour-2.397
Tokens
Token ID82
Feature activation-2.04e-01

Pos logit contributions
lly+0.686
 liqu+0.676
 Castro+0.666
bred+0.583
ful+0.569
Neg logit contributions
PDATE-1.182
atism-1.101
 mathemat-1.037
ADRA-1.037
-1.030
Token eventual
Token ID19657
Feature activation-3.61e-01

Pos logit contributions
lly+1.587
 liqu+1.575
 Castro+1.553
bred+1.393
ful+1.371
Neg logit contributions
PDATE-1.919
atism-1.763
ADRA-1.645
 mathemat-1.642
-1.635
Token blood
Token ID2910
Feature activation-2.79e-01

Pos logit contributions
lly+2.636
 liqu+2.604
 Castro+2.568
bred+2.283
ful+2.271
Neg logit contributions
PDATE-3.567
atism-3.277
 mathemat-3.071
-3.058
ADRA-3.056
Token alcohol
Token ID5548
Feature activation-7.04e-02

Pos logit contributions
lly+2.550
 liqu+2.538
 Castro+2.490
bred+2.284
ful+2.261
Neg logit contributions
PDATE-2.236
atism-2.036
 mathemat-1.876
ADRA-1.863
-1.843
Token level
Token ID1241
Feature activation+2.02e-01

Pos logit contributions
lly+0.531
 liqu+0.526
 Castro+0.519
bred+0.463
ful+0.457
Neg logit contributions
PDATE-0.675
atism-0.622
 mathemat-0.583
ADRA-0.581
-0.578
Token was
Token ID373
Feature activation-6.08e-01

Pos logit contributions
PDATE+1.918
atism+1.791
 mathemat+1.671
+1.654
ADRA+1.649
Neg logit contributions
lly-1.500
 liqu-1.474
 Castro-1.462
bred-1.316
 Honour-1.288
Token below
Token ID2174
Feature activation-6.06e-01

Pos logit contributions
 liqu+4.231
lly+4.026
 Castro+4.018
 Honour+3.471
 Chin+3.464
Neg logit contributions
PDATE-6.312
atism-5.639
 mathemat-5.454
-5.441
ADRA-5.439
Token the
Token ID262
Feature activation-8.77e-01

Pos logit contributions
 liqu+4.184
lly+4.097
 Castro+4.024
 Honour+3.521
ful+3.507
Neg logit contributions
PDATE-6.249
atism-5.504
 mathemat-5.417
ADRA-5.322
-5.307
Token.
Token ID764
Feature activation-8.82e-01

Pos logit contributions
 liqu+5.374
lly+5.206
 Castro+5.112
 Honour+4.506
 GM+4.460
Neg logit contributions
PDATE-9.579
atism-8.504
ADRA-8.496
 mathemat-8.342
 volunte-8.216
Token08
Token ID2919
Feature activation-3.58e-01

Pos logit contributions
 liqu+5.543
lly+5.529
 Castro+5.509
ful+4.940
 Honour+4.706
Neg logit contributions
PDATE-9.447
 mathemat-8.681
 volunte-8.537
atism-8.326
-7.977
Token threshold
Token ID11387
Feature activation-4.41e-01

Pos logit contributions
lly+2.521
 liqu+2.505
 Castro+2.410
ful+2.166
bred+2.112
Neg logit contributions
PDATE-3.606
atism-3.336
 mathemat-3.190
ADRA-3.165
-3.115
Token generous
Token ID14431
Feature activation-2.12e-01

Pos logit contributions
lly+0.482
 liqu+0.476
 Castro+0.470
bred+0.425
ful+0.417
Neg logit contributions
PDATE-0.549
atism-0.507
 mathemat-0.472
ADRA-0.469
-0.467
Token 10
Token ID838
Feature activation-1.76e-01

Pos logit contributions
lly+1.680
 liqu+1.660
 Castro+1.643
bred+1.476
ful+1.454
Neg logit contributions
PDATE-1.971
atism-1.802
ADRA-1.685
 mathemat-1.676
-1.665
Token metre
Token ID42760
Feature activation-2.94e-01

Pos logit contributions
lly+1.153
 liqu+1.138
 Castro+1.120
bred+0.988
ful+0.962
Neg logit contributions
PDATE-1.857
atism-1.730
 mathemat-1.624
ADRA-1.623
-1.618
Token setback
Token ID34116
Feature activation-1.80e-01

Pos logit contributions
lly+2.260
 liqu+2.259
 Castro+2.227
bred+1.979
ful+1.971
Neg logit contributions
PDATE-2.795
atism-2.554
ADRA-2.402
 mathemat-2.386
-2.357
Token along
Token ID1863
Feature activation-3.82e-01

Pos logit contributions
lly+1.216
 liqu+1.199
 Castro+1.187
ful+1.041
bred+1.040
Neg logit contributions
PDATE-1.883
atism-1.737
ADRA-1.649
 mathemat-1.636
-1.615
Token Y
Token ID575
Feature activation-7.14e-01

Pos logit contributions
lly+2.997
 liqu+2.957
 Castro+2.937
bred+2.628
ful+2.593
Neg logit contributions
PDATE-3.549
atism-3.257
ADRA-3.041
 mathemat-3.034
-3.015
Tokenonge
Token ID14220
Feature activation-1.00e-01

Pos logit contributions
lly+3.276
 Castro+3.207
 liqu+3.164
ful+2.567
bred+2.534
Neg logit contributions
PDATE-9.059
atism-8.310
 mathemat-8.193
ADRA-8.064
-7.880
Token Street
Token ID3530
Feature activation-2.60e-01

Pos logit contributions
lly+0.651
 liqu+0.644
 Castro+0.632
bred+0.557
ful+0.544
Neg logit contributions
PDATE-1.058
atism-0.987
 mathemat-0.926
ADRA-0.925
-0.921
Token and
Token ID290
Feature activation-5.69e-01

Pos logit contributions
lly+1.687
 liqu+1.657
 Castro+1.646
bred+1.437
ful+1.420
Neg logit contributions
PDATE-2.767
atism-2.564
ADRA-2.430
 mathemat-2.417
-2.394
Token a
Token ID257
Feature activation-2.60e-01

Pos logit contributions
 liqu+4.241
lly+4.229
 Castro+4.197
ful+3.684
bred+3.660
Neg logit contributions
PDATE-5.554
atism-5.020
ADRA-4.748
-4.685
 mathemat-4.649
Token 5
Token ID642
Feature activation-4.13e-01

Pos logit contributions
lly+2.061
 liqu+2.055
 Castro+2.027
bred+1.819
ful+1.790
Neg logit contributions
PDATE-2.406
atism-2.203
ADRA-2.056
 mathemat-2.044
-2.035
Token]
Token ID60
Feature activation+6.06e-03

Pos logit contributions
lly+0.899
 liqu+0.879
 Castro+0.867
bred+0.742
ful+0.724
Neg logit contributions
PDATE-1.885
atism-1.765
 mathemat-1.671
ADRA-1.668
-1.658
TokenB
Token ID33
Feature activation+3.10e-01

Pos logit contributions
PDATE+0.053
atism+0.049
 mathemat+0.045
ADRA+0.045
+0.045
Neg logit contributions
lly-0.050
 liqu-0.050
 Castro-0.049
bred-0.045
ful-0.044
Tokenionic
Token ID26523
Feature activation+1.64e-01

Pos logit contributions
PDATE+2.879
atism+2.688
ADRA+2.518
 mathemat+2.480
+2.479
Neg logit contributions
 liqu-2.345
lly-2.322
 Castro-2.305
bred-2.083
 Honour-2.038
Token Commando
Token ID47150
Feature activation-2.58e-01

Pos logit contributions
PDATE+1.555
atism+1.437
ADRA+1.343
 mathemat+1.343
+1.340
Neg logit contributions
lly-1.230
 liqu-1.212
 Castro-1.191
bred-1.080
ful-1.055
Token Rear
Token ID30144
Feature activation-3.77e-01

Pos logit contributions
lly+1.496
 liqu+1.464
 Castro+1.449
bred+1.247
ful+1.227
Neg logit contributions
PDATE-2.924
atism-2.733
ADRA-2.581
 mathemat-2.581
-2.553
Tokenmed
Token ID1150
Feature activation-5.55e-01

Pos logit contributions
lly+2.354
 liqu+2.293
 Castro+2.271
bred+2.037
 Honour+1.968
Neg logit contributions
PDATE-4.095
atism-3.816
ADRA-3.614
 mathemat-3.606
-3.569
Token[/
Token ID13412
Feature activation-2.00e-01

Pos logit contributions
lly+3.347
 liqu+3.222
 Castro+3.203
bred+2.794
ful+2.787
Neg logit contributions
PDATE-6.115
atism-5.677
ADRA-5.386
 mathemat-5.385
-5.292
Tokenb
Token ID65
Feature activation+6.04e-02

Pos logit contributions
lly+1.382
 liqu+1.363
 Castro+1.341
bred+1.195
ful+1.168
Neg logit contributions
PDATE-2.054
atism-1.901
 mathemat-1.786
ADRA-1.784
-1.768
Token][/
Token ID44926
Feature activation-7.30e-03

Pos logit contributions
PDATE+0.712
atism+0.671
ADRA+0.633
 mathemat+0.632
+0.632
Neg logit contributions
lly-0.316
 liqu-0.313
 Castro-0.307
bred-0.260
 Honour-0.251
Tokencolor
Token ID8043
Feature activation+4.18e-01

Pos logit contributions
lly+0.065
 liqu+0.064
 Castro+0.064
bred+0.058
 Honour+0.057
Neg logit contributions
PDATE-0.059
atism-0.055
-0.050
 mathemat-0.050
ADRA-0.050
Token]
Token ID60
Feature activation+3.91e-01

Pos logit contributions
PDATE+4.934
atism+4.710
+4.431
ADRA+4.395
 mathemat+4.383
Neg logit contributions
lly-2.062
 liqu-2.029
 Castro-1.982
bred-1.630
 Honour-1.604
Token between
Token ID1022
Feature activation-3.72e-01

Pos logit contributions
lly+1.936
 liqu+1.906
 Castro+1.863
bred+1.604
ful+1.579
Neg logit contributions
PDATE-3.730
atism-3.445
ADRA-3.290
 mathemat-3.269
-3.252
Token £
Token ID4248
Feature activation-3.28e-01

Pos logit contributions
lly+1.984
 liqu+1.936
 Castro+1.892
bred+1.608
 Honour+1.593
Neg logit contributions
PDATE-4.447
atism-4.113
ADRA-3.940
 mathemat-3.924
-3.871
Token200
Token ID2167
Feature activation-2.63e-01

Pos logit contributions
lly+2.312
 liqu+2.263
 Castro+2.256
bred+2.023
ful+1.994
Neg logit contributions
PDATE-3.373
atism-3.048
 mathemat-2.949
ADRA-2.862
 volunte-2.855
Token,
Token ID11
Feature activation-4.04e-01

Pos logit contributions
lly+1.641
 liqu+1.617
 Castro+1.591
bred+1.391
ful+1.359
Neg logit contributions
PDATE-2.862
atism-2.672
 mathemat-2.515
ADRA-2.513
-2.500
Token000
Token ID830
Feature activation-3.46e-01

Pos logit contributions
lly+3.272
 liqu+3.231
 Castro+3.202
ful+2.885
bred+2.882
Neg logit contributions
PDATE-3.752
atism-3.303
 mathemat-3.201
ADRA-3.153
 volunte-3.053
Token and
Token ID290
Feature activation-6.46e-01

Pos logit contributions
lly+2.351
 liqu+2.312
 Castro+2.275
bred+2.018
ful+1.983
Neg logit contributions
PDATE-3.563
atism-3.315
ADRA-3.113
 mathemat-3.109
-3.085
Token £
Token ID4248
Feature activation-1.12e-01

Pos logit contributions
lly+3.237
 liqu+3.189
 Castro+3.068
bred+2.609
ful+2.585
Neg logit contributions
PDATE-7.785
atism-7.214
 mathemat-6.892
-6.863
ADRA-6.831
Token700
Token ID9879
Feature activation-1.56e-01

Pos logit contributions
lly+0.790
 liqu+0.782
 Castro+0.770
bred+0.684
ful+0.669
Neg logit contributions
PDATE-1.112
atism-1.036
ADRA-0.968
 mathemat-0.965
-0.964
Token,
Token ID11
Feature activation-4.19e-01

Pos logit contributions
lly+0.935
 liqu+0.922
 Castro+0.906
bred+0.787
ful+0.767
Neg logit contributions
PDATE-1.740
atism-1.626
 mathemat-1.533
ADRA-1.532
-1.524
Token000
Token ID830
Feature activation-2.05e-01

Pos logit contributions
lly+3.387
 liqu+3.291
 Castro+3.278
ful+2.992
bred+2.958
Neg logit contributions
PDATE-3.963
atism-3.475
 mathemat-3.419
ADRA-3.297
 volunte-3.264
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
lly+1.300
 liqu+1.286
 Castro+1.265
bred+1.109
ful+1.079
Neg logit contributions
PDATE-2.201
atism-2.052
 mathemat-1.932
ADRA-1.926
-1.919
Token favorite
Token ID4004
Feature activation+7.79e-02

Pos logit contributions
lly+0.126
 liqu+0.125
 Castro+0.123
bred+0.112
 Honour+0.110
Neg logit contributions
PDATE-0.122
atism-0.113
 mathemat-0.103
-0.103
ADRA-0.102
Token favorite
Token ID4004
Feature activation-1.36e-01

Pos logit contributions
PDATE+0.637
atism+0.584
 mathemat+0.538
+0.533
ADRA+0.533
Neg logit contributions
lly-0.682
 liqu-0.671
 Castro-0.662
bred-0.608
 Honour-0.598
Token -
Token ID532
Feature activation-6.54e-01

Pos logit contributions
lly+0.998
 liqu+0.986
 Castro+0.972
bred+0.869
ful+0.850
Neg logit contributions
PDATE-1.333
atism-1.234
 mathemat-1.153
ADRA-1.151
-1.144
Token July
Token ID2901
Feature activation-4.51e-01

Pos logit contributions
 Castro+3.676
lly+3.650
 liqu+3.624
 Honour+3.097
 Chin+3.096
Neg logit contributions
PDATE-7.588
atism-6.995
 mathemat-6.752
ADRA-6.719
 volunte-6.713
Token 6
Token ID718
Feature activation-4.20e-01

Pos logit contributions
lly+3.330
 Castro+3.298
 liqu+3.272
bred+2.923
ful+2.902
Neg logit contributions
PDATE-4.630
atism-4.061
 mathemat-4.008
 volunte-3.934
ADRA-3.893
Token,
Token ID11
Feature activation-7.16e-01

Pos logit contributions
lly+2.328
 liqu+2.243
 Castro+2.241
bred+1.935
ful+1.922
Neg logit contributions
PDATE-5.073
atism-4.579
 mathemat-4.537
ADRA-4.409
 volunte-4.393
Token 2010
Token ID3050
Feature activation+3.65e-02

Pos logit contributions
lly+4.280
 liqu+4.128
 Castro+4.076
 Honour+3.550
bred+3.517
Neg logit contributions
PDATE-7.661
atism-7.314
ADRA-7.101
 mathemat-6.891
-6.777
Token
Token ID198
Feature activation-1.10e-01

Pos logit contributions
PDATE+0.333
atism+0.320
+0.300
ADRA+0.291
Buff+0.286
Neg logit contributions
lly-0.272
 liqu-0.272
 Castro-0.264
bred-0.236
 Honour-0.231
Token
Token ID198
Feature activation-2.36e-01

Pos logit contributions
lly+0.793
 liqu+0.788
 Castro+0.772
bred+0.688
 Honour+0.672
Neg logit contributions
PDATE-1.059
atism-1.000
-0.933
ADRA-0.924
 mathemat-0.914
TokenSubject
Token ID19776
Feature activation-1.99e-01

Pos logit contributions
lly+1.793
 Castro+1.743
 liqu+1.742
bred+1.573
ful+1.561
Neg logit contributions
PDATE-2.291
atism-2.087
 mathemat-2.000
ADRA-1.952
 volunte-1.912
Token:
Token ID25
Feature activation-4.86e-01

Pos logit contributions
lly+1.390
 liqu+1.368
 Castro+1.352
bred+1.202
ful+1.181
Neg logit contributions
PDATE-2.025
atism-1.869
 mathemat-1.764
ADRA-1.757
-1.735