TOP ACTIVATIONS
MAX = 1.903

s . All told , Trump s
Danny Kane hit the home post after getting on the end
iologist at the University of Washington who has been studying Jak
get more information and they told us that Mer rick Cemetery
had one catch for 6 yards . I was just out
at the Economic Club in Washington , DC , in this
irony ), they have become aware of each other sexually and
' You could tell she was hurting , but
s stack frame | void * arg | + -------------------- --+
teammates raising their arms in celebration behind him .
Institute of World Politics in Washington , DC ,
had said , " How cum nobody just does simple stories
God , page 127 , said :
This is explained here . * mod eline * * vim
. Back handedly acknowledging that Kiev has created a humanitarian catastrophe
' s frame | void * arg | + -------------------- --+
where . Her so - called urban thru - h ikes
of all children , terminate if any are not valid //
that he was the only realistic option as Kenya 's future
x . x . 77 115 57 21 x . x

MIN ACTIVATIONS
MIN = -2.134

bad ' people around Trump MORE pr attled inqu is itive
than 10 years in prison MORE , in his weekly column
on sleeve at basketball game MORE and Hillary Clinton Hillary Diane
than 10 years in prison MORE and Hillary Clinton Hillary Diane
than 10 years in prison MORE and tick ed through her
holes in American politics ' MORE completed a swing through North
twitter . com / O JV L F Re K ch
. com / Eh U g I x 2 N t
Other media [ edit ] Video games
Film ography [ edit ] Al bum
holes in American politics ' MORE , told 77 WA BC
was , and what Ad e o R ess i warned
. Follow @ greg ory k ort e on
https :// bit co int alk . org / index .
: 13 PH OTOS Donald Trump 's potential running
. com / 74 E g V V p DM 2
. twitter . com / j Ax 6 c U xi
Video games [ edit ] In the
by Robert Math is (@ r math is _ the 1
holes in American politics ' MORE as a 68 - year

INTERVAL 0.893 - 1.903
CONTAINS 0.787%

common as we become more aware of unconscious bias ,
Tom Holland goal dis allowed for off side in
as if the way to engage a digital - sav vy
some of whom have moved with local forces within 20 miles
]. each do | i | print "# { i }

INTERVAL -0.116 - 0.893
CONTAINS 64.515%

- team repet itions . Lyn ch had one
), John T rav olt a and Tom Cruise ( for
in the eyes of his opponents . Social movements are back
cracked areas in the bones , and meticulous pre - cle
later in the election cycle than the GW Batt leground poll

INTERVAL -1.125 - -0.116
CONTAINS 34.536%

any bad blood will carry over between seasons , noting that
initiate what people inside Goldman would eventually dub the
when Sam Bac ile would be arrested " drew wide attention
right now , you really feel as if you
Stadium , and forced to watch eight people , who were

INTERVAL -2.134 - -1.125
CONTAINS 0.163%

1996 Jerry Se infeld 1996 Garry Sh
ACLU of Oregon (@ AC LU _ OR ) May 29
. ( 1981 ). A psych ometric investigation of the standard
t . co / X h 7 J m B z
www . bel fast te legraph . co . uk /
Tokens
Token ID82
Feature activation+6.09e-01

Pos logit contributions
idth+0.468
esters+0.396
ammy+0.352
ilyn+0.351
ples+0.345
Neg logit contributions
 mathemat-0.489
ADRA-0.460
��-0.459
ingly-0.447
WARN-0.440
Token.
Token ID13
Feature activation+1.19e-01

Pos logit contributions
 mathemat+1.797
��+1.674
ADRA+1.671
ingly+1.656
MAP+1.623
Neg logit contributions
idth-1.950
esters-1.637
ammy-1.499
ilyn-1.480
uv-1.462
Token
Token ID198
Feature activation+3.58e-01

Pos logit contributions
 mathemat+0.324
ADRA+0.297
��+0.297
ingly+0.291
MAP+0.284
Neg logit contributions
idth-0.411
esters-0.352
ammy-0.324
ilyn-0.320
 Rated-0.319
Token
Token ID198
Feature activation+3.02e-01

Pos logit contributions
 mathemat+0.931
ADRA+0.850
��+0.847
ingly+0.822
MAP+0.806
Neg logit contributions
idth-1.288
esters-1.110
ammy-1.026
ilyn-1.014
uv-1.009
TokenAll
Token ID3237
Feature activation-2.06e-01

Pos logit contributions
 mathemat+0.879
��+0.801
ADRA+0.791
ingly+0.776
MAP+0.752
Neg logit contributions
idth-1.001
esters-0.855
ammy-0.782
uv-0.774
ples-0.769
Token told
Token ID1297
Feature activation+1.90e+00

Pos logit contributions
idth+0.653
esters+0.554
ammy+0.508
uv+0.505
ples+0.503
Neg logit contributions
 mathemat-0.608
��-0.567
ADRA-0.564
MAP-0.548
ingly-0.547
Token,
Token ID11
Feature activation-1.88e-01

Pos logit contributions
 mathemat+4.773
ingly+4.507
��+4.454
ADRA+4.329
 Leopard+4.189
Neg logit contributions
idth-6.469
esters-5.583
ilyn-5.166
ammy-5.160
uv-5.101
Token Trump
Token ID1301
Feature activation-1.57e-01

Pos logit contributions
idth+0.611
esters+0.517
ammy+0.471
ples+0.465
 Rated+0.464
Neg logit contributions
 mathemat-0.546
ADRA-0.517
��-0.512
WARN-0.497
ingly-0.495
Token
Token ID447
Feature activation+2.04e-02

Pos logit contributions
idth+0.493
esters+0.415
ammy+0.371
ples+0.370
ilyn+0.370
Neg logit contributions
 mathemat-0.475
ADRA-0.448
��-0.447
ingly-0.438
MAP-0.432
Token
Token ID247
Feature activation+6.97e-02

Pos logit contributions
 mathemat+0.060
ADRA+0.054
ingly+0.052
��+0.052
MAP+0.051
Neg logit contributions
idth-0.068
esters-0.058
ammy-0.053
ples-0.052
ilyn-0.052
Tokens
Token ID82
Feature activation+7.28e-02

Pos logit contributions
 mathemat+0.233
ADRA+0.218
��+0.216
ingly+0.210
MAP+0.209
Neg logit contributions
idth-0.200
esters-0.166
ammy-0.149
ilyn-0.147
ples-0.146
Token Danny
Token ID15105
Feature activation-8.71e-02

Pos logit contributions
 mathemat+1.262
��+1.177
ADRA+1.176
ingly+1.160
MAP+1.137
Neg logit contributions
idth-1.362
esters-1.145
ammy-1.048
ilyn-1.034
uv-1.020
Token Kane
Token ID20055
Feature activation-4.03e-02

Pos logit contributions
idth+0.278
esters+0.234
ammy+0.216
ilyn+0.212
uv+0.210
Neg logit contributions
 mathemat-0.258
ADRA-0.241
��-0.240
ingly-0.236
MAP-0.232
Token hit
Token ID2277
Feature activation+4.64e-01

Pos logit contributions
idth+0.129
esters+0.109
ammy+0.101
uv+0.099
ilyn+0.099
Neg logit contributions
 mathemat-0.118
ADRA-0.110
��-0.110
ingly-0.107
MAP-0.107
Token the
Token ID262
Feature activation+3.48e-01

Pos logit contributions
 mathemat+1.365
ADRA+1.272
��+1.271
ingly+1.249
MAP+1.230
Neg logit contributions
idth-1.494
esters-1.259
ammy-1.154
ilyn-1.138
uv-1.127
Token home
Token ID1363
Feature activation+4.04e-01

Pos logit contributions
 mathemat+1.011
ADRA+0.948
��+0.937
ingly+0.921
MAP+0.910
Neg logit contributions
idth-1.128
esters-0.956
ammy-0.882
ilyn-0.869
uv-0.868
Token post
Token ID1281
Feature activation+1.89e+00

Pos logit contributions
 mathemat+1.164
ADRA+1.083
��+1.082
ingly+1.064
MAP+1.046
Neg logit contributions
idth-1.323
esters-1.118
ammy-1.025
ilyn-1.013
uv-1.003
Token after
Token ID706
Feature activation+4.99e-01

Pos logit contributions
 mathemat+5.463
ingly+5.220
��+5.080
ADRA+5.058
MAP+4.933
Neg logit contributions
idth-6.020
esters-5.039
ammy-4.564
ilyn-4.521
 Rated-4.395
Token getting
Token ID1972
Feature activation+9.29e-01

Pos logit contributions
 mathemat+1.467
ADRA+1.369
��+1.367
ingly+1.347
MAP+1.323
Neg logit contributions
idth-1.608
esters-1.356
ammy-1.242
ilyn-1.224
uv-1.213
Token on
Token ID319
Feature activation+3.45e-01

Pos logit contributions
 mathemat+2.580
��+2.390
ADRA+2.387
ingly+2.363
MAP+2.307
Neg logit contributions
idth-3.144
esters-2.666
ammy-2.454
ilyn-2.420
uv-2.380
Token the
Token ID262
Feature activation+2.50e-02

Pos logit contributions
 mathemat+1.026
ADRA+0.957
��+0.957
ingly+0.941
MAP+0.925
Neg logit contributions
idth-1.100
esters-0.923
ammy-0.844
ilyn-0.833
uv-0.825
Token end
Token ID886
Feature activation-5.94e-02

Pos logit contributions
 mathemat+0.068
ADRA+0.064
��+0.063
ingly+0.062
WARN+0.061
Neg logit contributions
idth-0.085
esters-0.073
ammy-0.067
uv-0.067
ilyn-0.067
Tokeniologist
Token ID31599
Feature activation+6.51e-01

Pos logit contributions
 mathemat+2.287
ingly+2.139
��+2.134
ADRA+2.131
MAP+2.089
Neg logit contributions
idth-2.013
esters-1.655
ammy-1.506
ilyn-1.462
ples-1.448
Token at
Token ID379
Feature activation+5.69e-01

Pos logit contributions
 mathemat+1.935
��+1.790
ingly+1.789
ADRA+1.773
MAP+1.742
Neg logit contributions
idth-2.068
esters-1.730
ammy-1.599
ilyn-1.564
uv-1.543
Token the
Token ID262
Feature activation+2.88e-01

Pos logit contributions
 mathemat+1.665
ADRA+1.550
��+1.550
ingly+1.532
MAP+1.500
Neg logit contributions
idth-1.840
esters-1.547
ammy-1.422
ilyn-1.403
uv-1.381
Token University
Token ID2059
Feature activation+2.24e-01

Pos logit contributions
 mathemat+0.802
ADRA+0.755
��+0.750
ingly+0.730
MAP+0.718
Neg logit contributions
idth-0.972
esters-0.827
ammy-0.756
ilyn-0.749
uv-0.747
Token of
Token ID286
Feature activation+4.64e-01

Pos logit contributions
 mathemat+0.649
ADRA+0.600
��+0.596
ingly+0.576
MAP+0.565
Neg logit contributions
idth-0.743
esters-0.634
ammy-0.581
uv-0.573
ilyn-0.572
Token Washington
Token ID2669
Feature activation+1.84e+00

Pos logit contributions
 mathemat+1.369
ADRA+1.276
��+1.276
ingly+1.260
MAP+1.234
Neg logit contributions
idth-1.490
esters-1.253
ammy-1.148
ilyn-1.133
uv-1.117
Token who
Token ID508
Feature activation+5.12e-01

Pos logit contributions
 mathemat+4.892
ingly+4.450
��+4.449
MAP+4.420
ADRA+4.359
Neg logit contributions
idth-6.267
esters-5.207
ammy-4.901
ilyn-4.884
uv-4.801
Token has
Token ID468
Feature activation+3.03e-01

Pos logit contributions
 mathemat+1.543
ADRA+1.442
��+1.440
ingly+1.419
MAP+1.393
Neg logit contributions
idth-1.611
esters-1.352
ammy-1.234
ilyn-1.217
uv-1.201
Token been
Token ID587
Feature activation+1.62e-01

Pos logit contributions
 mathemat+1.023
ADRA+0.963
��+0.963
ingly+0.950
MAP+0.934
Neg logit contributions
idth-0.845
esters-0.691
ammy-0.622
ilyn-0.612
uv-0.603
Token studying
Token ID11065
Feature activation+1.95e-01

Pos logit contributions
 mathemat+0.491
ADRA+0.466
��+0.465
ingly+0.454
MAP+0.445
Neg logit contributions
idth-0.503
esters-0.425
ammy-0.386
ilyn-0.382
uv-0.381
Token Jak
Token ID25845
Feature activation+1.20e-01

Pos logit contributions
 mathemat+0.553
ADRA+0.517
��+0.515
ingly+0.505
MAP+0.496
Neg logit contributions
idth-0.649
esters-0.549
ammy-0.502
ilyn-0.498
uv-0.495
Token get
Token ID651
Feature activation+4.22e-01

Pos logit contributions
idth+0.219
esters+0.188
ammy+0.171
ilyn+0.166
ples+0.166
Neg logit contributions
 mathemat-0.207
ADRA-0.195
��-0.194
ingly-0.188
MAP-0.186
Token more
Token ID517
Feature activation+1.37e-01

Pos logit contributions
 mathemat+1.207
ADRA+1.123
��+1.122
ingly+1.104
MAP+1.083
Neg logit contributions
idth-1.393
esters-1.181
ammy-1.083
ilyn-1.070
uv-1.057
Token information
Token ID1321
Feature activation+3.20e-01

Pos logit contributions
 mathemat+0.399
ADRA+0.374
��+0.374
ingly+0.363
MAP+0.358
Neg logit contributions
idth-0.441
esters-0.375
ammy-0.343
ilyn-0.338
uv-0.335
Token and
Token ID290
Feature activation+1.08e-01

Pos logit contributions
 mathemat+0.945
ADRA+0.883
��+0.881
ingly+0.865
MAP+0.850
Neg logit contributions
idth-1.027
esters-0.866
ammy-0.792
ilyn-0.781
uv-0.773
Token they
Token ID484
Feature activation+4.24e-01

Pos logit contributions
 mathemat+0.325
ADRA+0.306
��+0.304
ingly+0.297
MAP+0.292
Neg logit contributions
idth-0.338
esters-0.288
ammy-0.262
ilyn-0.259
uv-0.255
Token told
Token ID1297
Feature activation+1.76e+00

Pos logit contributions
 mathemat+1.186
ADRA+1.110
��+1.108
ingly+1.077
MAP+1.067
Neg logit contributions
idth-1.424
esters-1.218
ammy-1.115
ilyn-1.102
uv-1.091
Token us
Token ID514
Feature activation-4.67e-01

Pos logit contributions
 mathemat+2.728
ingly+2.554
��+2.412
WARN+2.323
ADRA+2.319
Neg logit contributions
idth-7.956
esters-6.941
ilyn-6.591
ammy-6.584
ples-6.509
Token that
Token ID326
Feature activation+1.38e-01

Pos logit contributions
idth+1.364
esters+1.168
ammy+1.047
ilyn+1.030
 Rated+1.029
Neg logit contributions
 mathemat-1.485
ADRA-1.400
��-1.383
ingly-1.345
MAP-1.329
Token Mer
Token ID4638
Feature activation+2.94e-01

Pos logit contributions
 mathemat+0.417
ADRA+0.390
��+0.388
ingly+0.381
MAP+0.374
Neg logit contributions
idth-0.430
esters-0.368
ammy-0.332
ilyn-0.328
uv-0.326
Tokenrick
Token ID5557
Feature activation+1.17e-01

Pos logit contributions
 mathemat+0.790
ADRA+0.718
��+0.718
ingly+0.691
MAP+0.684
Neg logit contributions
idth-1.048
esters-0.896
ammy-0.831
ilyn-0.830
uv-0.817
Token Cemetery
Token ID35637
Feature activation-2.59e-01

Pos logit contributions
 mathemat+0.288
ADRA+0.263
��+0.262
ingly+0.257
MAP+0.251
Neg logit contributions
idth-0.435
esters-0.376
ammy-0.349
ilyn-0.346
uv-0.343
Token had
Token ID550
Feature activation+2.08e-01

Pos logit contributions
idth+0.049
esters+0.042
ammy+0.038
uv+0.038
 Rated+0.038
Neg logit contributions
 mathemat-0.044
ADRA-0.043
��-0.042
MAP-0.041
WARN-0.040
Token one
Token ID530
Feature activation-2.17e-01

Pos logit contributions
 mathemat+0.635
ADRA+0.597
��+0.596
ingly+0.584
MAP+0.577
Neg logit contributions
idth-0.647
esters-0.542
ammy-0.493
ilyn-0.485
uv-0.482
Token catch
Token ID4929
Feature activation+5.75e-01

Pos logit contributions
idth+0.695
esters+0.597
ammy+0.555
uv+0.547
 Rated+0.545
Neg logit contributions
 mathemat-0.621
��-0.592
ADRA-0.588
MAP-0.563
ingly-0.554
Token for
Token ID329
Feature activation-1.32e-01

Pos logit contributions
 mathemat+1.749
ADRA+1.638
��+1.636
ingly+1.598
MAP+1.579
Neg logit contributions
idth-1.802
esters-1.512
ammy-1.379
ilyn-1.358
uv-1.346
Token 6
Token ID718
Feature activation-1.01e-01

Pos logit contributions
idth+0.383
esters+0.323
ammy+0.290
 Rated+0.288
uv+0.283
Neg logit contributions
 mathemat-0.421
ADRA-0.401
��-0.401
MAP-0.387
WARN-0.386
Token yards
Token ID5695
Feature activation+1.75e+00

Pos logit contributions
idth+0.344
esters+0.296
ammy+0.270
uv+0.269
ples+0.268
Neg logit contributions
 mathemat-0.276
��-0.257
ADRA-0.256
ingly-0.250
WARN-0.247
Token.
Token ID13
Feature activation+2.85e-01

Pos logit contributions
 mathemat+4.724
ingly+4.469
��+4.341
ADRA+4.289
MAP+4.237
Neg logit contributions
idth-5.926
esters-4.939
ilyn-4.569
ammy-4.552
uv-4.378
Token I
Token ID314
Feature activation-1.68e-01

Pos logit contributions
 mathemat+0.881
��+0.812
ADRA+0.812
ingly+0.791
MAP+0.777
Neg logit contributions
idth-0.890
esters-0.749
ammy-0.683
 Rated-0.682
uv-0.673
Token was
Token ID373
Feature activation+1.26e-01

Pos logit contributions
idth+0.530
esters+0.455
ammy+0.418
uv+0.416
 Rated+0.414
Neg logit contributions
 mathemat-0.489
ADRA-0.478
��-0.470
MAP-0.454
WARN-0.444
Token just
Token ID655
Feature activation+7.69e-02

Pos logit contributions
 mathemat+0.367
��+0.345
ADRA+0.344
ingly+0.334
MAP+0.330
Neg logit contributions
idth-0.401
esters-0.346
ammy-0.316
uv-0.313
 Rated-0.311
Token out
Token ID503
Feature activation+2.84e-01

Pos logit contributions
 mathemat+0.224
��+0.210
ADRA+0.210
ingly+0.203
MAP+0.201
Neg logit contributions
idth-0.245
esters-0.212
ammy-0.194
uv-0.192
 Rated-0.190
Token at
Token ID379
Feature activation+3.22e-01

Pos logit contributions
idth+0.154
esters+0.127
ammy+0.116
ilyn+0.113
uv+0.112
Neg logit contributions
 mathemat-0.166
ADRA-0.157
��-0.156
ingly-0.152
MAP-0.151
Token the
Token ID262
Feature activation+2.54e-01

Pos logit contributions
 mathemat+0.961
��+0.899
ADRA+0.896
ingly+0.882
MAP+0.867
Neg logit contributions
idth-1.026
esters-0.865
ammy-0.791
ilyn-0.777
uv-0.768
Token Economic
Token ID11279
Feature activation+1.03e-01

Pos logit contributions
 mathemat+0.670
ADRA+0.624
��+0.621
ingly+0.606
MAP+0.598
Neg logit contributions
idth-0.891
esters-0.761
ammy-0.706
ilyn-0.694
 Rated-0.690
Token Club
Token ID6289
Feature activation+7.09e-02

Pos logit contributions
 mathemat+0.246
��+0.226
ingly+0.225
ADRA+0.224
MAP+0.218
Neg logit contributions
idth-0.391
esters-0.337
ammy-0.314
ilyn-0.310
uv-0.307
Token in
Token ID287
Feature activation+1.17e-01

Pos logit contributions
 mathemat+0.199
ADRA+0.185
��+0.185
ingly+0.181
MAP+0.178
Neg logit contributions
idth-0.238
esters-0.202
ammy-0.186
ilyn-0.183
uv-0.181
Token Washington
Token ID2669
Feature activation+1.73e+00

Pos logit contributions
 mathemat+0.303
��+0.282
ADRA+0.282
ingly+0.273
MAP+0.269
Neg logit contributions
idth-0.419
esters-0.361
ammy-0.334
ilyn-0.328
uv-0.326
Token,
Token ID11
Feature activation+3.68e-01

Pos logit contributions
 mathemat+4.772
��+4.446
ingly+4.414
ADRA+4.394
MAP+4.339
Neg logit contributions
idth-5.754
esters-4.811
ammy-4.417
ilyn-4.388
uv-4.312
Token DC
Token ID6257
Feature activation+6.26e-01

Pos logit contributions
 mathemat+0.951
ADRA+0.881
��+0.878
ingly+0.859
MAP+0.835
Neg logit contributions
idth-1.316
esters-1.134
ammy-1.054
ilyn-1.038
uv-1.031
Token,
Token ID11
Feature activation+2.65e-01

Pos logit contributions
 mathemat+1.852
��+1.728
ADRA+1.725
ingly+1.708
MAP+1.676
Neg logit contributions
idth-2.003
esters-1.679
ammy-1.537
ilyn-1.518
uv-1.495
Tokenin
Token ID259
Feature activation+2.07e-01

Pos logit contributions
 mathemat+0.801
ADRA+0.753
��+0.749
ingly+0.733
MAP+0.716
Neg logit contributions
idth-0.830
esters-0.702
ammy-0.641
 Rated-0.630
uv-0.630
Token this
Token ID428
Feature activation+3.08e-01

Pos logit contributions
 mathemat+0.712
ADRA+0.675
��+0.674
ingly+0.658
MAP+0.652
Neg logit contributions
idth-0.564
esters-0.460
ammy-0.415
ilyn-0.407
uv-0.401
Token irony
Token ID21296
Feature activation-1.21e-01

Pos logit contributions
 mathemat+0.198
ADRA+0.186
��+0.186
ingly+0.182
MAP+0.178
Neg logit contributions
idth-0.217
esters-0.184
ammy-0.168
ilyn-0.165
uv-0.164
Token),
Token ID828
Feature activation-5.15e-01

Pos logit contributions
idth+0.355
esters+0.295
ammy+0.268
ilyn+0.262
uv+0.260
Neg logit contributions
 mathemat-0.388
ADRA-0.364
��-0.364
ingly-0.354
MAP-0.351
Token they
Token ID484
Feature activation+7.21e-02

Pos logit contributions
idth+1.631
esters+1.389
ammy+1.272
 Rated+1.250
ilyn+1.244
Neg logit contributions
 mathemat-1.510
��-1.430
ADRA-1.424
ingly-1.382
MAP-1.366
Token have
Token ID423
Feature activation+5.96e-02

Pos logit contributions
 mathemat+0.202
��+0.190
ADRA+0.190
ingly+0.183
MAP+0.182
Neg logit contributions
idth-0.242
esters-0.206
ammy-0.189
ilyn-0.186
 Rated-0.185
Token become
Token ID1716
Feature activation+1.12e+00

Pos logit contributions
 mathemat+0.175
ADRA+0.165
��+0.165
ingly+0.160
MAP+0.159
Neg logit contributions
idth-0.191
esters-0.161
ammy-0.147
ilyn-0.145
 Rated-0.144
Token aware
Token ID3910
Feature activation+1.70e+00

Pos logit contributions
 mathemat+3.190
ingly+2.937
ADRA+2.930
��+2.919
MAP+2.839
Neg logit contributions
idth-3.741
esters-3.126
ammy-2.893
ilyn-2.857
ples-2.789
Token of
Token ID286
Feature activation+5.91e-01

Pos logit contributions
 mathemat+4.284
ingly+4.095
��+3.862
ADRA+3.818
MAP+3.758
Neg logit contributions
idth-6.048
esters-5.101
ammy-4.819
ilyn-4.701
uv-4.618
Token each
Token ID1123
Feature activation-5.07e-02

Pos logit contributions
 mathemat+1.720
��+1.600
ADRA+1.600
ingly+1.582
MAP+1.546
Neg logit contributions
idth-1.924
esters-1.621
ammy-1.487
ilyn-1.467
uv-1.448
Token other
Token ID584
Feature activation+7.20e-01

Pos logit contributions
idth+0.163
esters+0.137
ammy+0.126
ilyn+0.124
uv+0.122
Neg logit contributions
 mathemat-0.149
ADRA-0.139
��-0.139
ingly-0.138
MAP-0.135
Token sexually
Token ID11363
Feature activation+4.92e-01

Pos logit contributions
 mathemat+2.117
ingly+1.977
��+1.962
ADRA+1.961
MAP+1.910
Neg logit contributions
idth-2.306
esters-1.919
ammy-1.774
ilyn-1.747
uv-1.712
Token and
Token ID290
Feature activation-2.12e-02

Pos logit contributions
 mathemat+1.463
��+1.364
ADRA+1.363
ingly+1.347
MAP+1.319
Neg logit contributions
idth-1.568
esters-1.316
ammy-1.206
ilyn-1.189
uv-1.173
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
 mathemat+0.824
ADRA+0.777
��+0.771
ingly+0.755
MAP+0.747
Neg logit contributions
idth-0.817
esters-0.685
ammy-0.623
ilyn-0.613
uv-0.612
Token
Token ID198
Feature activation-8.07e-02

Pos logit contributions
idth+0.385
esters+0.332
ammy+0.306
uv+0.305
 Rated+0.303
Neg logit contributions
 mathemat-0.312
ADRA-0.283
��-0.280
ingly-0.265
 juven-0.262
Token'
Token ID6
Feature activation+1.02e-01

Pos logit contributions
idth+0.264
esters+0.225
ammy+0.207
uv+0.205
ilyn+0.204
Neg logit contributions
 mathemat-0.236
��-0.216
ADRA-0.214
ingly-0.211
MAP-0.203
TokenYou
Token ID1639
Feature activation-8.26e-02

Pos logit contributions
 mathemat+0.304
��+0.281
ADRA+0.279
ingly+0.274
MAP+0.267
Neg logit contributions
idth-0.328
esters-0.278
ammy-0.255
ilyn-0.251
uv-0.251
Token could
Token ID714
Feature activation+8.67e-01

Pos logit contributions
idth+0.286
esters+0.248
ammy+0.230
uv+0.225
 Rated+0.224
Neg logit contributions
 mathemat-0.218
ADRA-0.208
��-0.206
MAP-0.198
WARN-0.195
Token tell
Token ID1560
Feature activation+1.69e+00

Pos logit contributions
 mathemat+2.338
��+2.159
ADRA+2.155
ingly+2.150
MAP+2.085
Neg logit contributions
idth-2.991
esters-2.548
ammy-2.352
ilyn-2.328
uv-2.290
Token she
Token ID673
Feature activation-5.60e-02

Pos logit contributions
 mathemat+4.868
ingly+4.660
��+4.509
 Leopard+4.455
ADRA+4.451
Neg logit contributions
idth-5.438
esters-4.481
ammy-4.088
ilyn-4.063
ples-3.934
Token was
Token ID373
Feature activation+1.69e-01

Pos logit contributions
idth+0.175
esters+0.149
ammy+0.136
uv+0.135
ilyn+0.134
Neg logit contributions
 mathemat-0.167
ADRA-0.160
��-0.158
MAP-0.152
ingly-0.151
Token hurting
Token ID21530
Feature activation-3.58e-01

Pos logit contributions
 mathemat+0.492
ADRA+0.460
��+0.460
ingly+0.449
MAP+0.442
Neg logit contributions
idth-0.545
esters-0.464
ammy-0.426
ilyn-0.418
uv-0.417
Token,
Token ID11
Feature activation+2.43e-01

Pos logit contributions
idth+1.087
esters+0.916
ammy+0.833
uv+0.823
ples+0.823
Neg logit contributions
 mathemat-1.103
��-1.046
ADRA-1.043
WARN-0.999
ingly-0.995
Token but
Token ID475
Feature activation+3.90e-01

Pos logit contributions
 mathemat+0.725
ADRA+0.682
��+0.679
ingly+0.659
MAP+0.653
Neg logit contributions
idth-0.768
esters-0.650
ammy-0.594
ilyn-0.586
uv-0.584
Tokens
Token ID82
Feature activation+4.96e-01

Pos logit contributions
 mathemat+1.515
��+1.419
ADRA+1.419
ingly+1.401
MAP+1.376
Neg logit contributions
idth-1.411
esters-1.170
ammy-1.062
ilyn-1.046
uv-1.030
Token stack
Token ID8931
Feature activation+2.25e-01

Pos logit contributions
 mathemat+1.235
��+1.131
ADRA+1.130
ingly+1.121
MAP+1.091
Neg logit contributions
idth-1.813
esters-1.567
ammy-1.454
ilyn-1.435
uv-1.413
Token frame
Token ID5739
Feature activation+9.39e-02

Pos logit contributions
 mathemat+0.593
��+0.549
ADRA+0.547
ingly+0.547
MAP+0.532
Neg logit contributions
idth-0.788
esters-0.678
ammy-0.627
ilyn-0.616
ples-0.606
Token |
Token ID930
Feature activation+2.67e-01

Pos logit contributions
 mathemat+0.298
ADRA+0.280
��+0.279
ingly+0.277
MAP+0.272
Neg logit contributions
idth-0.280
esters-0.232
ammy-0.212
ilyn-0.209
uv-0.205
Token void
Token ID7951
Feature activation+9.05e-01

Pos logit contributions
 mathemat+0.597
��+0.546
ADRA+0.545
ingly+0.533
MAP+0.513
Neg logit contributions
idth-1.046
esters-0.910
ammy-0.848
uv-0.840
ilyn-0.839
Token *
Token ID1635
Feature activation+1.67e+00

Pos logit contributions
 mathemat+1.662
��+1.505
ADRA+1.502
ingly+1.487
MAP+1.426
Neg logit contributions
idth-3.852
esters-3.423
ammy-3.218
ilyn-3.199
uv-3.137
Tokenarg
Token ID853
Feature activation+5.19e-01

Pos logit contributions
 mathemat+2.427
ingly+2.211
ADRA+2.147
WARN+2.145
��+2.137
Neg logit contributions
idth-7.426
esters-6.649
ilyn-6.316
ammy-6.304
 Rated-6.240
Token |
Token ID930
Feature activation+5.73e-01

Pos logit contributions
 mathemat+1.582
ADRA+1.489
��+1.485
ingly+1.483
MAP+1.454
Neg logit contributions
idth-1.593
esters-1.332
ammy-1.221
ilyn-1.207
ples-1.163
Token +
Token ID1343
Feature activation+8.77e-01

Pos logit contributions
 mathemat+1.539
ADRA+1.426
��+1.424
ingly+1.404
MAP+1.378
Neg logit contributions
idth-1.986
esters-1.695
ammy-1.569
ilyn-1.549
uv-1.523
Token--------------------
Token ID19351
Feature activation+2.87e-01

Pos logit contributions
 mathemat+2.509
��+2.334
ADRA+2.331
ingly+2.299
MAP+2.248
Neg logit contributions
idth-2.900
esters-2.454
ammy-2.252
ilyn-2.222
uv-2.198
Token--+
Token ID44785
Feature activation-4.26e-01

Pos logit contributions
 mathemat+1.004
��+0.931
ADRA+0.929
ingly+0.909
MAP+0.898
Neg logit contributions
idth-0.784
esters-0.640
ammy-0.574
ilyn-0.560
uv-0.557
Token teammates
Token ID13107
Feature activation+4.74e-01

Pos logit contributions
 mathemat+0.161
��+0.149
ADRA+0.149
ingly+0.145
MAP+0.144
Neg logit contributions
idth-0.185
esters-0.157
ammy-0.145
ilyn-0.142
uv-0.142
Token raising
Token ID8620
Feature activation+1.59e-01

Pos logit contributions
 mathemat+1.324
ADRA+1.233
��+1.232
ingly+1.205
MAP+1.184
Neg logit contributions
idth-1.590
esters-1.362
ammy-1.251
ilyn-1.232
uv-1.226
Token their
Token ID511
Feature activation-1.66e-02

Pos logit contributions
 mathemat+0.494
ADRA+0.461
��+0.461
ingly+0.450
MAP+0.446
Neg logit contributions
idth-0.490
esters-0.410
ammy-0.372
ilyn-0.368
uv-0.365
Token arms
Token ID5101
Feature activation+3.80e-01

Pos logit contributions
idth+0.054
esters+0.046
ammy+0.042
uv+0.041
ilyn+0.041
Neg logit contributions
 mathemat-0.048
ADRA-0.045
��-0.045
ingly-0.044
MAP-0.043
Token in
Token ID287
Feature activation+2.45e-01

Pos logit contributions
 mathemat+1.188
ADRA+1.113
��+1.111
ingly+1.089
MAP+1.074
Neg logit contributions
idth-1.155
esters-0.964
ammy-0.878
ilyn-0.863
uv-0.855
Token celebration
Token ID14395
Feature activation+1.67e+00

Pos logit contributions
 mathemat+0.686
ADRA+0.640
��+0.639
ingly+0.623
MAP+0.619
Neg logit contributions
idth-0.814
esters-0.696
ammy-0.638
uv-0.634
ilyn-0.633
Token behind
Token ID2157
Feature activation+8.70e-02

Pos logit contributions
 mathemat+4.463
ingly+4.294
��+4.178
ADRA+4.113
MAP+4.007
Neg logit contributions
idth-5.645
esters-4.827
ammy-4.400
ilyn-4.343
uv-4.205
Token him
Token ID683
Feature activation+3.99e-01

Pos logit contributions
 mathemat+0.193
ADRA+0.176
��+0.175
MAP+0.169
ingly+0.168
Neg logit contributions
idth-0.343
esters-0.299
ammy-0.279
ilyn-0.277
uv-0.276
Token.
Token ID13
Feature activation+2.53e-01

Pos logit contributions
 mathemat+1.169
��+1.092
ADRA+1.088
ingly+1.088
MAP+1.055
Neg logit contributions
idth-1.286
esters-1.082
ammy-0.989
ilyn-0.977
uv-0.959
Token
Token ID198
Feature activation+4.14e-01

Pos logit contributions
 mathemat+0.699
��+0.640
ADRA+0.640
ingly+0.628
MAP+0.614
Neg logit contributions
idth-0.868
esters-0.741
ammy-0.683
ilyn-0.677
 Rated-0.672
Token
Token ID198
Feature activation+3.29e-01

Pos logit contributions
 mathemat+1.075
ADRA+0.981
��+0.977
ingly+0.950
MAP+0.930
Neg logit contributions
idth-1.494
esters-1.288
ammy-1.191
ilyn-1.178
uv-1.171
Token Institute
Token ID5136
Feature activation-2.35e-01

Pos logit contributions
 mathemat+1.115
ADRA+1.046
��+1.040
ingly+1.014
MAP+1.001
Neg logit contributions
idth-1.342
esters-1.142
ammy-1.050
ilyn-1.035
uv-1.025
Token of
Token ID286
Feature activation+4.48e-01

Pos logit contributions
idth+0.774
esters+0.654
ammy+0.601
ilyn+0.593
uv+0.585
Neg logit contributions
 mathemat-0.671
��-0.624
ADRA-0.624
ingly-0.616
MAP-0.603
Token World
Token ID2159
Feature activation+2.33e-01

Pos logit contributions
 mathemat+1.417
ADRA+1.332
��+1.327
ingly+1.323
MAP+1.295
Neg logit contributions
idth-1.335
esters-1.102
ammy-1.004
ilyn-0.989
ples-0.968
Token Politics
Token ID17554
Feature activation+1.74e-01

Pos logit contributions
 mathemat+0.743
��+0.696
ADRA+0.696
ingly+0.686
MAP+0.674
Neg logit contributions
idth-0.695
esters-0.576
ammy-0.523
ilyn-0.516
uv-0.508
Token in
Token ID287
Feature activation+2.28e-01

Pos logit contributions
 mathemat+0.528
��+0.496
ADRA+0.495
ingly+0.487
MAP+0.479
Neg logit contributions
idth-0.540
esters-0.454
ammy-0.412
ilyn-0.407
uv-0.401
Token Washington
Token ID2669
Feature activation+1.67e+00

Pos logit contributions
 mathemat+0.589
ADRA+0.546
��+0.545
ingly+0.530
MAP+0.520
Neg logit contributions
idth-0.817
esters-0.704
ammy-0.652
ilyn-0.643
uv-0.638
Token,
Token ID11
Feature activation+2.33e-01

Pos logit contributions
 mathemat+4.561
��+4.230
ingly+4.173
ADRA+4.167
MAP+4.070
Neg logit contributions
idth-5.582
esters-4.704
ammy-4.344
ilyn-4.284
uv-4.250
Token DC
Token ID6257
Feature activation+4.98e-01

Pos logit contributions
 mathemat+0.664
��+0.616
ADRA+0.616
ingly+0.606
MAP+0.595
Neg logit contributions
idth-0.775
esters-0.661
ammy-0.604
ilyn-0.599
uv-0.590
Token,
Token ID11
Feature activation+1.95e-01

Pos logit contributions
 mathemat+1.466
��+1.359
ADRA+1.358
ingly+1.347
MAP+1.316
Neg logit contributions
idth-1.602
esters-1.343
ammy-1.234
ilyn-1.217
uv-1.202
Token
Token ID564
Feature activation-1.06e-01

Pos logit contributions
 mathemat+0.590
��+0.549
ADRA+0.549
ingly+0.540
WARN+0.531
Neg logit contributions
idth-0.617
esters-0.521
ammy-0.471
ilyn-0.468
uv-0.462
Token
Token ID250
Feature activation-7.61e-03

Pos logit contributions
idth+0.420
esters+0.368
ammy+0.343
ilyn+0.338
ples+0.337
Neg logit contributions
 mathemat-0.237
ADRA-0.210
ingly-0.204
MAP-0.202
��-0.202
Token had
Token ID550
Feature activation+2.41e-01

Pos logit contributions
 mathemat+0.093
ADRA+0.088
��+0.088
ingly+0.086
MAP+0.085
Neg logit contributions
idth-0.096
esters-0.081
ammy-0.074
ilyn-0.073
uv-0.072
Token said
Token ID531
Feature activation-2.26e-01

Pos logit contributions
 mathemat+0.722
ADRA+0.676
��+0.676
ingly+0.664
MAP+0.652
Neg logit contributions
idth-0.761
esters-0.639
ammy-0.583
ilyn-0.575
uv-0.568
Token,
Token ID11
Feature activation+4.59e-02

Pos logit contributions
idth+0.685
esters+0.570
ammy+0.520
ilyn+0.511
uv+0.505
Neg logit contributions
 mathemat-0.706
��-0.663
ADRA-0.663
ingly-0.647
MAP-0.639
Token "
Token ID366
Feature activation+3.32e-01

Pos logit contributions
 mathemat+0.137
��+0.128
ADRA+0.128
ingly+0.126
MAP+0.123
Neg logit contributions
idth-0.146
esters-0.122
ammy-0.112
ilyn-0.110
uv-0.109
TokenHow
Token ID2437
Feature activation-4.41e-01

Pos logit contributions
 mathemat+0.934
��+0.866
ADRA+0.865
ingly+0.844
MAP+0.822
Neg logit contributions
idth-1.117
esters-0.951
ammy-0.876
ilyn-0.864
uv-0.857
Tokencum
Token ID36340
Feature activation+1.67e+00

Pos logit contributions
idth+1.460
esters+1.258
ammy+1.165
ilyn+1.154
uv+1.141
Neg logit contributions
 mathemat-1.226
��-1.156
ADRA-1.154
ingly-1.107
MAP-1.096
Token nobody
Token ID8168
Feature activation+1.44e-01

Pos logit contributions
 mathemat+4.841
ingly+4.611
��+4.497
ADRA+4.483
MAP+4.365
Neg logit contributions
idth-5.239
esters-4.391
ammy-4.042
ilyn-3.996
uv-3.955
Token just
Token ID655
Feature activation-3.47e-02

Pos logit contributions
 mathemat+0.403
ADRA+0.383
��+0.382
ingly+0.367
MAP+0.367
Neg logit contributions
idth-0.483
esters-0.414
ammy-0.378
uv-0.373
 Rated-0.371
Token does
Token ID857
Feature activation-2.39e-01

Pos logit contributions
idth+0.111
esters+0.095
ammy+0.086
 Rated+0.085
uv+0.085
Neg logit contributions
 mathemat-0.101
ADRA-0.098
��-0.097
MAP-0.094
WARN-0.092
Token simple
Token ID2829
Feature activation-1.74e-02

Pos logit contributions
idth+0.759
esters+0.640
ammy+0.583
ilyn+0.573
 Rated+0.570
Neg logit contributions
 mathemat-0.714
ADRA-0.674
��-0.673
ingly-0.649
MAP-0.646
Token stories
Token ID3923
Feature activation-1.49e-01

Pos logit contributions
idth+0.054
esters+0.045
ammy+0.041
ilyn+0.040
uv+0.040
Neg logit contributions
 mathemat-0.054
ADRA-0.050
��-0.050
ingly-0.049
MAP-0.049
Token God
Token ID1793
Feature activation+4.08e-02

Pos logit contributions
idth+0.689
esters+0.581
ammy+0.530
uv+0.520
ilyn+0.517
Neg logit contributions
 mathemat-0.635
ADRA-0.594
��-0.580
ingly-0.571
MAP-0.570
Token
Token ID447
Feature activation+2.99e-01

Pos logit contributions
 mathemat+0.119
��+0.111
ADRA+0.111
ingly+0.109
MAP+0.107
Neg logit contributions
idth-0.132
esters-0.112
ammy-0.102
ilyn-0.101
uv-0.100
Token
Token ID251
Feature activation+1.24e-01

Pos logit contributions
 mathemat+0.618
��+0.565
ADRA+0.562
ingly+0.548
MAP+0.531
Neg logit contributions
idth-1.228
esters-1.071
ammy-1.007
ilyn-0.996
uv-0.984
Token,
Token ID11
Feature activation-5.61e-02

Pos logit contributions
 mathemat+0.365
��+0.342
ADRA+0.340
ingly+0.335
MAP+0.328
Neg logit contributions
idth-0.400
esters-0.337
ammy-0.307
ilyn-0.305
uv-0.302
Token page
Token ID2443
Feature activation+1.37e-01

Pos logit contributions
idth+0.175
esters+0.147
ammy+0.134
ilyn+0.133
 Rated+0.132
Neg logit contributions
 mathemat-0.169
��-0.160
ADRA-0.159
ingly-0.156
WARN-0.153
Token 127
Token ID18112
Feature activation+1.64e+00

Pos logit contributions
 mathemat+0.421
ADRA+0.393
��+0.393
ingly+0.384
WARN+0.378
Neg logit contributions
idth-0.425
esters-0.355
ammy-0.323
ilyn-0.319
uv-0.317
Token,
Token ID11
Feature activation+6.82e-01

Pos logit contributions
 mathemat+4.534
ADRA+4.170
ingly+4.170
��+4.155
MAP+4.069
Neg logit contributions
idth-5.441
esters-4.627
ammy-4.280
ilyn-4.217
uv-4.106
Token said
Token ID531
Feature activation-2.76e-01

Pos logit contributions
 mathemat+2.052
��+1.918
ADRA+1.916
ingly+1.890
MAP+1.851
Neg logit contributions
idth-2.154
esters-1.808
ammy-1.651
ilyn-1.630
uv-1.610
Token :
Token ID1058
Feature activation-3.91e-01

Pos logit contributions
idth+0.839
esters+0.703
ammy+0.638
ilyn+0.629
uv+0.625
Neg logit contributions
 mathemat-0.858
ADRA-0.802
��-0.802
ingly-0.782
MAP-0.775
Token
Token ID198
Feature activation-6.89e-01

Pos logit contributions
idth+1.266
esters+1.067
ammy+0.977
ilyn+0.964
uv+0.963
Neg logit contributions
 mathemat-1.144
��-1.061
ADRA-1.059
ingly-1.037
MAP-1.024
Token
Token ID198
Feature activation-6.12e-01

Pos logit contributions
idth+2.195
esters+1.894
uv+1.759
 Rated+1.753
ples+1.721
Neg logit contributions
 mathemat-2.221
ADRA-1.944
��-1.909
 juven-1.857
MAP-1.732
Token This
Token ID770
Feature activation+5.33e-01

Pos logit contributions
 mathemat+0.805
��+0.762
ADRA+0.757
ingly+0.740
WARN+0.718
Neg logit contributions
idth-0.772
esters-0.645
ammy-0.588
 Rated-0.585
uv-0.584
Token is
Token ID318
Feature activation+1.08e-01

Pos logit contributions
 mathemat+1.468
��+1.371
ADRA+1.369
ingly+1.335
MAP+1.307
Neg logit contributions
idth-1.811
esters-1.547
ammy-1.421
ilyn-1.401
ples-1.390
Token explained
Token ID4893
Feature activation+2.14e-02

Pos logit contributions
 mathemat+0.325
ADRA+0.305
��+0.304
ingly+0.297
MAP+0.293
Neg logit contributions
idth-0.339
esters-0.288
ammy-0.261
uv-0.256
ilyn-0.256
Token here
Token ID994
Feature activation+2.29e-01

Pos logit contributions
 mathemat+0.066
ADRA+0.062
��+0.062
ingly+0.060
MAP+0.059
Neg logit contributions
idth-0.066
esters-0.055
ammy-0.050
ilyn-0.050
ples-0.049
Token.
Token ID13
Feature activation+6.34e-01

Pos logit contributions
 mathemat+0.712
��+0.668
ADRA+0.666
ingly+0.651
WARN+0.640
Neg logit contributions
idth-0.701
esters-0.585
ammy-0.531
ilyn-0.525
 Rated-0.518
Token *
Token ID1635
Feature activation+1.64e+00

Pos logit contributions
 mathemat+1.911
��+1.797
ADRA+1.786
ingly+1.751
MAP+1.697
Neg logit contributions
idth-1.996
esters-1.684
ammy-1.541
 Rated-1.520
ilyn-1.520
Tokenmod
Token ID4666
Feature activation+1.07e-01

Pos logit contributions
 mathemat+4.754
ADRA+4.438
��+4.405
ingly+4.397
MAP+4.364
Neg logit contributions
idth-5.263
esters-4.419
ammy-4.031
ilyn-4.018
ples-3.918
Tokeneline
Token ID4470
Feature activation-9.23e-02

Pos logit contributions
 mathemat+0.203
��+0.182
ADRA+0.181
ingly+0.176
MAP+0.170
Neg logit contributions
idth-0.455
esters-0.402
ammy-0.377
ilyn-0.374
uv-0.371
Token*
Token ID9
Feature activation+7.28e-01

Pos logit contributions
idth+0.299
esters+0.252
ammy+0.229
uv+0.227
ilyn+0.226
Neg logit contributions
 mathemat-0.270
��-0.253
ADRA-0.251
ingly-0.245
WARN-0.237
Token *
Token ID1635
Feature activation+9.99e-01

Pos logit contributions
 mathemat+2.100
��+1.962
ADRA+1.957
ingly+1.918
MAP+1.871
Neg logit contributions
idth-2.388
esters-2.027
ammy-1.850
ilyn-1.830
uv-1.813
Tokenvim
Token ID31124
Feature activation+7.36e-02

Pos logit contributions
 mathemat+2.869
ADRA+2.667
��+2.664
ingly+2.641
MAP+2.600
Neg logit contributions
idth-3.269
esters-2.759
ammy-2.526
ilyn-2.501
 Rated-2.454
Token.
Token ID13
Feature activation+5.41e-02

Pos logit contributions
 mathemat+0.673
ADRA+0.628
��+0.628
ingly+0.619
MAP+0.608
Neg logit contributions
idth-0.702
esters-0.589
ammy-0.538
ilyn-0.530
uv-0.523
Token Back
Token ID5157
Feature activation-2.44e-03

Pos logit contributions
 mathemat+0.170
��+0.153
ADRA+0.153
ingly+0.149
MAP+0.146
Neg logit contributions
idth-0.168
esters-0.141
 Rated-0.130
uv-0.128
ilyn-0.128
Tokenhandedly
Token ID43919
Feature activation-2.86e-01

Pos logit contributions
idth+0.008
esters+0.006
ammy+0.006
ilyn+0.006
uv+0.006
Neg logit contributions
 mathemat-0.007
ADRA-0.007
��-0.007
ingly-0.007
MAP-0.007
Token acknowledging
Token ID25937
Feature activation+4.98e-01

Pos logit contributions
idth+0.907
esters+0.775
ammy+0.704
uv+0.697
ilyn+0.694
Neg logit contributions
 mathemat-0.838
��-0.789
ADRA-0.787
ingly-0.761
MAP-0.759
Token that
Token ID326
Feature activation+6.47e-02

Pos logit contributions
 mathemat+1.483
ADRA+1.384
��+1.384
ingly+1.369
MAP+1.340
Neg logit contributions
idth-1.581
esters-1.328
ammy-1.216
ilyn-1.198
uv-1.180
Token Kiev
Token ID22761
Feature activation+1.64e+00

Pos logit contributions
 mathemat+0.199
ADRA+0.186
��+0.186
ingly+0.181
WARN+0.179
Neg logit contributions
idth-0.200
esters-0.168
ammy-0.152
uv-0.150
ilyn-0.150
Token has
Token ID468
Feature activation+1.37e-01

Pos logit contributions
 mathemat+4.409
ingly+4.100
ADRA+4.072
��+4.062
MAP+3.992
Neg logit contributions
idth-5.600
esters-4.695
ammy-4.400
ilyn-4.283
 Rated-4.175
Token created
Token ID2727
Feature activation+1.07e-01

Pos logit contributions
 mathemat+0.389
��+0.362
ADRA+0.362
ingly+0.353
WARN+0.347
Neg logit contributions
idth-0.457
esters-0.386
ammy-0.354
ilyn-0.351
uv-0.348
Token a
Token ID257
Feature activation+7.33e-01

Pos logit contributions
 mathemat+0.322
ADRA+0.300
��+0.300
ingly+0.293
WARN+0.289
Neg logit contributions
idth-0.335
esters-0.283
ammy-0.256
ilyn-0.254
uv-0.252
Token humanitarian
Token ID15706
Feature activation+4.87e-01

Pos logit contributions
 mathemat+1.999
��+1.852
ADRA+1.852
ingly+1.826
MAP+1.787
Neg logit contributions
idth-2.518
esters-2.144
ammy-1.976
ilyn-1.952
uv-1.928
Token catastrophe
Token ID27767
Feature activation+3.74e-01

Pos logit contributions
 mathemat+1.563
��+1.460
ADRA+1.455
ingly+1.425
WARN+1.404
Neg logit contributions
idth-1.450
esters-1.202
ammy-1.095
ilyn-1.074
uv-1.059
Token'
Token ID6
Feature activation+4.58e-01

Pos logit contributions
idth+0.124
esters+0.108
ammy+0.101
ilyn+0.100
uv+0.099
Neg logit contributions
 mathemat-0.068
ADRA-0.062
��-0.062
ingly-0.061
MAP-0.059
Tokens
Token ID82
Feature activation+5.05e-01

Pos logit contributions
 mathemat+1.444
��+1.356
ADRA+1.354
ingly+1.344
MAP+1.318
Neg logit contributions
idth-1.371
esters-1.135
ammy-1.033
ilyn-1.017
uv-1.000
Token frame
Token ID5739
Feature activation+4.10e-01

Pos logit contributions
 mathemat+1.281
��+1.178
ADRA+1.176
ingly+1.164
MAP+1.134
Neg logit contributions
idth-1.828
esters-1.575
ammy-1.459
ilyn-1.441
uv-1.422
Token |
Token ID930
Feature activation+3.40e-01

Pos logit contributions
 mathemat+1.251
ADRA+1.178
��+1.175
ingly+1.175
MAP+1.155
Neg logit contributions
idth-1.253
esters-1.056
ammy-0.968
ilyn-0.960
uv-0.930
Token void
Token ID7951
Feature activation+9.50e-01

Pos logit contributions
 mathemat+0.750
ADRA+0.686
��+0.684
ingly+0.668
MAP+0.643
Neg logit contributions
idth-1.346
esters-1.170
ammy-1.089
ilyn-1.080
uv-1.079
Token *
Token ID1635
Feature activation+1.63e+00

Pos logit contributions
 mathemat+1.884
��+1.708
ADRA+1.702
ingly+1.684
MAP+1.628
Neg logit contributions
idth-3.940
esters-3.464
ammy-3.247
ilyn-3.219
uv-3.177
Tokenarg
Token ID853
Feature activation+5.48e-01

Pos logit contributions
 mathemat+2.453
ingly+2.270
WARN+2.206
MAP+2.204
ADRA+2.202
Neg logit contributions
idth-7.162
esters-6.391
ilyn-6.078
ammy-6.063
 Rated-6.002
Token |
Token ID930
Feature activation+6.16e-01

Pos logit contributions
 mathemat+1.807
ADRA+1.694
��+1.693
ingly+1.668
MAP+1.639
Neg logit contributions
idth-1.576
esters-1.297
ammy-1.172
ilyn-1.153
uv-1.139
Token +
Token ID1343
Feature activation+8.37e-01

Pos logit contributions
 mathemat+1.847
ADRA+1.732
��+1.730
ingly+1.712
MAP+1.689
Neg logit contributions
idth-1.929
esters-1.621
ammy-1.491
ilyn-1.470
uv-1.429
Token--------------------
Token ID19351
Feature activation+2.67e-01

Pos logit contributions
 mathemat+2.449
��+2.286
ADRA+2.286
ingly+2.260
MAP+2.214
Neg logit contributions
idth-2.701
esters-2.275
ammy-2.085
ilyn-2.056
uv-2.026
Token--+
Token ID44785
Feature activation-4.40e-01

Pos logit contributions
 mathemat+0.940
��+0.868
ADRA+0.865
ingly+0.848
MAP+0.837
Neg logit contributions
idth-0.728
esters-0.594
ammy-0.532
ilyn-0.520
uv-0.516
Token where
Token ID810
Feature activation-1.23e-01

Pos logit contributions
 mathemat+0.329
ADRA+0.311
��+0.310
ingly+0.301
MAP+0.299
Neg logit contributions
idth-0.331
esters-0.276
ammy-0.253
ilyn-0.249
uv-0.246
Token.
Token ID13
Feature activation+2.92e-01

Pos logit contributions
idth+0.403
esters+0.342
ammy+0.313
ilyn+0.309
uv+0.306
Neg logit contributions
 mathemat-0.355
ADRA-0.331
��-0.331
ingly-0.323
MAP-0.319
Token Her
Token ID2332
Feature activation+3.37e-01

Pos logit contributions
 mathemat+0.848
ADRA+0.786
��+0.786
ingly+0.771
MAP+0.755
Neg logit contributions
idth-0.955
esters-0.807
ammy-0.741
ilyn-0.732
uv-0.726
Token so
Token ID523
Feature activation+2.00e-01

Pos logit contributions
 mathemat+0.946
��+0.881
ADRA+0.880
ingly+0.864
MAP+0.848
Neg logit contributions
idth-1.134
esters-0.962
ammy-0.885
ilyn-0.874
uv-0.865
Token-
Token ID12
Feature activation+5.24e-01

Pos logit contributions
 mathemat+0.600
ADRA+0.561
��+0.560
ingly+0.552
MAP+0.542
Neg logit contributions
idth-0.630
esters-0.529
ammy-0.483
ilyn-0.477
uv-0.471
Tokencalled
Token ID7174
Feature activation+1.63e+00

Pos logit contributions
 mathemat+1.538
ADRA+1.433
��+1.428
ingly+1.389
MAP+1.373
Neg logit contributions
idth-1.716
esters-1.441
ammy-1.328
ilyn-1.306
uv-1.296
Token urban
Token ID7876
Feature activation+4.27e-01

Pos logit contributions
 mathemat+4.638
ingly+4.282
��+4.252
ADRA+4.241
 Leopard+4.142
Neg logit contributions
idth-5.270
esters-4.457
ilyn-4.022
ammy-4.018
uv-3.963
Token thru
Token ID33834
Feature activation+2.47e-01

Pos logit contributions
 mathemat+1.199
��+1.115
ADRA+1.113
ingly+1.095
MAP+1.072
Neg logit contributions
idth-1.437
esters-1.221
ammy-1.122
ilyn-1.108
uv-1.095
Token-
Token ID12
Feature activation+1.02e-01

Pos logit contributions
 mathemat+0.725
��+0.675
ADRA+0.674
ingly+0.667
MAP+0.653
Neg logit contributions
idth-0.798
esters-0.671
ammy-0.616
ilyn-0.607
uv-0.598
Tokenh
Token ID71
Feature activation+1.69e-01

Pos logit contributions
 mathemat+0.305
ADRA+0.285
��+0.285
ingly+0.280
MAP+0.275
Neg logit contributions
idth-0.326
esters-0.274
ammy-0.250
ilyn-0.247
uv-0.244
Tokenikes
Token ID7938
Feature activation+5.95e-01

Pos logit contributions
 mathemat+0.496
��+0.463
ADRA+0.460
ingly+0.452
MAP+0.444
Neg logit contributions
idth-0.548
esters-0.462
ammy-0.425
ilyn-0.417
uv-0.414
Token of
Token ID286
Feature activation+2.38e-01

Pos logit contributions
 mathemat+2.020
ADRA+1.877
��+1.876
ingly+1.837
MAP+1.801
Neg logit contributions
idth-2.316
esters-1.946
ammy-1.784
ilyn-1.757
uv-1.751
Token all
Token ID477
Feature activation-1.77e-01

Pos logit contributions
 mathemat+0.706
ADRA+0.663
��+0.661
ingly+0.647
WARN+0.636
Neg logit contributions
idth-0.768
esters-0.640
ammy-0.587
uv-0.579
 Rated-0.577
Token children
Token ID1751
Feature activation+4.96e-01

Pos logit contributions
idth+0.560
esters+0.467
uv+0.435
ammy+0.429
ples+0.426
Neg logit contributions
 mathemat-0.530
ADRA-0.497
��-0.494
ingly-0.481
WARN-0.477
Token,
Token ID11
Feature activation+3.48e-01

Pos logit contributions
 mathemat+1.457
��+1.361
ADRA+1.361
ingly+1.327
WARN+1.301
Neg logit contributions
idth-1.612
esters-1.354
ammy-1.239
ilyn-1.220
uv-1.220
Token terminate
Token ID23654
Feature activation+9.56e-01

Pos logit contributions
 mathemat+1.005
ADRA+0.950
��+0.946
ingly+0.924
WARN+0.906
Neg logit contributions
idth-1.140
esters-0.956
ammy-0.875
 Rated-0.866
ples-0.864
Token if
Token ID611
Feature activation+1.63e+00

Pos logit contributions
 mathemat+2.782
��+2.588
ADRA+2.582
ingly+2.542
MAP+2.494
Neg logit contributions
idth-3.121
esters-2.628
ammy-2.408
ilyn-2.379
uv-2.354
Token any
Token ID597
Feature activation+7.72e-01

Pos logit contributions
 mathemat+4.108
��+3.755
ingly+3.751
ADRA+3.750
MAP+3.655
Neg logit contributions
idth-5.781
esters-4.993
ammy-4.613
ilyn-4.573
uv-4.499
Token are
Token ID389
Feature activation+6.14e-01

Pos logit contributions
 mathemat+2.105
��+1.950
ADRA+1.946
ingly+1.915
MAP+1.873
Neg logit contributions
idth-2.664
esters-2.271
ammy-2.095
ilyn-2.064
uv-2.047
Token not
Token ID407
Feature activation+2.01e-01

Pos logit contributions
 mathemat+1.554
ADRA+1.431
��+1.429
ingly+1.394
MAP+1.360
Neg logit contributions
idth-2.234
esters-1.933
ammy-1.792
uv-1.764
ilyn-1.763
Token valid
Token ID4938
Feature activation+2.94e-01

Pos logit contributions
 mathemat+0.606
��+0.568
ADRA+0.567
ingly+0.553
WARN+0.545
Neg logit contributions
idth-0.631
esters-0.532
ammy-0.483
ples-0.476
uv-0.475
Token //
Token ID3373
Feature activation+5.14e-01

Pos logit contributions
 mathemat+0.823
��+0.767
ADRA+0.767
ingly+0.746
WARN+0.734
Neg logit contributions
idth-0.993
esters-0.841
ammy-0.771
ilyn-0.759
uv-0.758
Token that
Token ID326
Feature activation+3.34e-01

Pos logit contributions
idth+0.196
esters+0.164
ammy+0.148
ilyn+0.147
uv+0.147
Neg logit contributions
 mathemat-0.192
ADRA-0.182
��-0.181
WARN-0.176
ingly-0.176
Token he
Token ID339
Feature activation+1.72e-01

Pos logit contributions
 mathemat+0.954
ADRA+0.895
��+0.893
ingly+0.874
WARN+0.862
Neg logit contributions
idth-1.098
esters-0.929
ammy-0.847
ilyn-0.842
uv-0.836
Token was
Token ID373
Feature activation+2.92e-01

Pos logit contributions
 mathemat+0.478
ADRA+0.448
��+0.447
ingly+0.429
MAP+0.429
Neg logit contributions
idth-0.581
esters-0.499
ammy-0.455
ilyn-0.449
uv-0.448
Token the
Token ID262
Feature activation-1.31e-01

Pos logit contributions
 mathemat+0.803
��+0.748
ADRA+0.748
ingly+0.728
MAP+0.716
Neg logit contributions
idth-0.989
esters-0.847
ammy-0.781
ilyn-0.772
uv-0.766
Token only
Token ID691
Feature activation+1.30e-01

Pos logit contributions
idth+0.445
esters+0.380
ammy+0.353
 Rated+0.349
uv+0.347
Neg logit contributions
 mathemat-0.355
ADRA-0.336
��-0.336
ingly-0.323
MAP-0.318
Token realistic
Token ID12653
Feature activation+1.61e+00

Pos logit contributions
 mathemat+0.374
ADRA+0.354
��+0.352
ingly+0.339
WARN+0.335
Neg logit contributions
idth-0.429
esters-0.360
ammy-0.333
ilyn-0.329
uv-0.328
Token option
Token ID3038
Feature activation+7.55e-01

Pos logit contributions
 mathemat+4.129
ingly+3.821
��+3.693
MAP+3.662
ADRA+3.660
Neg logit contributions
idth-5.726
esters-4.924
ammy-4.556
ilyn-4.481
uv-4.386
Token as
Token ID355
Feature activation-1.66e-01

Pos logit contributions
 mathemat+2.478
��+2.321
ADRA+2.314
ingly+2.305
MAP+2.260
Neg logit contributions
idth-2.171
esters-1.783
ammy-1.619
ilyn-1.588
uv-1.557
Token Kenya
Token ID21506
Feature activation+2.87e-01

Pos logit contributions
idth+0.538
esters+0.453
ammy+0.414
ilyn+0.411
uv+0.408
Neg logit contributions
 mathemat-0.480
ADRA-0.451
��-0.448
ingly-0.439
WARN-0.433
Token's
Token ID338
Feature activation+4.63e-01

Pos logit contributions
 mathemat+0.859
ADRA+0.807
��+0.806
ingly+0.785
MAP+0.774
Neg logit contributions
idth-0.907
esters-0.764
ammy-0.695
ilyn-0.689
uv-0.683
Token future
Token ID2003
Feature activation+4.41e-02

Pos logit contributions
 mathemat+1.235
ADRA+1.149
��+1.147
ingly+1.124
MAP+1.102
Neg logit contributions
idth-1.615
esters-1.380
ammy-1.277
ilyn-1.261
uv-1.249
Tokenx
Token ID87
Feature activation-8.62e-03

Pos logit contributions
idth+0.160
esters+0.134
ammy+0.122
ilyn+0.120
uv+0.119
Neg logit contributions
 mathemat-0.160
ADRA-0.150
��-0.150
ingly-0.147
MAP-0.144
Token.
Token ID13
Feature activation-9.52e-02

Pos logit contributions
idth+0.027
esters+0.023
ammy+0.021
ilyn+0.021
uv+0.020
Neg logit contributions
 mathemat-0.026
��-0.024
ADRA-0.024
ingly-0.024
MAP-0.023
Tokenx
Token ID87
Feature activation+9.79e-02

Pos logit contributions
idth+0.296
esters+0.247
ammy+0.226
ilyn+0.223
uv+0.220
Neg logit contributions
 mathemat-0.292
ADRA-0.272
��-0.272
ingly-0.268
MAP-0.263
Token.
Token ID13
Feature activation-1.86e-01

Pos logit contributions
 mathemat+0.285
��+0.267
ADRA+0.266
ingly+0.264
MAP+0.259
Neg logit contributions
idth-0.317
esters-0.268
ammy-0.245
ilyn-0.241
uv-0.236
Token77
Token ID3324
Feature activation+7.28e-01

Pos logit contributions
idth+0.552
esters+0.456
ammy+0.415
ilyn+0.411
uv+0.409
Neg logit contributions
 mathemat-0.598
ADRA-0.557
��-0.556
ingly-0.547
MAP-0.536
Token 115
Token ID12279
Feature activation+1.61e+00

Pos logit contributions
 mathemat+2.120
��+1.979
ADRA+1.977
ingly+1.953
MAP+1.918
Neg logit contributions
idth-2.358
esters-1.995
ammy-1.820
ilyn-1.796
uv-1.769
Token57
Token ID3553
Feature activation+7.21e-01

Pos logit contributions
 mathemat+4.961
ADRA+4.848
ingly+4.768
��+4.757
MAP+4.754
Neg logit contributions
idth-4.612
esters-3.814
ammy-3.482
ilyn-3.389
uv-3.268
Token21
Token ID2481
Feature activation+4.84e-01

Pos logit contributions
 mathemat+2.372
ADRA+2.230
��+2.229
ingly+2.203
MAP+2.165
Neg logit contributions
idth-2.064
esters-1.698
ammy-1.534
ilyn-1.510
uv-1.486
Token x
Token ID2124
Feature activation+5.16e-01

Pos logit contributions
 mathemat+1.365
��+1.269
ADRA+1.269
ingly+1.250
MAP+1.226
Neg logit contributions
idth-1.613
esters-1.371
ammy-1.259
ilyn-1.242
uv-1.224
Token.
Token ID13
Feature activation-3.72e-02

Pos logit contributions
 mathemat+1.514
��+1.424
ADRA+1.420
ingly+1.413
MAP+1.384
Neg logit contributions
idth-1.653
esters-1.388
ammy-1.272
ilyn-1.253
uv-1.229
Tokenx
Token ID87
Feature activation+5.18e-02

Pos logit contributions
idth+0.114
esters+0.095
ammy+0.087
ilyn+0.085
uv+0.085
Neg logit contributions
 mathemat-0.117
ADRA-0.109
��-0.109
ingly-0.106
MAP-0.104
Tokenbad
Token ID14774
Feature activation-2.96e-01

Pos logit contributions
 mathemat+1.700
ADRA+1.574
��+1.555
ingly+1.532
MAP+1.495
Neg logit contributions
idth-2.048
esters-1.759
ammy-1.608
ilyn-1.595
ples-1.588
Token'
Token ID6
Feature activation+7.57e-01

Pos logit contributions
idth+0.874
esters+0.730
ammy+0.661
ilyn+0.656
uv+0.646
Neg logit contributions
 mathemat-0.945
ADRA-0.888
��-0.880
ingly-0.863
MAP-0.851
Token people
Token ID661
Feature activation+2.94e-02

Pos logit contributions
 mathemat+1.987
��+1.860
ingly+1.859
ADRA+1.819
MAP+1.793
Neg logit contributions
idth-2.677
esters-2.257
ammy-2.086
ilyn-2.049
ples-2.039
Token around
Token ID1088
Feature activation-1.08e-01

Pos logit contributions
 mathemat+0.079
ADRA+0.073
��+0.073
ingly+0.072
MAP+0.070
Neg logit contributions
idth-0.102
esters-0.088
ammy-0.081
ilyn-0.080
uv-0.079
Token Trump
Token ID1301
Feature activation-1.31e-01

Pos logit contributions
idth+0.316
esters+0.267
ammy+0.239
ilyn+0.237
uv+0.235
Neg logit contributions
 mathemat-0.348
ADRA-0.326
��-0.324
ingly-0.312
MAP-0.311
Token MORE
Token ID12011
Feature activation-2.13e+00

Pos logit contributions
idth+0.396
esters+0.339
ammy+0.306
ilyn+0.304
uv+0.303
Neg logit contributions
 mathemat-0.399
��-0.375
ADRA-0.372
ingly-0.367
WARN-0.356
Token pr
Token ID778
Feature activation-8.13e-01

Pos logit contributions
idth+5.779
esters+5.098
uv+4.651
 Rated+4.461
 drinks+4.428
Neg logit contributions
��-6.363
 mathemat-6.322
ADRA-6.296
MAP-5.941
WARN-5.835
Tokenattled
Token ID43535
Feature activation-9.62e-02

Pos logit contributions
idth+2.510
esters+2.163
ammy+1.930
uv+1.921
ilyn+1.915
Neg logit contributions
 mathemat-2.531
��-2.387
ADRA-2.337
MAP-2.288
WARN-2.268
Token inqu
Token ID7949
Feature activation-6.51e-01

Pos logit contributions
idth+0.299
esters+0.252
ammy+0.229
uv+0.229
ilyn+0.228
Neg logit contributions
 mathemat-0.292
��-0.275
ADRA-0.273
MAP-0.265
WARN-0.260
Tokenis
Token ID271
Feature activation-7.99e-02

Pos logit contributions
idth+2.010
esters+1.719
ammy+1.526
ilyn+1.521
ples+1.489
Neg logit contributions
 mathemat-2.041
��-1.919
ADRA-1.907
MAP-1.849
WARN-1.837
Tokenitive
Token ID1800
Feature activation-4.59e-01

Pos logit contributions
idth+0.304
esters+0.267
ammy+0.245
ilyn+0.244
uv+0.241
Neg logit contributions
 mathemat-0.189
��-0.175
ADRA-0.172
MAP-0.166
ingly-0.165
Token than
Token ID621
Feature activation+6.38e-01

Pos logit contributions
 mathemat+0.395
ADRA+0.365
��+0.365
ingly+0.358
MAP+0.349
Neg logit contributions
idth-0.555
esters-0.480
ammy-0.443
ilyn-0.438
uv-0.435
Token 10
Token ID838
Feature activation+4.25e-02

Pos logit contributions
 mathemat+1.755
��+1.666
ingly+1.641
ADRA+1.639
MAP+1.607
Neg logit contributions
idth-2.123
esters-1.781
ammy-1.666
ilyn-1.622
uv-1.591
Token years
Token ID812
Feature activation-2.67e-01

Pos logit contributions
 mathemat+0.121
��+0.113
ADRA+0.113
ingly+0.111
WARN+0.109
Neg logit contributions
idth-0.141
esters-0.119
ammy-0.109
ilyn-0.108
uv-0.106
Token in
Token ID287
Feature activation-2.98e-02

Pos logit contributions
idth+0.815
esters+0.694
ammy+0.630
 Rated+0.617
ilyn+0.614
Neg logit contributions
 mathemat-0.829
ADRA-0.769
��-0.752
ingly-0.749
MAP-0.735
Token prison
Token ID3770
Feature activation-3.48e-01

Pos logit contributions
idth+0.098
esters+0.084
ammy+0.076
ilyn+0.076
uv+0.075
Neg logit contributions
 mathemat-0.086
ADRA-0.079
��-0.079
ingly-0.078
WARN-0.076
Token MORE
Token ID12011
Feature activation-2.11e+00

Pos logit contributions
idth+1.054
esters+0.903
 Rated+0.813
uv+0.806
ples+0.797
Neg logit contributions
 mathemat-1.089
ADRA-1.011
��-0.980
WARN-0.978
ingly-0.970
Token,
Token ID11
Feature activation-5.27e-01

Pos logit contributions
idth+4.978
esters+4.329
uv+3.767
ammy+3.742
avis+3.642
Neg logit contributions
 mathemat-7.244
ADRA-6.918
��-6.885
MAP-6.558
WARN-6.557
Token in
Token ID287
Feature activation-8.53e-01

Pos logit contributions
idth+1.614
esters+1.389
 Rated+1.242
ammy+1.236
uv+1.234
Neg logit contributions
 mathemat-1.589
ADRA-1.521
��-1.506
ingly-1.443
MAP-1.439
Token his
Token ID465
Feature activation-4.77e-02

Pos logit contributions
idth+2.435
esters+2.012
ammy+1.841
uv+1.801
ilyn+1.782
Neg logit contributions
 mathemat-2.775
ADRA-2.640
��-2.597
MAP-2.551
ingly-2.533
Token weekly
Token ID10273
Feature activation-1.45e-01

Pos logit contributions
idth+0.157
esters+0.134
ammy+0.124
 Rated+0.122
ilyn+0.121
Neg logit contributions
 mathemat-0.136
ADRA-0.127
��-0.125
ingly-0.122
MAP-0.122
Token column
Token ID5721
Feature activation+3.45e-01

Pos logit contributions
idth+0.580
esters+0.509
ammy+0.473
uv+0.470
ilyn+0.470
Neg logit contributions
 mathemat-0.305
ADRA-0.286
��-0.280
ingly-0.270
MAP-0.269
Token on
Token ID319
Feature activation+3.00e-01

Pos logit contributions
 mathemat+1.613
ADRA+1.493
��+1.486
ingly+1.455
MAP+1.444
Neg logit contributions
idth-1.832
esters-1.569
ammy-1.440
ilyn-1.425
 Rated-1.413
Token sleeve
Token ID23179
Feature activation-3.12e-01

Pos logit contributions
 mathemat+0.834
��+0.779
ADRA+0.759
ingly+0.743
MAP+0.742
Neg logit contributions
idth-1.016
esters-0.873
ammy-0.796
ilyn-0.794
 Rated-0.785
Token at
Token ID379
Feature activation+3.99e-01

Pos logit contributions
idth+1.081
esters+0.922
ammy+0.851
ilyn+0.841
uv+0.831
Neg logit contributions
 mathemat-0.840
��-0.778
ADRA-0.778
ingly-0.766
MAP-0.749
Token basketball
Token ID9669
Feature activation+5.06e-01

Pos logit contributions
 mathemat+1.200
��+1.114
ADRA+1.111
ingly+1.088
WARN+1.079
Neg logit contributions
idth-1.276
esters-1.079
ammy-0.994
ilyn-0.966
ples-0.965
Token game
Token ID983
Feature activation+3.11e-01

Pos logit contributions
 mathemat+1.309
ingly+1.214
��+1.206
ADRA+1.185
WARN+1.172
Neg logit contributions
idth-1.810
esters-1.548
ammy-1.425
ilyn-1.405
uv-1.390
Token MORE
Token ID12011
Feature activation-2.06e+00

Pos logit contributions
 mathemat+0.815
ingly+0.773
ADRA+0.769
��+0.764
 Leopard+0.745
Neg logit contributions
idth-1.088
esters-0.911
ammy-0.845
ilyn-0.842
uv-0.827
Token and
Token ID290
Feature activation-7.66e-01

Pos logit contributions
idth+5.215
esters+4.480
ammy+3.995
uv+3.900
ilyn+3.890
Neg logit contributions
 mathemat-6.834
ADRA-6.471
��-6.450
MAP-6.268
WARN-6.189
Token Hillary
Token ID5073
Feature activation-4.07e-02

Pos logit contributions
idth+2.414
esters+2.046
ammy+1.880
uv+1.870
ilyn+1.856
Neg logit contributions
 mathemat-2.241
��-2.107
ADRA-2.104
WARN-2.037
MAP-2.019
Token Clinton
Token ID2605
Feature activation-1.78e-01

Pos logit contributions
idth+0.151
esters+0.130
ammy+0.121
ilyn+0.120
uv+0.118
Neg logit contributions
 mathemat-0.100
��-0.092
ADRA-0.092
ingly-0.090
MAP-0.088
Token Hillary
Token ID5073
Feature activation+4.17e-01

Pos logit contributions
idth+0.558
esters+0.470
ammy+0.436
ilyn+0.435
uv+0.424
Neg logit contributions
 mathemat-0.537
ADRA-0.511
��-0.497
MAP-0.487
ingly-0.486
Token Diane
Token ID26542
Feature activation-4.32e-01

Pos logit contributions
 mathemat+1.172
ingly+1.106
��+1.097
ADRA+1.091
MAP+1.066
Neg logit contributions
idth-1.373
esters-1.158
ammy-1.058
uv-1.056
ples-1.042
Token than
Token ID621
Feature activation+6.57e-01

Pos logit contributions
 mathemat+0.307
ADRA+0.283
��+0.283
ingly+0.276
MAP+0.270
Neg logit contributions
idth-0.447
esters-0.388
ammy-0.358
ilyn-0.354
uv-0.352
Token 10
Token ID838
Feature activation+1.58e-01

Pos logit contributions
 mathemat+1.799
��+1.712
ingly+1.689
ADRA+1.682
 Leopard+1.658
Neg logit contributions
idth-2.177
esters-1.828
ammy-1.712
ilyn-1.662
uv-1.638
Token years
Token ID812
Feature activation-2.31e-01

Pos logit contributions
 mathemat+0.446
��+0.417
ADRA+0.415
ingly+0.410
WARN+0.407
Neg logit contributions
idth-0.525
esters-0.440
ammy-0.407
ilyn-0.401
uv-0.396
Token in
Token ID287
Feature activation-1.32e-02

Pos logit contributions
idth+0.703
esters+0.601
ammy+0.546
 Rated+0.535
ilyn+0.531
Neg logit contributions
 mathemat-0.716
ADRA-0.665
��-0.647
ingly-0.645
WARN-0.634
Token prison
Token ID3770
Feature activation-3.90e-01

Pos logit contributions
idth+0.043
esters+0.037
ammy+0.033
ilyn+0.033
uv+0.033
Neg logit contributions
 mathemat-0.038
ADRA-0.035
��-0.035
ingly-0.035
WARN-0.034
Token MORE
Token ID12011
Feature activation-1.94e+00

Pos logit contributions
idth+1.179
esters+1.013
 Rated+0.917
uv+0.907
ples+0.894
Neg logit contributions
 mathemat-1.223
ADRA-1.137
WARN-1.099
��-1.091
ingly-1.082
Token and
Token ID290
Feature activation-5.84e-01

Pos logit contributions
idth+5.458
esters+4.717
uv+4.160
ammy+4.154
ilyn+4.068
Neg logit contributions
 mathemat-6.017
ADRA-5.800
��-5.777
WARN-5.538
MAP-5.456
Token Hillary
Token ID5073
Feature activation-7.32e-02

Pos logit contributions
idth+1.850
esters+1.554
ilyn+1.441
ammy+1.439
uv+1.426
Neg logit contributions
 mathemat-1.672
ADRA-1.646
��-1.624
MAP-1.554
WARN-1.548
Token Clinton
Token ID2605
Feature activation-1.28e-01

Pos logit contributions
idth+0.258
esters+0.221
ilyn+0.205
ammy+0.204
uv+0.202
Neg logit contributions
 mathemat-0.199
ADRA-0.183
��-0.180
MAP-0.170
ingly-0.170
Token Hillary
Token ID5073
Feature activation+3.75e-01

Pos logit contributions
idth+0.372
esters+0.310
ilyn+0.291
ammy+0.289
uv+0.281
Neg logit contributions
 mathemat-0.416
ADRA-0.400
��-0.385
WARN-0.375
MAP-0.374
Token Diane
Token ID26542
Feature activation-2.89e-01

Pos logit contributions
 mathemat+1.090
��+1.020
ADRA+1.017
ingly+1.017
MAP+0.989
Neg logit contributions
idth-1.207
esters-1.016
ammy-0.929
ilyn-0.914
uv-0.913
Token than
Token ID621
Feature activation+6.88e-01

Pos logit contributions
 mathemat+0.500
ADRA+0.460
��+0.460
ingly+0.451
MAP+0.440
Neg logit contributions
idth-0.749
esters-0.648
ammy-0.600
ilyn-0.594
uv-0.588
Token 10
Token ID838
Feature activation+1.23e-01

Pos logit contributions
 mathemat+1.836
��+1.749
ingly+1.727
ADRA+1.720
 Leopard+1.700
Neg logit contributions
idth-2.315
esters-1.950
ammy-1.834
ilyn-1.773
uv-1.748
Token years
Token ID812
Feature activation-2.22e-01

Pos logit contributions
 mathemat+0.347
��+0.324
ADRA+0.323
ingly+0.319
WARN+0.315
Neg logit contributions
idth-0.411
esters-0.345
ammy-0.319
ilyn-0.314
uv-0.310
Token in
Token ID287
Feature activation-2.38e-03

Pos logit contributions
idth+0.664
esters+0.567
ammy+0.514
 Rated+0.503
ilyn+0.499
Neg logit contributions
 mathemat-0.699
ADRA-0.649
��-0.633
ingly-0.632
WARN-0.622
Token prison
Token ID3770
Feature activation-3.40e-01

Pos logit contributions
idth+0.008
esters+0.007
ammy+0.006
ilyn+0.006
uv+0.006
Neg logit contributions
 mathemat-0.007
ADRA-0.006
��-0.006
ingly-0.006
WARN-0.006
Token MORE
Token ID12011
Feature activation-1.94e+00

Pos logit contributions
idth+1.027
esters+0.883
 Rated+0.800
uv+0.790
ples+0.780
Neg logit contributions
 mathemat-1.066
ADRA-0.990
WARN-0.960
��-0.951
ingly-0.942
Token and
Token ID290
Feature activation-7.89e-01

Pos logit contributions
idth+4.899
esters+4.305
ammy+3.999
uv+3.882
 Rated+3.782
Neg logit contributions
 mathemat-6.278
ADRA-6.126
��-6.012
WARN-5.760
MAP-5.681
Token tick
Token ID4378
Feature activation+3.83e-01

Pos logit contributions
idth+2.546
esters+2.143
ammy+2.011
uv+1.983
ilyn+1.980
Neg logit contributions
 mathemat-2.243
��-2.122
ADRA-2.118
WARN-2.005
MAP-1.999
Tokened
Token ID276
Feature activation-4.00e-01

Pos logit contributions
 mathemat+0.994
��+0.916
ADRA+0.916
ingly+0.895
MAP+0.879
Neg logit contributions
idth-1.366
esters-1.175
ammy-1.086
ilyn-1.073
uv-1.060
Token through
Token ID832
Feature activation-5.01e-01

Pos logit contributions
idth+1.782
esters+1.599
ammy+1.514
ilyn+1.496
uv+1.492
Neg logit contributions
 mathemat-0.681
ADRA-0.594
��-0.589
MAP-0.544
WARN-0.528
Token her
Token ID607
Feature activation-3.18e-01

Pos logit contributions
idth+1.532
esters+1.253
ammy+1.179
ilyn+1.158
 Rated+1.153
Neg logit contributions
 mathemat-1.545
ADRA-1.444
��-1.438
WARN-1.384
ingly-1.375
Tokenholes
Token ID28439
Feature activation-3.30e-01

Pos logit contributions
idth+0.139
esters+0.118
ammy+0.108
ilyn+0.107
uv+0.106
Neg logit contributions
 mathemat-0.117
ADRA-0.109
��-0.109
ingly-0.107
MAP-0.105
Token in
Token ID287
Feature activation+3.48e-01

Pos logit contributions
idth+1.292
esters+1.125
ammy+1.050
ilyn+1.037
uv+1.030
Neg logit contributions
 mathemat-0.744
ADRA-0.678
��-0.676
ingly-0.662
MAP-0.645
Token American
Token ID1605
Feature activation+6.93e-01

Pos logit contributions
 mathemat+0.941
��+0.887
ingly+0.884
ADRA+0.881
MAP+0.851
Neg logit contributions
idth-1.176
esters-0.987
ammy-0.919
ilyn-0.919
 Rated-0.900
Token politics
Token ID4819
Feature activation-2.59e-01

Pos logit contributions
 mathemat+1.819
��+1.704
ADRA+1.692
ingly+1.677
MAP+1.641
Neg logit contributions
idth-2.423
esters-2.069
ammy-1.905
ilyn-1.892
 Rated-1.859
Token'
Token ID6
Feature activation+3.95e-01

Pos logit contributions
idth+1.001
esters+0.878
ammy+0.815
ilyn+0.803
uv+0.801
Neg logit contributions
 mathemat-0.599
ADRA-0.541
��-0.539
ingly-0.527
MAP-0.513
Token MORE
Token ID12011
Feature activation-1.85e+00

Pos logit contributions
 mathemat+0.859
ingly+0.841
MAP+0.828
WARN+0.820
��+0.820
Neg logit contributions
idth-1.517
esters-1.266
ammy-1.227
ilyn-1.226
ples-1.187
Token completed
Token ID5668
Feature activation-7.10e-01

Pos logit contributions
idth+5.175
esters+4.543
uv+4.110
 Rated+4.078
ples+4.049
Neg logit contributions
ADRA-5.611
 mathemat-5.579
��-5.574
MAP-5.178
WARN-5.129
Token a
Token ID257
Feature activation-1.74e-01

Pos logit contributions
idth+2.038
esters+1.738
ammy+1.551
ilyn+1.545
 Rated+1.536
Neg logit contributions
 mathemat-2.265
��-2.149
ADRA-2.142
ingly-2.068
WARN-2.065
Token swing
Token ID9628
Feature activation-5.47e-01

Pos logit contributions
idth+0.560
esters+0.475
ammy+0.435
ilyn+0.430
uv+0.428
Neg logit contributions
 mathemat-0.506
��-0.479
ADRA-0.477
ingly-0.463
MAP-0.457
Token through
Token ID832
Feature activation-4.48e-01

Pos logit contributions
idth+1.885
esters+1.601
uv+1.487
ammy+1.480
 Rated+1.462
Neg logit contributions
 mathemat-1.455
ADRA-1.369
��-1.358
ingly-1.293
WARN-1.284
Token North
Token ID2258
Feature activation-4.15e-01

Pos logit contributions
idth+1.469
esters+1.251
ammy+1.147
ilyn+1.134
uv+1.132
Neg logit contributions
 mathemat-1.271
��-1.199
ADRA-1.199
ingly-1.146
MAP-1.142
Tokentwitter
Token ID6956
Feature activation-7.63e-01

Pos logit contributions
idth+1.218
esters+0.962
ammy+0.836
uv+0.832
ilyn+0.816
Neg logit contributions
 mathemat-2.071
ADRA-1.921
��-1.906
 juven-1.842
ingly-1.836
Token.
Token ID13
Feature activation-6.21e-01

Pos logit contributions
idth+2.152
esters+1.788
uv+1.636
 Rated+1.612
ammy+1.611
Neg logit contributions
 mathemat-2.691
ADRA-2.435
��-2.409
 juven-2.317
ingly-2.247
Tokencom
Token ID785
Feature activation-1.24e-01

Pos logit contributions
idth+2.337
esters+2.056
uv+1.935
ammy+1.906
ples+1.896
Neg logit contributions
 mathemat-1.610
ADRA-1.398
��-1.368
 juven-1.327
MAP-1.245
Token/
Token ID14
Feature activation-9.12e-01

Pos logit contributions
idth+0.317
esters+0.254
ammy+0.225
ilyn+0.222
uv+0.219
Neg logit contributions
 mathemat-0.451
ADRA-0.425
��-0.424
ingly-0.417
MAP-0.411
TokenO
Token ID46
Feature activation-1.16e+00

Pos logit contributions
idth+2.844
esters+2.377
uv+2.252
ammy+2.203
ilyn+2.168
Neg logit contributions
 mathemat-2.809
��-2.576
ADRA-2.504
ingly-2.477
 Leopard-2.433
TokenJV
Token ID41697
Feature activation-1.82e+00

Pos logit contributions
idth+3.428
esters+2.850
uv+2.706
ammy+2.608
ilyn+2.574
Neg logit contributions
 mathemat-3.731
��-3.472
ADRA-3.353
ingly-3.298
 juven-3.269
TokenL
Token ID43
Feature activation-1.40e+00

Pos logit contributions
idth+5.582
esters+4.652
uv+4.545
ammy+4.336
ilyn+4.269
Neg logit contributions
 mathemat-5.538
��-5.061
ADRA-4.836
ingly-4.752
 Leopard-4.741
TokenF
Token ID37
Feature activation-1.43e+00

Pos logit contributions
idth+4.259
esters+3.542
uv+3.423
ammy+3.292
ilyn+3.234
Neg logit contributions
 mathemat-4.345
��-4.024
ADRA-3.853
ingly-3.777
 Leopard-3.776
TokenRe
Token ID3041
Feature activation-1.47e+00

Pos logit contributions
idth+4.285
esters+3.548
uv+3.412
ammy+3.275
ilyn+3.227
Neg logit contributions
 mathemat-4.528
��-4.184
ADRA-4.008
ingly-3.944
 Leopard-3.941
TokenK
Token ID42
Feature activation-8.10e-01

Pos logit contributions
idth+4.494
esters+3.793
uv+3.647
ammy+3.527
ilyn+3.469
Neg logit contributions
 mathemat-4.468
��-4.114
ADRA-3.918
ingly-3.898
 Leopard-3.851
Tokench
Token ID354
Feature activation-3.43e-01

Pos logit contributions
idth+2.484
esters+2.080
uv+1.951
ammy+1.920
ilyn+1.886
Neg logit contributions
 mathemat-2.537
��-2.342
ADRA-2.277
ingly-2.234
 Leopard-2.195
Token.
Token ID13
Feature activation-5.07e-01

Pos logit contributions
idth+2.929
esters+2.402
uv+2.240
 Rated+2.187
ammy+2.165
Neg logit contributions
 mathemat-4.024
ADRA-3.605
��-3.561
 juven-3.467
ingly-3.267
Tokencom
Token ID785
Feature activation-1.65e-01

Pos logit contributions
idth+1.937
esters+1.698
uv+1.596
ammy+1.578
 Rated+1.569
Neg logit contributions
 mathemat-1.277
ADRA-1.114
��-1.092
 juven-1.046
ingly-0.997
Token/
Token ID14
Feature activation-8.77e-01

Pos logit contributions
idth+0.418
esters+0.335
ammy+0.297
ilyn+0.292
uv+0.289
Neg logit contributions
 mathemat-0.604
ADRA-0.567
��-0.566
ingly-0.556
MAP-0.548
TokenEh
Token ID43894
Feature activation-1.15e+00

Pos logit contributions
idth+2.738
esters+2.289
uv+2.168
ammy+2.122
ilyn+2.089
Neg logit contributions
 mathemat-2.698
��-2.475
ADRA-2.409
ingly-2.386
 Leopard-2.339
TokenU
Token ID52
Feature activation-1.08e+00

Pos logit contributions
idth+3.566
esters+2.979
uv+2.841
ammy+2.761
ples+2.723
Neg logit contributions
 mathemat-3.525
��-3.246
ADRA-3.118
ingly-3.084
 Leopard-3.038
Tokeng
Token ID70
Feature activation-1.82e+00

Pos logit contributions
idth+3.155
esters+2.606
uv+2.487
ammy+2.401
ples+2.372
Neg logit contributions
 mathemat-3.516
��-3.252
ADRA-3.141
ingly-3.083
 Leopard-3.066
TokenI
Token ID40
Feature activation-1.12e+00

Pos logit contributions
idth+5.668
esters+4.725
uv+4.631
ammy+4.427
ples+4.383
Neg logit contributions
 mathemat-5.470
��-4.988
ingly-4.725
ADRA-4.672
 Leopard-4.662
Tokenx
Token ID87
Feature activation-1.37e+00

Pos logit contributions
idth+3.238
esters+2.676
uv+2.543
ammy+2.464
ilyn+2.432
Neg logit contributions
 mathemat-3.691
��-3.438
ADRA-3.321
ingly-3.248
 Leopard-3.224
Token2
Token ID17
Feature activation-9.42e-01

Pos logit contributions
idth+4.262
esters+3.577
uv+3.463
ammy+3.342
ilyn+3.282
Neg logit contributions
 mathemat-4.143
��-3.782
ADRA-3.625
ingly-3.594
 Leopard-3.551
TokenN
Token ID45
Feature activation-1.18e+00

Pos logit contributions
idth+3.014
esters+2.534
uv+2.416
ammy+2.360
ilyn+2.322
Neg logit contributions
 mathemat-2.816
��-2.581
ADRA-2.500
ingly-2.467
 Leopard-2.420
Tokent
Token ID83
Feature activation-9.64e-01

Pos logit contributions
idth+3.501
esters+2.921
uv+2.799
ammy+2.701
ilyn+2.655
Neg logit contributions
 mathemat-3.767
��-3.478
ADRA-3.349
ingly-3.286
 Leopard-3.264
Token\n
Token ID198
Feature activation-1.93e-01

Pos logit contributions
idth+1.741
esters+1.487
 Rated+1.375
ammy+1.350
uv+1.348
Neg logit contributions
 mathemat-1.714
��-1.572
ADRA-1.557
ingly-1.518
WARN-1.473
Token\n
Token ID198
Feature activation-2.47e-01

Pos logit contributions
idth+0.652
esters+0.563
uv+0.518
 Rated+0.516
ammy+0.515
Neg logit contributions
 mathemat-0.570
ADRA-0.507
��-0.501
 juven-0.477
ingly-0.468
TokenOther
Token ID6395
Feature activation+9.51e-02

Pos logit contributions
idth+0.810
esters+0.695
ammy+0.634
uv+0.633
ples+0.626
Neg logit contributions
 mathemat-0.728
��-0.663
ADRA-0.650
ingly-0.639
 Leopard-0.620
Token media
Token ID2056
Feature activation+1.00e-01

Pos logit contributions
 mathemat+0.297
��+0.280
ADRA+0.279
ingly+0.274
MAP+0.269
Neg logit contributions
idth-0.288
esters-0.241
ammy-0.219
ilyn-0.216
uv-0.214
Token [
Token ID685
Feature activation-1.91e-01

Pos logit contributions
 mathemat+0.282
��+0.265
ADRA+0.264
ingly+0.257
MAP+0.252
Neg logit contributions
idth-0.336
esters-0.286
ammy-0.263
ilyn-0.260
uv-0.257
Token edit
Token ID4370
Feature activation-1.81e+00

Pos logit contributions
idth+0.672
esters+0.580
ammy+0.534
ilyn+0.528
uv+0.526
Neg logit contributions
 mathemat-0.508
ADRA-0.468
��-0.464
ingly-0.452
MAP-0.440
Token ]
Token ID2361
Feature activation-6.25e-01

Pos logit contributions
idth+5.882
esters+4.990
uv+4.583
ples+4.489
 Rated+4.471
Neg logit contributions
 mathemat-5.368
ADRA-5.003
��-4.669
ingly-4.564
WARN-4.544
Token\n
Token ID198
Feature activation-8.07e-01

Pos logit contributions
idth+2.318
esters+1.998
ammy+1.856
ilyn+1.835
uv+1.814
Neg logit contributions
 mathemat-1.532
��-1.409
ADRA-1.406
ingly-1.387
MAP-1.352
Token\n
Token ID198
Feature activation-7.27e-01

Pos logit contributions
idth+2.516
esters+2.178
uv+2.034
 Rated+2.024
ples+1.978
Neg logit contributions
 mathemat-2.648
ADRA-2.332
��-2.291
 juven-2.208
WARN-2.058
TokenVideo
Token ID10798
Feature activation-7.20e-01

Pos logit contributions
idth+2.344
esters+2.016
uv+1.846
ammy+1.831
ples+1.818
Neg logit contributions
 mathemat-2.174
��-1.981
ADRA-1.934
ingly-1.885
 Leopard-1.847
Token games
Token ID1830
Feature activation+3.15e-04

Pos logit contributions
idth+2.202
esters+1.877
ammy+1.702
 Rated+1.687
uv+1.684
Neg logit contributions
 mathemat-2.177
��-2.066
ADRA-2.060
ingly-1.999
WARN-1.959
Token\n
Token ID198
Feature activation-4.09e-01

Pos logit contributions
idth+1.439
esters+1.232
 Rated+1.144
ammy+1.117
ilyn+1.114
Neg logit contributions
 mathemat-1.364
��-1.241
ADRA-1.241
ingly-1.198
WARN-1.183
Token\n
Token ID198
Feature activation-4.52e-01

Pos logit contributions
idth+1.357
esters+1.170
uv+1.080
 Rated+1.075
ammy+1.070
Neg logit contributions
 mathemat-1.231
ADRA-1.101
��-1.084
 juven-1.030
ingly-1.000
TokenFilm
Token ID39750
Feature activation-5.42e-02

Pos logit contributions
idth+1.459
esters+1.254
uv+1.134
ammy+1.133
ples+1.127
Neg logit contributions
 mathemat-1.352
��-1.234
ADRA-1.207
ingly-1.179
 Leopard-1.151
Tokenography
Token ID4867
Feature activation-6.82e-02

Pos logit contributions
idth+0.180
esters+0.152
ammy+0.139
ilyn+0.137
 Rated+0.137
Neg logit contributions
 mathemat-0.155
��-0.144
ADRA-0.144
ingly-0.142
WARN-0.139
Token [
Token ID685
Feature activation-3.13e-01

Pos logit contributions
idth+0.254
esters+0.221
ammy+0.205
ilyn+0.202
uv+0.201
Neg logit contributions
 mathemat-0.163
��-0.150
ingly-0.150
ADRA-0.148
MAP-0.146
Token edit
Token ID4370
Feature activation-1.79e+00

Pos logit contributions
idth+1.045
esters+0.902
ammy+0.831
uv+0.828
ilyn+0.821
Neg logit contributions
 mathemat-0.915
ADRA-0.839
��-0.817
ingly-0.783
WARN-0.772
Token ]
Token ID2361
Feature activation-7.49e-01

Pos logit contributions
idth+5.433
esters+4.574
uv+4.285
 Rated+4.239
ples+4.137
Neg logit contributions
 mathemat-5.961
ADRA-5.340
��-4.959
 juven-4.925
WARN-4.746
Token\n
Token ID198
Feature activation-9.92e-01

Pos logit contributions
idth+2.417
esters+2.055
 Rated+1.920
uv+1.883
ilyn+1.878
Neg logit contributions
 mathemat-2.385
ADRA-2.076
��-2.038
 juven-1.981
ingly-1.918
Token\n
Token ID198
Feature activation-9.30e-01

Pos logit contributions
idth+3.016
esters+2.597
 Rated+2.440
uv+2.432
ples+2.355
Neg logit contributions
 mathemat-3.347
ADRA-2.950
��-2.883
 juven-2.786
WARN-2.592
TokenAl
Token ID2348
Feature activation-2.38e-01

Pos logit contributions
idth+2.962
esters+2.518
uv+2.283
 Rated+2.273
ples+2.253
Neg logit contributions
 mathemat-2.889
��-2.587
ADRA-2.521
ingly-2.449
WARN-2.391
Tokenbum
Token ID4435
Feature activation-1.42e-01

Pos logit contributions
idth+0.804
esters+0.681
ammy+0.623
ilyn+0.621
uv+0.613
Neg logit contributions
 mathemat-0.674
��-0.625
ADRA-0.622
ingly-0.608
MAP-0.599
Tokenholes
Token ID28439
Feature activation-3.64e-01

Pos logit contributions
idth+0.300
esters+0.255
ammy+0.234
ilyn+0.230
uv+0.229
Neg logit contributions
 mathemat-0.249
ADRA-0.232
��-0.232
ingly-0.227
MAP-0.223
Token in
Token ID287
Feature activation+3.45e-01

Pos logit contributions
idth+1.438
esters+1.252
ammy+1.169
ilyn+1.157
uv+1.146
Neg logit contributions
 mathemat-0.808
��-0.734
ADRA-0.734
ingly-0.720
MAP-0.701
Token American
Token ID1605
Feature activation+7.00e-01

Pos logit contributions
 mathemat+0.949
��+0.895
ingly+0.889
ADRA+0.887
MAP+0.858
Neg logit contributions
idth-1.154
esters-0.970
ilyn-0.897
ammy-0.897
 Rated-0.878
Token politics
Token ID4819
Feature activation-2.06e-01

Pos logit contributions
 mathemat+1.880
��+1.771
ADRA+1.755
ingly+1.742
MAP+1.707
Neg logit contributions
idth-2.391
esters-2.036
ammy-1.870
ilyn-1.858
 Rated-1.826
Token'
Token ID6
Feature activation+3.27e-01

Pos logit contributions
idth+0.808
esters+0.706
ammy+0.658
ilyn+0.650
uv+0.645
Neg logit contributions
 mathemat-0.460
ADRA-0.417
��-0.417
ingly-0.408
MAP-0.397
Token MORE
Token ID12011
Feature activation-1.79e+00

Pos logit contributions
 mathemat+0.710
ingly+0.701
MAP+0.678
WARN+0.677
��+0.674
Neg logit contributions
idth-1.268
esters-1.059
ammy-1.010
ilyn-1.007
ples-0.988
Token,
Token ID11
Feature activation-6.75e-01

Pos logit contributions
idth+4.762
esters+4.143
ammy+3.766
uv+3.657
ilyn+3.593
Neg logit contributions
 mathemat-5.764
��-5.506
ADRA-5.493
WARN-5.311
MAP-5.284
Token told
Token ID1297
Feature activation+4.89e-01

Pos logit contributions
idth+2.075
esters+1.780
 Rated+1.603
ammy+1.602
uv+1.601
Neg logit contributions
 mathemat-2.020
��-1.937
ADRA-1.920
MAP-1.836
ingly-1.822
Token 77
Token ID8541
Feature activation-4.06e-01

Pos logit contributions
 mathemat+1.338
ADRA+1.239
��+1.238
ingly+1.227
MAP+1.194
Neg logit contributions
idth-1.675
esters-1.423
ammy-1.313
ilyn-1.299
uv-1.280
TokenWA
Token ID15543
Feature activation-3.27e-01

Pos logit contributions
idth+1.324
esters+1.119
ammy+1.031
ples+1.011
uv+1.008
Neg logit contributions
 mathemat-1.178
ADRA-1.107
��-1.094
ingly-1.061
MAP-1.042
TokenBC
Token ID2749
Feature activation-3.50e-01

Pos logit contributions
idth+0.991
esters+0.828
ammy+0.751
ilyn+0.735
uv+0.735
Neg logit contributions
 mathemat-1.028
��-0.958
ADRA-0.955
ingly-0.938
MAP-0.928
Token was
Token ID373
Feature activation+1.86e-01

Pos logit contributions
 mathemat+1.058
ADRA+0.985
��+0.985
ingly+0.970
MAP+0.951
Neg logit contributions
idth-1.193
esters-1.008
ammy-0.924
ilyn-0.912
uv-0.900
Token,
Token ID11
Feature activation+2.66e-01

Pos logit contributions
 mathemat+0.546
ADRA+0.509
��+0.508
ingly+0.501
MAP+0.491
Neg logit contributions
idth-0.598
esters-0.504
ammy-0.461
ilyn-0.455
uv-0.449
Token and
Token ID290
Feature activation+3.27e-01

Pos logit contributions
 mathemat+0.804
ADRA+0.757
��+0.756
ingly+0.737
MAP+0.728
Neg logit contributions
idth-0.827
esters-0.697
ammy-0.635
ilyn-0.627
uv-0.620
Token what
Token ID644
Feature activation+2.44e-02

Pos logit contributions
 mathemat+1.013
ADRA+0.949
��+0.948
ingly+0.932
MAP+0.918
Neg logit contributions
idth-1.005
esters-0.839
ammy-0.763
ilyn-0.753
uv-0.744
Token Ad
Token ID1215
Feature activation+1.15e-01

Pos logit contributions
 mathemat+0.073
ADRA+0.069
��+0.069
ingly+0.066
MAP+0.066
Neg logit contributions
idth-0.076
esters-0.064
ammy-0.059
ilyn-0.058
uv-0.058
Tokene
Token ID68
Feature activation-1.77e+00

Pos logit contributions
 mathemat+0.331
��+0.309
ADRA+0.307
ingly+0.300
MAP+0.295
Neg logit contributions
idth-0.381
esters-0.323
ammy-0.297
ilyn-0.296
uv-0.292
Tokeno
Token ID78
Feature activation-6.27e-01

Pos logit contributions
idth+5.431
esters+4.740
ilyn+4.393
ples+4.301
ammy+4.267
Neg logit contributions
 mathemat-5.089
��-4.771
ADRA-4.605
MAP-4.575
WARN-4.548
Token R
Token ID371
Feature activation-1.29e-01

Pos logit contributions
idth+2.019
esters+1.717
ilyn+1.569
uv+1.550
ammy+1.550
Neg logit contributions
 mathemat-1.819
��-1.717
ADRA-1.682
ingly-1.621
MAP-1.612
Tokeness
Token ID408
Feature activation-7.95e-02

Pos logit contributions
idth+0.445
esters+0.381
ammy+0.349
uv+0.345
ilyn+0.344
Neg logit contributions
 mathemat-0.357
��-0.330
ADRA-0.326
MAP-0.316
ingly-0.316
Tokeni
Token ID72
Feature activation+1.16e-01

Pos logit contributions
idth+0.261
esters+0.221
ammy+0.203
ilyn+0.200
uv+0.198
Neg logit contributions
 mathemat-0.230
��-0.214
ADRA-0.214
ingly-0.208
MAP-0.206
Token warned
Token ID7728
Feature activation+9.89e-01

Pos logit contributions
 mathemat+0.341
ADRA+0.319
��+0.319
ingly+0.309
MAP+0.307
Neg logit contributions
idth-0.372
esters-0.317
ammy-0.287
ilyn-0.286
uv-0.284
Token.
Token ID13
Feature activation-2.20e-01

Pos logit contributions
idth+1.969
esters+1.638
ples+1.464
ammy+1.462
uv+1.460
Neg logit contributions
 mathemat-2.168
ADRA-2.098
��-2.039
WARN-1.979
MAP-1.967
Token\n
Token ID198
Feature activation+1.95e-01

Pos logit contributions
idth+0.723
esters+0.618
ammy+0.567
 Rated+0.565
uv+0.559
Neg logit contributions
 mathemat-0.637
ADRA-0.581
��-0.580
ingly-0.564
MAP-0.551
Token\n
Token ID198
Feature activation+1.27e-01

Pos logit contributions
 mathemat+0.523
ADRA+0.475
��+0.472
ingly+0.454
MAP+0.446
Neg logit contributions
idth-0.693
esters-0.597
ammy-0.551
uv-0.544
ilyn-0.544
TokenFollow
Token ID7155
Feature activation-1.96e-02

Pos logit contributions
 mathemat+0.370
��+0.337
ADRA+0.331
ingly+0.326
 juven+0.317
Neg logit contributions
idth-0.418
esters-0.358
ammy-0.328
uv-0.326
ilyn-0.321
Token @
Token ID2488
Feature activation-1.04e-01

Pos logit contributions
idth+0.061
esters+0.052
ammy+0.048
ilyn+0.046
 Rated+0.046
Neg logit contributions
 mathemat-0.060
��-0.056
ADRA-0.056
ingly-0.053
MAP-0.053
Tokengreg
Token ID9903
Feature activation-1.76e+00

Pos logit contributions
idth+0.362
esters+0.311
ammy+0.288
ilyn+0.284
uv+0.280
Neg logit contributions
 mathemat-0.282
ADRA-0.259
��-0.258
ingly-0.252
MAP-0.246
Tokenory
Token ID652
Feature activation-3.61e-01

Pos logit contributions
idth+4.956
esters+4.141
ples+3.740
ammy+3.679
uv+3.659
Neg logit contributions
 mathemat-6.118
ADRA-5.606
��-5.555
WARN-5.316
 Leopard-5.257
Tokenk
Token ID74
Feature activation-6.94e-01

Pos logit contributions
idth+1.227
esters+1.044
ammy+0.973
ples+0.955
uv+0.954
Neg logit contributions
 mathemat-1.023
ADRA-0.943
��-0.939
ingly-0.896
WARN-0.884
Tokenort
Token ID419
Feature activation-1.91e-01

Pos logit contributions
idth+2.585
esters+2.246
ammy+2.083
uv+2.039
avis+2.038
Neg logit contributions
 mathemat-1.775
��-1.664
ADRA-1.588
ingly-1.499
 juven-1.499
Tokene
Token ID68
Feature activation-5.72e-01

Pos logit contributions
idth+0.593
esters+0.502
ammy+0.458
uv+0.448
ilyn+0.447
Neg logit contributions
 mathemat-0.589
ADRA-0.552
��-0.545
ingly-0.535
MAP-0.525
Token on
Token ID319
Feature activation-4.18e-02

Pos logit contributions
idth+2.042
esters+1.764
ammy+1.637
uv+1.610
ples+1.610
Neg logit contributions
 mathemat-1.491
��-1.370
ADRA-1.344
ingly-1.331
WARN-1.282
Token https
Token ID3740
Feature activation-4.22e-01

Pos logit contributions
 mathemat+1.117
��+1.038
ADRA+1.035
ingly+1.018
MAP+0.995
Neg logit contributions
idth-1.381
esters-1.174
ammy-1.080
ilyn-1.065
uv-1.056
Token://
Token ID1378
Feature activation-4.51e-01

Pos logit contributions
idth+0.720
esters+0.527
uv+0.416
ammy+0.416
ilyn+0.403
Neg logit contributions
 mathemat-1.953
ADRA-1.816
��-1.796
 juven-1.762
ingly-1.740
Tokenbit
Token ID2545
Feature activation-3.71e-01

Pos logit contributions
idth+1.513
esters+1.294
ammy+1.186
uv+1.168
ilyn+1.167
Neg logit contributions
 mathemat-1.275
ADRA-1.194
��-1.187
ingly-1.149
MAP-1.129
Tokenco
Token ID1073
Feature activation+3.22e-01

Pos logit contributions
idth+1.031
esters+0.844
ammy+0.767
ilyn+0.747
uv+0.747
Neg logit contributions
 mathemat-1.256
ADRA-1.196
��-1.184
ingly-1.145
WARN-1.136
Tokenint
Token ID600
Feature activation-5.66e-01

Pos logit contributions
 mathemat+0.616
��+0.544
ADRA+0.542
MAP+0.509
ingly+0.503
Neg logit contributions
idth-1.402
esters-1.231
ammy-1.169
ilyn-1.154
ples-1.144
Tokenalk
Token ID971
Feature activation-1.76e+00

Pos logit contributions
idth+1.382
esters+1.135
ammy+1.015
uv+0.991
ples+0.963
Neg logit contributions
 mathemat-2.132
��-2.013
ADRA-2.011
ingly-1.936
MAP-1.918
Token.
Token ID13
Feature activation-1.74e-01

Pos logit contributions
idth+4.642
esters+3.733
ammy+3.394
uv+3.325
ilyn+3.272
Neg logit contributions
 mathemat-6.545
ADRA-5.982
��-5.851
 juven-5.789
MAP-5.513
Tokenorg
Token ID2398
Feature activation-3.96e-01

Pos logit contributions
idth+0.643
esters+0.564
ammy+0.531
ilyn+0.520
uv+0.519
Neg logit contributions
 mathemat-0.436
ADRA-0.393
��-0.380
MAP-0.368
 juven-0.363
Token/
Token ID14
Feature activation-5.45e-01

Pos logit contributions
idth+0.946
esters+0.739
ammy+0.657
ilyn+0.638
 Rated+0.637
Neg logit contributions
 mathemat-1.525
ADRA-1.431
��-1.422
ingly-1.376
MAP-1.366
Tokenindex
Token ID9630
Feature activation-6.56e-01

Pos logit contributions
idth+1.847
esters+1.573
ammy+1.461
ilyn+1.450
ples+1.427
Neg logit contributions
 mathemat-1.534
ADRA-1.386
��-1.382
ingly-1.357
MAP-1.320
Token.
Token ID13
Feature activation-1.27e-01

Pos logit contributions
idth+1.918
esters+1.581
ammy+1.424
uv+1.408
ilyn+1.408
Neg logit contributions
 mathemat-2.235
ADRA-2.035
��-2.028
 juven-1.962
ingly-1.942
Token:
Token ID25
Feature activation+5.55e-02

Pos logit contributions
idth+2.137
esters+1.796
ammy+1.691
ilyn+1.667
uv+1.662
Neg logit contributions
 mathemat-1.874
ADRA-1.742
��-1.729
WARN-1.689
ingly-1.688
Token\n
Token ID198
Feature activation-4.14e-01

Pos logit contributions
 mathemat+0.141
ADRA+0.130
��+0.129
ingly+0.127
MAP+0.124
Neg logit contributions
idth-0.201
esters-0.173
ammy-0.162
ilyn-0.160
uv-0.158
Token\n
Token ID198
Feature activation-3.59e-01

Pos logit contributions
idth+1.361
esters+1.172
uv+1.087
 Rated+1.084
ammy+1.074
Neg logit contributions
 mathemat-1.265
ADRA-1.123
��-1.108
 juven-1.057
ingly-1.021
Token13
Token ID1485
Feature activation+2.05e-02

Pos logit contributions
idth+1.161
esters+0.987
uv+0.917
ammy+0.913
ples+0.898
Neg logit contributions
 mathemat-1.065
��-0.971
ADRA-0.957
ingly-0.932
 Leopard-0.908
Token PH
Token ID9370
Feature activation-8.54e-01

Pos logit contributions
 mathemat+0.065
ADRA+0.061
��+0.061
ingly+0.060
MAP+0.058
Neg logit contributions
idth-0.062
esters-0.051
ammy-0.047
ilyn-0.046
uv-0.046
TokenOTOS
Token ID33291
Feature activation-1.76e+00

Pos logit contributions
idth+2.764
esters+2.385
ammy+2.224
 Rated+2.160
ilyn+2.159
Neg logit contributions
 mathemat-2.433
ADRA-2.229
��-2.203
ingly-2.185
MAP-2.119
Token Donald
Token ID3759
Feature activation-1.77e-01

Pos logit contributions
idth+5.118
esters+4.064
 Rated+4.042
 Favor+3.887
uv+3.884
Neg logit contributions
 mathemat-5.610
ADRA-5.079
��-4.974
ingly-4.847
acan-4.796
Token Trump
Token ID1301
Feature activation-4.55e-01

Pos logit contributions
idth+0.617
esters+0.525
ammy+0.485
ilyn+0.481
uv+0.479
Neg logit contributions
 mathemat-0.492
ADRA-0.448
��-0.445
ingly-0.421
MAP-0.418
Token's
Token ID338
Feature activation-2.31e-01

Pos logit contributions
idth+1.396
esters+1.172
ammy+1.053
ilyn+1.050
 Rated+1.048
Neg logit contributions
 mathemat-1.390
ADRA-1.326
��-1.314
ingly-1.286
WARN-1.271
Token potential
Token ID2785
Feature activation+3.33e-01

Pos logit contributions
idth+0.648
esters+0.533
ammy+0.482
uv+0.477
ilyn+0.476
Neg logit contributions
 mathemat-0.771
ADRA-0.734
��-0.724
ingly-0.711
WARN-0.705
Token running
Token ID2491
Feature activation-2.81e-01

Pos logit contributions
 mathemat+1.133
ADRA+1.067
��+1.066
ingly+1.050
MAP+1.033
Neg logit contributions
idth-0.923
esters-0.755
ammy-0.678
ilyn-0.667
uv-0.658
Token.
Token ID13
Feature activation-3.58e-01

Pos logit contributions
idth+2.411
esters+1.976
uv+1.823
 Rated+1.804
ammy+1.788
Neg logit contributions
 mathemat-3.143
ADRA-2.831
��-2.798
 juven-2.707
ingly-2.610
Tokencom
Token ID785
Feature activation+3.55e-02

Pos logit contributions
idth+1.377
esters+1.209
uv+1.132
ammy+1.126
ilyn+1.118
Neg logit contributions
 mathemat-0.882
ADRA-0.768
��-0.756
 juven-0.723
ingly-0.703
Token/
Token ID14
Feature activation-7.05e-01

Pos logit contributions
 mathemat+0.128
ADRA+0.121
��+0.121
ingly+0.119
MAP+0.117
Neg logit contributions
idth-0.091
esters-0.073
ammy-0.065
ilyn-0.064
uv-0.063
Token74
Token ID4524
Feature activation-6.94e-01

Pos logit contributions
idth+2.206
esters+1.845
uv+1.733
ammy+1.708
ilyn+1.682
Neg logit contributions
 mathemat-2.164
��-1.990
ADRA-1.945
ingly-1.926
 Leopard-1.879
TokenE
Token ID36
Feature activation-1.16e+00

Pos logit contributions
idth+2.231
esters+1.866
uv+1.761
ammy+1.739
ilyn+1.706
Neg logit contributions
 mathemat-2.079
��-1.909
ADRA-1.863
ingly-1.841
 Leopard-1.795
Tokeng
Token ID70
Feature activation-1.75e+00

Pos logit contributions
idth+3.426
esters+2.839
uv+2.714
ammy+2.617
ilyn+2.592
Neg logit contributions
 mathemat-3.706
��-3.468
ADRA-3.331
ingly-3.294
 Leopard-3.249
TokenV
Token ID53
Feature activation-1.10e+00

Pos logit contributions
idth+5.483
esters+4.566
uv+4.453
ammy+4.286
ples+4.226
Neg logit contributions
 mathemat-5.212
��-4.782
ingly-4.542
ADRA-4.482
 Leopard-4.462
TokenV
Token ID53
Feature activation-1.02e+00

Pos logit contributions
idth+3.367
esters+2.794
uv+2.663
ammy+2.598
ilyn+2.554
Neg logit contributions
 mathemat-3.419
��-3.160
ADRA-3.054
ingly-3.010
 Leopard-2.970
Tokenp
Token ID79
Feature activation-1.35e+00

Pos logit contributions
idth+3.124
esters+2.599
uv+2.468
ammy+2.411
ilyn+2.371
Neg logit contributions
 mathemat-3.179
��-2.941
ADRA-2.849
ingly-2.808
 Leopard-2.766
TokenDM
Token ID23127
Feature activation-7.16e-01

Pos logit contributions
idth+4.228
esters+3.542
uv+3.397
ammy+3.299
ilyn+3.234
Neg logit contributions
 mathemat-4.115
��-3.764
ingly-3.607
ADRA-3.593
 Leopard-3.541
Token2
Token ID17
Feature activation-6.18e-01

Pos logit contributions
idth+2.154
esters+1.789
uv+1.671
ammy+1.648
ilyn+1.620
Neg logit contributions
 mathemat-2.281
��-2.112
ADRA-2.068
ingly-2.034
 Leopard-1.979
Token.
Token ID13
Feature activation-4.15e-01

Pos logit contributions
idth+2.060
esters+1.729
uv+1.566
ammy+1.552
 Rated+1.548
Neg logit contributions
 mathemat-2.518
ADRA-2.296
��-2.277
 juven-2.164
ingly-2.143
Tokentwitter
Token ID6956
Feature activation-8.60e-01

Pos logit contributions
idth+0.985
esters+0.781
ammy+0.681
uv+0.673
ilyn+0.665
Neg logit contributions
 mathemat-1.623
ADRA-1.514
��-1.501
ingly-1.450
 juven-1.445
Token.
Token ID13
Feature activation-6.36e-01

Pos logit contributions
idth+2.375
esters+1.971
uv+1.813
 Rated+1.781
ammy+1.770
Neg logit contributions
 mathemat-3.108
ADRA-2.800
��-2.765
 juven-2.670
ingly-2.555
Tokencom
Token ID785
Feature activation-1.77e-01

Pos logit contributions
idth+2.384
esters+2.100
uv+1.974
ammy+1.944
ples+1.937
Neg logit contributions
 mathemat-1.661
ADRA-1.444
��-1.408
 juven-1.369
MAP-1.283
Token/
Token ID14
Feature activation-9.49e-01

Pos logit contributions
idth+0.446
esters+0.357
ammy+0.317
ilyn+0.311
uv+0.309
Neg logit contributions
 mathemat-0.651
ADRA-0.611
��-0.610
ingly-0.597
MAP-0.589
Tokenj
Token ID73
Feature activation-1.74e+00

Pos logit contributions
idth+2.953
esters+2.469
uv+2.343
ammy+2.290
ilyn+2.252
Neg logit contributions
 mathemat-2.924
��-2.680
ADRA-2.604
ingly-2.577
 Leopard-2.530
TokenAx
Token ID31554
Feature activation-1.69e+00

Pos logit contributions
idth+5.418
esters+4.517
uv+4.395
ammy+4.254
ilyn+4.173
Neg logit contributions
 mathemat-5.223
��-4.775
ADRA-4.558
ingly-4.506
 Leopard-4.497
Token6
Token ID21
Feature activation-1.09e+00

Pos logit contributions
idth+5.265
esters+4.397
uv+4.303
ammy+4.133
ilyn+4.066
Neg logit contributions
 mathemat-5.041
��-4.582
ADRA-4.382
ingly-4.328
 Leopard-4.318
Tokenc
Token ID66
Feature activation-9.06e-01

Pos logit contributions
idth+3.493
esters+2.931
uv+2.813
ammy+2.741
ilyn+2.685
Neg logit contributions
 mathemat-3.254
��-2.980
ADRA-2.877
ingly-2.841
 Leopard-2.799
TokenU
Token ID52
Feature activation-8.21e-01

Pos logit contributions
idth+2.831
esters+2.353
uv+2.237
ammy+2.192
ples+2.152
Neg logit contributions
 mathemat-2.790
��-2.563
ingly-2.475
ADRA-2.463
 Leopard-2.415
Tokenxi
Token ID29992
Feature activation-1.31e+00

Pos logit contributions
idth+2.415
esters+1.995
uv+1.881
ammy+1.835
ilyn+1.808
Neg logit contributions
 mathemat-2.687
��-2.493
ADRA-2.418
ingly-2.372
 Leopard-2.351
Token\n
Token ID198
Feature activation-8.07e-01

Pos logit contributions
idth+2.318
esters+1.998
ammy+1.856
ilyn+1.835
uv+1.814
Neg logit contributions
 mathemat-1.532
��-1.409
ADRA-1.406
ingly-1.387
MAP-1.352
Token\n
Token ID198
Feature activation-7.27e-01

Pos logit contributions
idth+2.516
esters+2.178
uv+2.034
 Rated+2.024
ples+1.978
Neg logit contributions
 mathemat-2.648
ADRA-2.332
��-2.291
 juven-2.208
WARN-2.058
TokenVideo
Token ID10798
Feature activation-7.20e-01

Pos logit contributions
idth+2.344
esters+2.016
uv+1.846
ammy+1.831
ples+1.818
Neg logit contributions
 mathemat-2.174
��-1.981
ADRA-1.934
ingly-1.885
 Leopard-1.847
Token games
Token ID1830
Feature activation+3.15e-04

Pos logit contributions
idth+2.202
esters+1.877
ammy+1.702
 Rated+1.687
uv+1.684
Neg logit contributions
 mathemat-2.177
��-2.066
ADRA-2.060
ingly-1.999
WARN-1.959
Token [
Token ID685
Feature activation-1.19e-01

Pos logit contributions
 mathemat+0.001
��+0.001
ADRA+0.001
ingly+0.001
MAP+0.001
Neg logit contributions
idth-0.001
esters-0.001
ammy-0.001
ilyn-0.001
uv-0.001
Token edit
Token ID4370
Feature activation-1.72e+00

Pos logit contributions
idth+0.413
esters+0.358
ammy+0.329
uv+0.327
ilyn+0.324
Neg logit contributions
 mathemat-0.334
ADRA-0.305
��-0.298
ingly-0.285
 juven-0.282
Token ]
Token ID2361
Feature activation-6.00e-01

Pos logit contributions
idth+5.003
esters+4.388
uv+4.076
 Rated+3.914
ples+3.877
Neg logit contributions
 mathemat-6.136
ADRA-5.336
 juven-5.136
��-4.947
 bom-4.572
Token\n
Token ID198
Feature activation-8.22e-01

Pos logit contributions
idth+2.223
esters+1.916
ammy+1.779
ilyn+1.760
uv+1.739
Neg logit contributions
 mathemat-1.473
��-1.356
ADRA-1.353
ingly-1.334
MAP-1.301
Token\n
Token ID198
Feature activation-8.04e-01

Pos logit contributions
idth+2.554
esters+2.211
uv+2.065
 Rated+2.063
ples+2.015
Neg logit contributions
 mathemat-2.698
ADRA-2.374
��-2.331
 juven-2.253
ingly-2.094
TokenIn
Token ID818
Feature activation-6.65e-01

Pos logit contributions
idth+2.554
esters+2.192
uv+2.001
ammy+1.986
ples+1.967
Neg logit contributions
 mathemat-2.436
��-2.230
ADRA-2.167
ingly-2.117
 juven-2.075
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
idth+1.966
esters+1.641
ammy+1.491
uv+1.472
 Rated+1.467
Neg logit contributions
 mathemat-2.128
ADRA-1.995
��-1.989
ingly-1.932
WARN-1.910
Token by
Token ID416
Feature activation+1.02e-01

Pos logit contributions
 mathemat+1.615
��+1.518
ADRA+1.516
ingly+1.502
MAP+1.474
Neg logit contributions
idth-1.372
esters-1.126
ammy-1.012
ilyn-0.998
uv-0.981
Token Robert
Token ID5199
Feature activation+1.80e-02

Pos logit contributions
 mathemat+0.294
��+0.275
ADRA+0.275
ingly+0.269
MAP+0.264
Neg logit contributions
idth-0.336
esters-0.284
ammy-0.262
ilyn-0.258
uv-0.255
Token Math
Token ID16320
Feature activation-7.48e-03

Pos logit contributions
 mathemat+0.053
��+0.049
ADRA+0.049
ingly+0.048
MAP+0.047
Neg logit contributions
idth-0.058
esters-0.049
ammy-0.045
ilyn-0.044
uv-0.044
Tokenis
Token ID271
Feature activation+1.81e-01

Pos logit contributions
idth+0.027
esters+0.023
ammy+0.021
ilyn+0.021
uv+0.021
Neg logit contributions
 mathemat-0.020
��-0.018
ADRA-0.018
ingly-0.018
MAP-0.017
Token (@
Token ID4275
Feature activation-1.54e-01

Pos logit contributions
 mathemat+0.522
ADRA+0.486
��+0.485
ingly+0.475
MAP+0.465
Neg logit contributions
idth-0.595
esters-0.505
ammy-0.466
ilyn-0.457
uv-0.455
Tokenr
Token ID81
Feature activation-1.71e+00

Pos logit contributions
idth+0.516
esters+0.439
ammy+0.402
ilyn+0.398
uv+0.395
Neg logit contributions
 mathemat-0.433
��-0.405
ADRA-0.403
ingly-0.393
MAP-0.385
Tokenmath
Token ID11018
Feature activation-1.26e+00

Pos logit contributions
idth+5.662
esters+4.784
ammy+4.403
uv+4.385
ilyn+4.371
Neg logit contributions
 mathemat-4.605
��-4.597
ADRA-4.489
 Leopard-4.349
WARN-4.254
Tokenis
Token ID271
Feature activation-6.08e-01

Pos logit contributions
idth+4.298
esters+3.636
ammy+3.307
ilyn+3.294
ples+3.269
Neg logit contributions
 mathemat-3.536
��-3.363
ADRA-3.327
MAP-3.164
 Leopard-3.124
Token_
Token ID62
Feature activation-8.46e-01

Pos logit contributions
idth+1.801
esters+1.502
ammy+1.374
uv+1.343
ilyn+1.334
Neg logit contributions
 mathemat-1.949
��-1.830
ADRA-1.822
ingly-1.781
WARN-1.745
Tokenthe
Token ID1169
Feature activation-5.50e-01

Pos logit contributions
idth+2.707
esters+2.278
ammy+2.106
uv+2.084
ilyn+2.055
Neg logit contributions
 mathemat-2.516
ADRA-2.354
��-2.340
ingly-2.243
WARN-2.200
Token1
Token ID16
Feature activation-2.53e-01

Pos logit contributions
idth+1.573
esters+1.302
ammy+1.170
uv+1.151
ilyn+1.149
Neg logit contributions
 mathemat-1.825
��-1.721
ADRA-1.718
ingly-1.668
WARN-1.642
Tokenholes
Token ID28439
Feature activation-3.46e-01

Pos logit contributions
idth+0.130
esters+0.111
ammy+0.102
ilyn+0.100
uv+0.099
Neg logit contributions
 mathemat-0.110
ADRA-0.103
��-0.103
ingly-0.101
MAP-0.099
Token in
Token ID287
Feature activation+3.62e-01

Pos logit contributions
idth+1.354
esters+1.179
ammy+1.100
ilyn+1.085
uv+1.080
Neg logit contributions
 mathemat-0.785
ADRA-0.715
��-0.712
ingly-0.697
MAP-0.679
Token American
Token ID1605
Feature activation+7.43e-01

Pos logit contributions
 mathemat+0.979
��+0.927
ingly+0.922
ADRA+0.919
MAP+0.888
Neg logit contributions
idth-1.219
esters-1.022
ilyn-0.951
ammy-0.949
 Rated-0.933
Token politics
Token ID4819
Feature activation-2.49e-01

Pos logit contributions
 mathemat+2.011
��+1.886
ADRA+1.875
ingly+1.858
MAP+1.822
Neg logit contributions
idth-2.535
esters-2.155
ammy-1.981
ilyn-1.965
 Rated-1.932
Token'
Token ID6
Feature activation+4.30e-01

Pos logit contributions
idth+0.962
esters+0.844
ammy+0.783
ilyn+0.771
uv+0.769
Neg logit contributions
 mathemat-0.577
ADRA-0.521
��-0.520
ingly-0.508
MAP-0.494
Token MORE
Token ID12011
Feature activation-1.71e+00

Pos logit contributions
 mathemat+0.950
ingly+0.926
MAP+0.892
WARN+0.892
��+0.889
Neg logit contributions
idth-1.649
esters-1.376
ammy-1.320
ilyn-1.312
ples-1.283
Token as
Token ID355
Feature activation-8.69e-01

Pos logit contributions
idth+4.379
esters+3.770
ammy+3.417
uv+3.333
 Rated+3.312
Neg logit contributions
 mathemat-5.657
ADRA-5.514
��-5.463
WARN-5.281
MAP-5.140
Token a
Token ID257
Feature activation-2.08e-01

Pos logit contributions
idth+2.797
esters+2.386
ammy+2.209
ilyn+2.155
 Rated+2.151
Neg logit contributions
 mathemat-2.490
ADRA-2.348
��-2.336
WARN-2.245
ingly-2.240
Token 68
Token ID8257
Feature activation+3.09e-01

Pos logit contributions
idth+0.758
esters+0.653
ammy+0.611
ilyn+0.602
 Rated+0.597
Neg logit contributions
 mathemat-0.521
ADRA-0.485
��-0.482
ingly-0.467
MAP-0.459
Token-
Token ID12
Feature activation+1.73e-01

Pos logit contributions
 mathemat+0.871
ingly+0.818
��+0.816
ADRA+0.813
MAP+0.791
Neg logit contributions
idth-1.014
esters-0.860
ammy-0.786
ilyn-0.785
uv-0.763
Tokenyear
Token ID1941
Feature activation-1.20e-01

Pos logit contributions
 mathemat+0.503
ADRA+0.469
��+0.467
ingly+0.457
MAP+0.448
Neg logit contributions
idth-0.567
esters-0.478
ammy-0.439
ilyn-0.433
uv-0.430
Token common
Token ID2219
Feature activation+4.14e-01

Pos logit contributions
idth+0.582
esters+0.499
ammy+0.448
uv+0.445
ilyn+0.445
Neg logit contributions
 mathemat-0.586
��-0.566
ADRA-0.562
MAP-0.542
ingly-0.542
Token as
Token ID355
Feature activation-3.04e-01

Pos logit contributions
 mathemat+1.266
��+1.183
ADRA+1.182
ingly+1.168
MAP+1.145
Neg logit contributions
idth-1.282
esters-1.072
ammy-0.977
ilyn-0.964
uv-0.950
Token we
Token ID356
Feature activation+2.74e-01

Pos logit contributions
idth+0.942
esters+0.802
ammy+0.720
ilyn+0.715
uv+0.714
Neg logit contributions
 mathemat-0.899
��-0.864
ADRA-0.862
WARN-0.829
ingly-0.827
Token become
Token ID1716
Feature activation+7.86e-01

Pos logit contributions
 mathemat+0.731
��+0.689
ADRA+0.689
ingly+0.666
MAP+0.661
Neg logit contributions
idth-0.949
esters-0.819
ammy-0.752
ilyn-0.742
uv-0.737
Token more
Token ID517
Feature activation+2.56e-01

Pos logit contributions
 mathemat+2.043
��+1.884
ADRA+1.884
ingly+1.855
MAP+1.812
Neg logit contributions
idth-2.803
esters-2.402
ammy-2.223
ilyn-2.196
uv-2.171
Token aware
Token ID3910
Feature activation+1.43e+00

Pos logit contributions
 mathemat+0.674
��+0.639
ADRA+0.636
ingly+0.615
MAP+0.609
Neg logit contributions
idth-0.889
esters-0.769
ammy-0.704
ilyn-0.697
uv-0.693
Token of
Token ID286
Feature activation+5.16e-01

Pos logit contributions
 mathemat+3.561
ingly+3.313
��+3.172
ADRA+3.157
MAP+3.075
Neg logit contributions
idth-5.229
esters-4.419
ammy-4.179
ilyn-4.072
uv-4.036
Token unconscious
Token ID14917
Feature activation-2.96e-01

Pos logit contributions
 mathemat+1.480
��+1.390
ADRA+1.389
ingly+1.357
MAP+1.339
Neg logit contributions
idth-1.687
esters-1.433
ammy-1.306
ilyn-1.294
uv-1.284
Token bias
Token ID10690
Feature activation-8.33e-01

Pos logit contributions
idth+0.885
esters+0.753
ples+0.670
ammy+0.667
ilyn+0.667
Neg logit contributions
 mathemat-0.907
��-0.877
ADRA-0.868
WARN-0.837
MAP-0.829
Token,
Token ID11
Feature activation+8.47e-02

Pos logit contributions
idth+2.428
esters+2.061
ammy+1.824
ilyn+1.800
uv+1.797
Neg logit contributions
 mathemat-2.606
��-2.503
ADRA-2.497
WARN-2.390
ingly-2.377
Token
Token ID447
Feature activation+1.30e-01

Pos logit contributions
 mathemat+0.252
��+0.237
ADRA+0.237
ingly+0.232
MAP+0.228
Neg logit contributions
idth-0.268
esters-0.226
ammy-0.205
ilyn-0.204
uv-0.201
Token
Token ID198
Feature activation+5.21e-01

Pos logit contributions
 mathemat+0.043
��+0.040
ADRA+0.040
ingly+0.039
MAP+0.038
Neg logit contributions
idth-0.061
esters-0.052
ammy-0.048
ilyn-0.048
uv-0.047
Token
Token ID198
Feature activation+2.91e-01

Pos logit contributions
 mathemat+1.291
��+1.189
ADRA+1.188
ingly+1.171
MAP+1.143
Neg logit contributions
idth-1.916
esters-1.650
ammy-1.532
ilyn-1.515
uv-1.496
TokenTom
Token ID13787
Feature activation+1.61e-01

Pos logit contributions
 mathemat+0.837
��+0.774
ADRA+0.772
ingly+0.761
MAP+0.744
Neg logit contributions
idth-0.963
esters-0.815
ammy-0.750
ilyn-0.738
uv-0.733
Token Holland
Token ID16070
Feature activation-8.37e-02

Pos logit contributions
 mathemat+0.506
��+0.474
ADRA+0.474
ingly+0.469
MAP+0.459
Neg logit contributions
idth-0.487
esters-0.405
ammy-0.368
ilyn-0.362
uv-0.357
Token goal
Token ID3061
Feature activation+4.92e-01

Pos logit contributions
idth+0.267
esters+0.226
ammy+0.207
 Rated+0.204
uv+0.204
Neg logit contributions
 mathemat-0.246
ADRA-0.233
��-0.232
ingly-0.225
MAP-0.222
Token dis
Token ID595
Feature activation+1.09e+00

Pos logit contributions
 mathemat+1.426
ADRA+1.328
��+1.325
ingly+1.300
MAP+1.274
Neg logit contributions
idth-1.614
esters-1.363
ammy-1.253
ilyn-1.232
uv-1.223
Tokenallowed
Token ID40845
Feature activation+5.98e-01

Pos logit contributions
 mathemat+2.354
��+2.139
ingly+2.129
ADRA+2.126
MAP+2.057
Neg logit contributions
idth-4.328
esters-3.757
ammy-3.498
ilyn-3.494
uv-3.441
Token for
Token ID329
Feature activation+2.20e-01

Pos logit contributions
 mathemat+1.773
ADRA+1.654
��+1.650
ingly+1.623
MAP+1.596
Neg logit contributions
idth-1.915
esters-1.610
ammy-1.475
ilyn-1.454
uv-1.440
Token off
Token ID572
Feature activation+6.63e-01

Pos logit contributions
 mathemat+0.731
ADRA+0.690
��+0.684
ingly+0.674
MAP+0.667
Neg logit contributions
idth-0.628
esters-0.513
ammy-0.468
uv-0.457
ilyn-0.455
Tokenside
Token ID1589
Feature activation+1.81e-01

Pos logit contributions
 mathemat+2.348
��+2.217
ADRA+2.214
ingly+2.204
MAP+2.158
Neg logit contributions
idth-1.727
esters-1.391
ammy-1.239
ilyn-1.222
uv-1.191
Token in
Token ID287
Feature activation+1.89e-01

Pos logit contributions
 mathemat+0.512
ADRA+0.476
��+0.474
ingly+0.464
MAP+0.457
Neg logit contributions
idth-0.607
esters-0.513
ammy-0.473
ilyn-0.464
uv-0.463
Token as
Token ID355
Feature activation+3.31e-01

Pos logit contributions
 mathemat+0.647
ADRA+0.610
��+0.610
ingly+0.591
MAP+0.585
Neg logit contributions
idth-0.649
esters-0.549
ammy-0.499
ilyn-0.491
uv-0.488
Token if
Token ID611
Feature activation+4.03e-01

Pos logit contributions
 mathemat+0.927
ADRA+0.865
��+0.863
ingly+0.843
MAP+0.831
Neg logit contributions
idth-1.108
esters-0.946
ammy-0.872
ilyn-0.857
uv-0.850
Token the
Token ID262
Feature activation+3.79e-02

Pos logit contributions
 mathemat+1.244
ADRA+1.163
��+1.163
ingly+1.145
MAP+1.125
Neg logit contributions
idth-1.240
esters-1.035
ammy-0.943
ilyn-0.929
uv-0.917
Token way
Token ID835
Feature activation-6.40e-01

Pos logit contributions
 mathemat+0.111
��+0.104
ADRA+0.104
ingly+0.101
MAP+0.100
Neg logit contributions
idth-0.122
esters-0.103
ammy-0.095
ilyn-0.094
uv-0.093
Token to
Token ID284
Feature activation+3.68e-01

Pos logit contributions
idth+1.876
esters+1.543
ammy+1.382
uv+1.373
ilyn+1.373
Neg logit contributions
 mathemat-2.086
ADRA-1.966
��-1.963
WARN-1.884
MAP-1.884
Token engage
Token ID8209
Feature activation+9.08e-01

Pos logit contributions
 mathemat+1.015
��+0.944
ADRA+0.942
ingly+0.926
MAP+0.908
Neg logit contributions
idth-1.253
esters-1.068
ammy-0.982
ilyn-0.969
uv-0.961
Token a
Token ID257
Feature activation-3.43e-02

Pos logit contributions
 mathemat+2.604
ingly+2.438
ADRA+2.400
��+2.380
MAP+2.336
Neg logit contributions
idth-2.947
esters-2.494
ammy-2.319
ilyn-2.256
 Rated-2.218
Token digital
Token ID4875
Feature activation+3.18e-01

Pos logit contributions
idth+0.111
esters+0.092
ammy+0.085
ilyn+0.085
uv+0.084
Neg logit contributions
 mathemat-0.101
��-0.095
ADRA-0.095
ingly-0.092
WARN-0.091
Token-
Token ID12
Feature activation+4.16e-01

Pos logit contributions
 mathemat+0.961
��+0.898
ADRA+0.896
ingly+0.868
MAP+0.859
Neg logit contributions
idth-1.012
esters-0.842
ammy-0.773
ilyn-0.767
uv-0.761
Tokensav
Token ID39308
Feature activation+4.72e-01

Pos logit contributions
 mathemat+1.172
ADRA+1.088
��+1.086
ingly+1.065
MAP+1.045
Neg logit contributions
idth-1.396
esters-1.184
ammy-1.089
ilyn-1.074
uv-1.065
Tokenvy
Token ID7670
Feature activation+2.38e-01

Pos logit contributions
 mathemat+1.811
ADRA+1.716
��+1.716
ingly+1.697
MAP+1.672
Neg logit contributions
idth-1.098
esters-0.858
ammy-0.750
ilyn-0.734
uv-0.719
Token some
Token ID617
Feature activation+1.27e-01

Pos logit contributions
 mathemat+0.132
��+0.122
ADRA+0.121
ingly+0.119
WARN+0.117
Neg logit contributions
idth-0.140
esters-0.118
ammy-0.108
ilyn-0.106
 Rated-0.106
Token of
Token ID286
Feature activation+2.08e-01

Pos logit contributions
 mathemat+0.393
ADRA+0.369
��+0.368
ingly+0.357
WARN+0.353
Neg logit contributions
idth-0.391
esters-0.330
ammy-0.298
ples-0.293
ilyn-0.293
Token whom
Token ID4150
Feature activation+3.94e-01

Pos logit contributions
 mathemat+0.521
��+0.474
ADRA+0.473
ingly+0.460
WARN+0.452
Neg logit contributions
idth-0.770
esters-0.664
ammy-0.614
ilyn-0.605
uv-0.601
Token have
Token ID423
Feature activation+2.28e-01

Pos logit contributions
 mathemat+1.042
��+0.964
ADRA+0.958
ingly+0.937
MAP+0.923
Neg logit contributions
idth-1.394
esters-1.194
ammy-1.103
ilyn-1.090
 Rated-1.076
Token moved
Token ID3888
Feature activation+2.85e-01

Pos logit contributions
 mathemat+0.672
ADRA+0.627
��+0.626
ingly+0.610
MAP+0.603
Neg logit contributions
idth-0.736
esters-0.620
ammy-0.565
ilyn-0.560
 Rated-0.554
Token with
Token ID351
Feature activation+9.61e-01

Pos logit contributions
 mathemat+0.913
ADRA+0.861
��+0.857
ingly+0.833
WARN+0.826
Neg logit contributions
idth-0.844
esters-0.704
ammy-0.638
ilyn-0.625
uv-0.621
Token local
Token ID1957
Feature activation-1.38e-01

Pos logit contributions
 mathemat+2.682
��+2.488
ADRA+2.488
ingly+2.451
MAP+2.400
Neg logit contributions
idth-3.242
esters-2.752
ammy-2.533
ilyn-2.501
uv-2.471
Token forces
Token ID3386
Feature activation+3.26e-01

Pos logit contributions
idth+0.490
esters+0.419
ammy+0.391
ples+0.385
ilyn+0.384
Neg logit contributions
 mathemat-0.363
��-0.333
ADRA-0.325
WARN-0.315
MAP-0.313
Token within
Token ID1626
Feature activation+8.41e-01

Pos logit contributions
 mathemat+1.025
ADRA+0.961
��+0.955
ingly+0.936
MAP+0.924
Neg logit contributions
idth-0.989
esters-0.827
ammy-0.753
ilyn-0.739
uv-0.734
Token 20
Token ID1160
Feature activation+2.46e-01

Pos logit contributions
 mathemat+2.534
��+2.366
ADRA+2.366
ingly+2.335
MAP+2.289
Neg logit contributions
idth-2.648
esters-2.220
ammy-2.029
ilyn-1.999
uv-1.972
Token miles
Token ID4608
Feature activation+5.75e-01

Pos logit contributions
 mathemat+0.749
��+0.698
ADRA+0.698
ingly+0.688
MAP+0.673
Neg logit contributions
idth-0.765
esters-0.640
ammy-0.585
ilyn-0.578
uv-0.573
Token].
Token ID4083
Feature activation-6.69e-02

Pos logit contributions
 mathemat+1.445
ADRA+1.337
��+1.325
ingly+1.315
MAP+1.281
Neg logit contributions
idth-1.862
esters-1.596
ammy-1.473
ilyn-1.451
uv-1.425
Tokeneach
Token ID27379
Feature activation+2.18e-01

Pos logit contributions
idth+0.268
esters+0.233
ammy+0.219
ilyn+0.216
uv+0.214
Neg logit contributions
 mathemat-0.145
ADRA-0.132
��-0.131
ingly-0.128
MAP-0.125
Token do
Token ID466
Feature activation+3.95e-01

Pos logit contributions
 mathemat+0.674
��+0.630
ADRA+0.630
ingly+0.622
MAP+0.610
Neg logit contributions
idth-0.672
esters-0.560
ammy-0.511
ilyn-0.504
uv-0.497
Token |
Token ID930
Feature activation+6.61e-01

Pos logit contributions
 mathemat+1.221
��+1.142
ADRA+1.142
ingly+1.127
MAP+1.106
Neg logit contributions
idth-1.211
esters-1.010
ammy-0.920
ilyn-0.907
uv-0.894
Tokeni
Token ID72
Feature activation+2.71e-01

Pos logit contributions
 mathemat+1.992
��+1.867
ADRA+1.858
ingly+1.846
MAP+1.809
Neg logit contributions
idth-2.055
esters-1.730
ammy-1.579
ilyn-1.547
uv-1.523
Token|
Token ID91
Feature activation+9.19e-01

Pos logit contributions
 mathemat+0.929
ADRA+0.876
��+0.875
ingly+0.866
MAP+0.851
Neg logit contributions
idth-0.741
esters-0.603
ammy-0.542
ilyn-0.532
uv-0.520
Token print
Token ID3601
Feature activation+5.23e-01

Pos logit contributions
 mathemat+2.340
��+2.142
ADRA+2.140
ingly+2.123
MAP+2.074
Neg logit contributions
idth-3.297
esters-2.853
ammy-2.637
ilyn-2.602
uv-2.572
Token "#
Token ID25113
Feature activation+5.78e-01

Pos logit contributions
 mathemat+1.408
ADRA+1.299
��+1.296
ingly+1.295
MAP+1.254
Neg logit contributions
idth-1.803
esters-1.548
ammy-1.428
ilyn-1.404
uv-1.393
Token{
Token ID90
Feature activation-8.17e-02

Pos logit contributions
 mathemat+1.990
��+1.880
ADRA+1.880
ingly+1.866
MAP+1.841
Neg logit contributions
idth-1.544
esters-1.259
ammy-1.125
ilyn-1.109
uv-1.085
Tokeni
Token ID72
Feature activation-7.81e-02

Pos logit contributions
idth+0.250
esters+0.209
ammy+0.191
ilyn+0.186
uv+0.184
Neg logit contributions
 mathemat-0.252
ADRA-0.236
��-0.236
ingly-0.234
MAP-0.230
Token}
Token ID92
Feature activation+5.33e-02

Pos logit contributions
idth+0.267
esters+0.227
ammy+0.211
ilyn+0.207
uv+0.204
Neg logit contributions
 mathemat-0.214
ADRA-0.199
��-0.198
ingly-0.197
WARN-0.191
Token-
Token ID12
Feature activation+1.15e-01

Pos logit contributions
idth+1.340
esters+1.146
ammy+1.035
ples+1.032
ilyn+1.021
Neg logit contributions
 mathemat-1.207
��-1.124
ADRA-1.122
ingly-1.090
MAP-1.086
Tokenteam
Token ID15097
Feature activation+7.48e-02

Pos logit contributions
 mathemat+0.308
��+0.285
ADRA+0.285
ingly+0.281
MAP+0.275
Neg logit contributions
idth-0.402
esters-0.343
ammy-0.317
ilyn-0.314
uv-0.310
Token repet
Token ID46152
Feature activation+1.02e+00

Pos logit contributions
 mathemat+0.279
ADRA+0.263
��+0.263
ingly+0.257
WARN+0.256
Neg logit contributions
idth-0.182
esters-0.146
ammy-0.128
ilyn-0.125
ples-0.124
Tokenitions
Token ID1756
Feature activation-1.45e-01

Pos logit contributions
 mathemat+2.758
ingly+2.586
��+2.578
ADRA+2.560
MAP+2.483
Neg logit contributions
idth-3.442
esters-2.870
ammy-2.715
ilyn-2.644
uv-2.616
Token.
Token ID13
Feature activation-1.18e-01

Pos logit contributions
idth+0.435
esters+0.366
ammy+0.331
ples+0.325
uv+0.323
Neg logit contributions
 mathemat-0.459
ADRA-0.438
��-0.430
WARN-0.417
MAP-0.416
Token
Token ID198
Feature activation+5.50e-02

Pos logit contributions
idth+0.376
esters+0.318
 Rated+0.293
ammy+0.292
ilyn+0.287
Neg logit contributions
 mathemat-0.360
ADRA-0.327
��-0.325
ingly-0.317
WARN-0.311
Token
Token ID198
Feature activation+1.33e-02

Pos logit contributions
 mathemat+0.150
ADRA+0.136
��+0.135
ingly+0.129
MAP+0.127
Neg logit contributions
idth-0.194
esters-0.167
ammy-0.154
uv-0.152
ilyn-0.152
TokenLyn
Token ID37207
Feature activation+6.66e-01

Pos logit contributions
 mathemat+0.039
��+0.035
ADRA+0.035
ingly+0.034
MAP+0.033
Neg logit contributions
idth-0.044
esters-0.037
ammy-0.035
uv-0.034
ilyn-0.034
Tokench
Token ID354
Feature activation+2.50e-01

Pos logit contributions
 mathemat+1.959
ADRA+1.817
��+1.813
ingly+1.781
MAP+1.754
Neg logit contributions
idth-2.163
esters-1.829
ammy-1.678
ilyn-1.658
uv-1.639
Token had
Token ID550
Feature activation+4.72e-01

Pos logit contributions
 mathemat+0.739
ADRA+0.693
��+0.689
ingly+0.675
MAP+0.669
Neg logit contributions
idth-0.800
esters-0.679
ammy-0.620
ilyn-0.611
uv-0.604
Token one
Token ID530
Feature activation-1.88e-01

Pos logit contributions
 mathemat+1.384
��+1.289
ADRA+1.287
ingly+1.272
MAP+1.244
Neg logit contributions
idth-1.524
esters-1.283
ammy-1.177
ilyn-1.160
uv-1.145
Token),
Token ID828
Feature activation-5.14e-01

Pos logit contributions
idth+1.915
esters+1.590
ammy+1.426
ilyn+1.404
uv+1.396
Neg logit contributions
 mathemat-2.113
��-2.010
ADRA-1.992
ingly-1.938
MAP-1.926
Token John
Token ID1757
Feature activation-3.00e-01

Pos logit contributions
idth+1.633
esters+1.344
ammy+1.243
ilyn+1.220
 Rated+1.213
Neg logit contributions
 mathemat-1.558
ADRA-1.467
��-1.458
ingly-1.404
WARN-1.396
Token T
Token ID309
Feature activation-1.24e-01

Pos logit contributions
idth+0.951
esters+0.798
ammy+0.730
ilyn+0.720
uv+0.711
Neg logit contributions
 mathemat-0.895
��-0.836
ADRA-0.835
ingly-0.825
MAP-0.808
Tokenrav
Token ID4108
Feature activation-4.69e-01

Pos logit contributions
idth+0.345
esters+0.287
uv+0.256
ammy+0.256
ilyn+0.250
Neg logit contributions
 mathemat-0.435
��-0.402
ADRA-0.401
MAP-0.390
WARN-0.388
Tokenolt
Token ID5978
Feature activation+5.10e-02

Pos logit contributions
idth+1.644
esters+1.413
ammy+1.297
ilyn+1.286
uv+1.267
Neg logit contributions
 mathemat-1.255
ADRA-1.163
��-1.159
ingly-1.132
MAP-1.120
Tokena
Token ID64
Feature activation+9.76e-03

Pos logit contributions
 mathemat+0.120
ADRA+0.110
��+0.108
ingly+0.105
MAP+0.104
Neg logit contributions
idth-0.196
esters-0.170
uv-0.158
ammy-0.158
ilyn-0.157
Token and
Token ID290
Feature activation-1.44e-01

Pos logit contributions
 mathemat+0.025
��+0.023
ADRA+0.023
ingly+0.023
MAP+0.022
Neg logit contributions
idth-0.035
esters-0.030
ammy-0.028
ilyn-0.028
uv-0.027
Token Tom
Token ID4186
Feature activation-1.34e-01

Pos logit contributions
idth+0.456
esters+0.380
ammy+0.350
ilyn+0.343
 Rated+0.339
Neg logit contributions
 mathemat-0.435
ADRA-0.407
��-0.405
ingly-0.394
MAP-0.390
Token Cruise
Token ID29147
Feature activation-7.96e-01

Pos logit contributions
idth+0.456
esters+0.388
ammy+0.358
ilyn+0.353
uv+0.349
Neg logit contributions
 mathemat-0.372
ADRA-0.345
��-0.344
ingly-0.338
MAP-0.332
Token (
Token ID357
Feature activation-4.01e-01

Pos logit contributions
idth+2.387
esters+2.006
ammy+1.849
uv+1.803
ilyn+1.787
Neg logit contributions
 mathemat-2.494
ADRA-2.361
��-2.333
MAP-2.239
WARN-2.234
Tokenfor
Token ID1640
Feature activation-2.38e-01

Pos logit contributions
idth+1.410
esters+1.214
ammy+1.126
ilyn+1.107
uv+1.106
Neg logit contributions
 mathemat-1.079
ADRA-0.987
��-0.986
ingly-0.932
MAP-0.920
Token in
Token ID287
Feature activation-5.63e-02

Pos logit contributions
idth+1.761
esters+1.491
ilyn+1.358
ammy+1.350
uv+1.346
Neg logit contributions
 mathemat-1.705
ADRA-1.618
��-1.597
MAP-1.530
WARN-1.523
Token the
Token ID262
Feature activation-6.08e-02

Pos logit contributions
idth+0.180
esters+0.152
ammy+0.139
ilyn+0.138
uv+0.138
Neg logit contributions
 mathemat-0.165
ADRA-0.154
��-0.154
ingly-0.151
WARN-0.149
Token eyes
Token ID2951
Feature activation+4.89e-01

Pos logit contributions
idth+0.200
esters+0.171
ammy+0.159
uv+0.155
 Rated+0.155
Neg logit contributions
 mathemat-0.172
ADRA-0.161
��-0.161
ingly-0.155
WARN-0.154
Token of
Token ID286
Feature activation+1.30e-01

Pos logit contributions
 mathemat+1.395
ADRA+1.293
��+1.283
ingly+1.246
MAP+1.228
Neg logit contributions
idth-1.640
esters-1.391
ammy-1.278
ilyn-1.264
uv-1.260
Token his
Token ID465
Feature activation+2.48e-01

Pos logit contributions
 mathemat+0.373
��+0.352
ADRA+0.349
ingly+0.339
MAP+0.338
Neg logit contributions
idth-0.426
esters-0.358
ammy-0.326
uv-0.324
ilyn-0.324
Token opponents
Token ID7691
Feature activation+8.33e-01

Pos logit contributions
 mathemat+0.619
��+0.581
ADRA+0.575
ingly+0.554
MAP+0.547
Neg logit contributions
idth-0.908
esters-0.787
uv-0.722
ammy-0.721
ilyn-0.717
Token.
Token ID13
Feature activation-1.63e-02

Pos logit contributions
 mathemat+2.394
��+2.225
ADRA+2.220
ingly+2.210
MAP+2.154
Neg logit contributions
idth-2.732
esters-2.298
ammy-2.115
ilyn-2.085
uv-2.053
Token Social
Token ID5483
Feature activation-3.12e-02

Pos logit contributions
idth+0.050
esters+0.042
 Rated+0.039
ilyn+0.038
ammy+0.038
Neg logit contributions
 mathemat-0.051
��-0.047
ADRA-0.046
ingly-0.045
MAP-0.045
Token movements
Token ID8650
Feature activation-8.22e-02

Pos logit contributions
idth+0.109
esters+0.094
ammy+0.086
ilyn+0.085
ples+0.084
Neg logit contributions
 mathemat-0.082
��-0.077
ADRA-0.076
ingly-0.074
MAP-0.074
Token are
Token ID389
Feature activation-1.66e-01

Pos logit contributions
idth+0.261
esters+0.220
ammy+0.202
ilyn+0.199
uv+0.198
Neg logit contributions
 mathemat-0.243
��-0.230
ADRA-0.227
ingly-0.222
MAP-0.220
Token back
Token ID736
Feature activation+9.63e-03

Pos logit contributions
idth+0.534
esters+0.456
ammy+0.416
ilyn+0.413
uv+0.411
Neg logit contributions
 mathemat-0.482
ADRA-0.452
��-0.452
ingly-0.440
MAP-0.436
Token cracked
Token ID21368
Feature activation-1.79e-02

Pos logit contributions
 mathemat+0.325
ADRA+0.307
��+0.305
ingly+0.298
MAP+0.295
Neg logit contributions
idth-0.357
esters-0.303
ammy-0.275
ilyn-0.273
uv-0.272
Token areas
Token ID3006
Feature activation+4.93e-01

Pos logit contributions
idth+0.056
esters+0.047
ammy+0.042
uv+0.042
ilyn+0.042
Neg logit contributions
 mathemat-0.054
ADRA-0.051
��-0.051
ingly-0.049
MAP-0.049
Token in
Token ID287
Feature activation-4.27e-02

Pos logit contributions
 mathemat+1.439
��+1.342
ADRA+1.335
ingly+1.329
MAP+1.297
Neg logit contributions
idth-1.597
esters-1.343
ammy-1.233
ilyn-1.217
uv-1.195
Token the
Token ID262
Feature activation+3.54e-01

Pos logit contributions
idth+0.133
esters+0.112
ammy+0.101
uv+0.100
ilyn+0.100
Neg logit contributions
 mathemat-0.130
ADRA-0.122
��-0.122
ingly-0.119
MAP-0.118
Token bones
Token ID11945
Feature activation-1.42e-01

Pos logit contributions
 mathemat+1.081
ADRA+1.018
��+1.015
ingly+0.995
MAP+0.978
Neg logit contributions
idth-1.100
esters-0.924
ammy-0.837
ilyn-0.825
uv-0.822
Token,
Token ID11
Feature activation+1.89e-01

Pos logit contributions
idth+0.449
esters+0.378
ammy+0.345
ilyn+0.338
uv+0.338
Neg logit contributions
 mathemat-0.427
ADRA-0.402
��-0.399
ingly-0.391
MAP-0.384
Token and
Token ID290
Feature activation+8.08e-02

Pos logit contributions
 mathemat+0.539
ADRA+0.505
��+0.504
ingly+0.493
MAP+0.484
Neg logit contributions
idth-0.625
esters-0.532
ammy-0.485
ilyn-0.480
uv-0.478
Token meticulous
Token ID47628
Feature activation+2.21e-01

Pos logit contributions
 mathemat+0.231
ADRA+0.217
��+0.216
ingly+0.210
MAP+0.207
Neg logit contributions
idth-0.265
esters-0.227
ammy-0.206
ilyn-0.204
uv-0.203
Token pre
Token ID662
Feature activation+8.44e-01

Pos logit contributions
 mathemat+0.640
ADRA+0.599
��+0.597
ingly+0.585
MAP+0.575
Neg logit contributions
idth-0.719
esters-0.608
ammy-0.555
ilyn-0.549
uv-0.544
Token-
Token ID12
Feature activation-7.63e-02

Pos logit contributions
 mathemat+2.442
ingly+2.270
��+2.265
ADRA+2.262
MAP+2.218
Neg logit contributions
idth-2.724
esters-2.270
ammy-2.100
ilyn-2.073
uv-2.037
Tokencle
Token ID2375
Feature activation+5.84e-02

Pos logit contributions
idth+0.240
esters+0.204
ammy+0.184
ilyn+0.183
uv+0.182
Neg logit contributions
 mathemat-0.229
ADRA-0.216
��-0.214
ingly-0.209
MAP-0.204
Token later
Token ID1568
Feature activation-7.07e-02

Pos logit contributions
 mathemat+0.855
��+0.786
ADRA+0.785
ingly+0.756
MAP+0.745
Neg logit contributions
idth-1.313
esters-1.146
ammy-1.062
ilyn-1.045
uv-1.042
Token in
Token ID287
Feature activation-1.28e-01

Pos logit contributions
idth+0.242
esters+0.206
ammy+0.190
ilyn+0.187
uv+0.185
Neg logit contributions
 mathemat-0.194
ADRA-0.180
��-0.179
ingly-0.176
MAP-0.173
Token the
Token ID262
Feature activation-2.62e-01

Pos logit contributions
idth+0.360
esters+0.297
ammy+0.270
ples+0.263
uv+0.262
Neg logit contributions
 mathemat-0.429
ADRA-0.405
��-0.401
ingly-0.392
MAP-0.390
Token election
Token ID3071
Feature activation-1.24e-01

Pos logit contributions
idth+0.786
esters+0.675
ammy+0.623
uv+0.603
ples+0.601
Neg logit contributions
 mathemat-0.802
ADRA-0.762
��-0.739
WARN-0.725
MAP-0.723
Token cycle
Token ID6772
Feature activation+2.27e-01

Pos logit contributions
idth+0.390
esters+0.331
ammy+0.302
ilyn+0.295
uv+0.292
Neg logit contributions
 mathemat-0.372
ADRA-0.348
��-0.345
ingly-0.338
MAP-0.333
Token than
Token ID621
Feature activation+4.02e-01

Pos logit contributions
 mathemat+0.689
��+0.643
ADRA+0.643
ingly+0.635
MAP+0.622
Neg logit contributions
idth-0.711
esters-0.595
ammy-0.543
ilyn-0.536
uv-0.529
Token the
Token ID262
Feature activation-1.74e-01

Pos logit contributions
 mathemat+1.189
ADRA+1.108
��+1.107
ingly+1.089
MAP+1.069
Neg logit contributions
idth-1.290
esters-1.088
ammy-0.995
ilyn-0.980
uv-0.969
Token GW
Token ID27164
Feature activation-1.01e-01

Pos logit contributions
idth+0.528
esters+0.447
ammy+0.408
ples+0.401
 Rated+0.397
Neg logit contributions
 mathemat-0.540
ADRA-0.506
��-0.499
ingly-0.487
MAP-0.483
Token Batt
Token ID12350
Feature activation+5.04e-01

Pos logit contributions
idth+0.374
esters+0.323
ammy+0.299
ilyn+0.294
uv+0.293
Neg logit contributions
 mathemat-0.251
ADRA-0.230
��-0.229
ingly-0.225
MAP-0.219
Tokenleground
Token ID28272
Feature activation+1.99e-01

Pos logit contributions
 mathemat+1.130
��+1.033
ADRA+1.031
ingly+1.019
MAP+0.990
Neg logit contributions
idth-1.969
esters-1.710
ammy-1.596
ilyn-1.581
uv-1.564
Token poll
Token ID3278
Feature activation+1.16e-01

Pos logit contributions
 mathemat+0.556
ADRA+0.516
��+0.515
ingly+0.506
MAP+0.496
Neg logit contributions
idth-0.671
esters-0.572
ammy-0.525
ilyn-0.517
uv-0.513
Token any
Token ID597
Feature activation+3.51e-01

Pos logit contributions
 mathemat+2.102
��+1.971
ingly+1.966
ADRA+1.960
MAP+1.922
Neg logit contributions
idth-1.678
esters-1.352
ammy-1.208
ilyn-1.185
uv-1.161
Token bad
Token ID2089
Feature activation-6.57e-01

Pos logit contributions
 mathemat+0.983
ADRA+0.915
��+0.915
ingly+0.897
MAP+0.877
Neg logit contributions
idth-1.181
esters-1.006
ammy-0.927
ilyn-0.914
uv-0.906
Token blood
Token ID2910
Feature activation-3.54e-02

Pos logit contributions
idth+1.649
esters+1.344
ammy+1.202
ilyn+1.188
uv+1.179
Neg logit contributions
 mathemat-2.383
ADRA-2.248
��-2.231
ingly-2.178
WARN-2.152
Token will
Token ID481
Feature activation-3.34e-01

Pos logit contributions
idth+0.116
esters+0.099
ammy+0.091
ilyn+0.089
ples+0.089
Neg logit contributions
 mathemat-0.102
ADRA-0.095
��-0.094
ingly-0.091
WARN-0.091
Token carry
Token ID3283
Feature activation-2.52e-01

Pos logit contributions
idth+1.128
esters+0.962
ammy+0.886
uv+0.868
ilyn+0.861
Neg logit contributions
 mathemat-0.947
ADRA-0.883
��-0.866
MAP-0.843
WARN-0.838
Token over
Token ID625
Feature activation-7.18e-01

Pos logit contributions
idth+0.800
esters+0.677
ammy+0.624
ilyn+0.614
uv+0.614
Neg logit contributions
 mathemat-0.747
ADRA-0.709
��-0.695
MAP-0.674
ingly-0.672
Token between
Token ID1022
Feature activation-3.54e-02

Pos logit contributions
idth+2.071
esters+1.746
ammy+1.583
ilyn+1.558
uv+1.551
Neg logit contributions
 mathemat-2.318
ADRA-2.221
��-2.176
WARN-2.100
ingly-2.086
Token seasons
Token ID7028
Feature activation+4.79e-01

Pos logit contributions
idth+0.109
esters+0.092
ammy+0.083
ilyn+0.082
uv+0.082
Neg logit contributions
 mathemat-0.109
ADRA-0.102
��-0.102
ingly-0.100
MAP-0.098
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
 mathemat+1.405
��+1.310
ADRA+1.305
ingly+1.298
MAP+1.268
Neg logit contributions
idth-1.546
esters-1.299
ammy-1.190
ilyn-1.176
uv-1.157
Token noting
Token ID10820
Feature activation+2.02e-01

Pos logit contributions
idth+0.358
esters+0.305
ammy+0.278
ilyn+0.276
ples+0.274
Neg logit contributions
 mathemat-0.323
ADRA-0.305
��-0.302
ingly-0.295
WARN-0.292
Token that
Token ID326
Feature activation+1.59e-01

Pos logit contributions
 mathemat+0.599
��+0.557
ADRA+0.555
ingly+0.554
MAP+0.540
Neg logit contributions
idth-0.647
esters-0.542
ammy-0.497
ilyn-0.489
uv-0.480
Token initiate
Token ID22118
Feature activation-1.64e-02

Pos logit contributions
 mathemat+0.641
ADRA+0.600
��+0.600
ingly+0.582
MAP+0.575
Neg logit contributions
idth-0.799
esters-0.683
ammy-0.629
ilyn-0.619
uv-0.615
Token what
Token ID644
Feature activation+5.89e-01

Pos logit contributions
idth+0.052
esters+0.044
ammy+0.040
ilyn+0.040
uv+0.039
Neg logit contributions
 mathemat-0.049
��-0.046
ADRA-0.046
ingly-0.045
MAP-0.044
Token people
Token ID661
Feature activation-2.96e-01

Pos logit contributions
 mathemat+1.726
��+1.595
ingly+1.590
ADRA+1.589
MAP+1.542
Neg logit contributions
idth-1.911
esters-1.599
ammy-1.474
ilyn-1.447
uv-1.432
Token inside
Token ID2641
Feature activation+2.52e-01

Pos logit contributions
idth+0.937
esters+0.808
ilyn+0.730
ammy+0.730
 Rated+0.723
Neg logit contributions
 mathemat-0.859
ADRA-0.825
��-0.815
ingly-0.787
MAP-0.784
Token Goldman
Token ID18740
Feature activation+1.99e-01

Pos logit contributions
 mathemat+0.815
ADRA+0.763
��+0.763
ingly+0.755
MAP+0.740
Neg logit contributions
idth-0.740
esters-0.609
ammy-0.553
ilyn-0.543
uv-0.536
Token would
Token ID561
Feature activation-1.48e-01

Pos logit contributions
 mathemat+0.552
��+0.519
ADRA+0.516
ingly+0.506
MAP+0.495
Neg logit contributions
idth-0.671
esters-0.571
ammy-0.525
ilyn-0.521
 Rated-0.513
Token eventually
Token ID4191
Feature activation-1.93e-01

Pos logit contributions
idth+0.516
esters+0.443
ilyn+0.405
ammy+0.405
uv+0.401
Neg logit contributions
 mathemat-0.394
ADRA-0.370
��-0.369
MAP-0.353
ingly-0.351
Token dub
Token ID10873
Feature activation+1.98e-01

Pos logit contributions
idth+0.731
esters+0.634
ammy+0.586
ilyn+0.585
 Rated+0.580
Neg logit contributions
 mathemat-0.449
ADRA-0.423
��-0.421
MAP-0.400
WARN-0.397
Token
Token ID564
Feature activation+1.20e-01

Pos logit contributions
 mathemat+0.575
ADRA+0.537
��+0.536
ingly+0.532
MAP+0.516
Neg logit contributions
idth-0.645
esters-0.541
ammy-0.498
ilyn-0.490
uv-0.486
Token
Token ID250
Feature activation+7.65e-03

Pos logit contributions
 mathemat+0.250
ADRA+0.222
��+0.219
ingly+0.217
MAP+0.213
Neg logit contributions
idth-0.491
esters-0.432
ammy-0.406
ilyn-0.400
 Rated-0.397
Tokenthe
Token ID1169
Feature activation+1.02e-01

Pos logit contributions
 mathemat+0.022
��+0.020
ADRA+0.020
ingly+0.020
MAP+0.020
Neg logit contributions
idth-0.025
esters-0.021
ammy-0.020
ilyn-0.019
uv-0.019
Tokenwhen
Token ID12518
Feature activation-4.73e-02

Pos logit contributions
 mathemat+0.793
��+0.735
ADRA+0.735
ingly+0.716
MAP+0.695
Neg logit contributions
idth-0.956
esters-0.814
ammy-0.756
uv-0.741
ilyn-0.741
Token Sam
Token ID3409
Feature activation+2.32e-01

Pos logit contributions
idth+0.166
esters+0.143
ammy+0.132
ilyn+0.130
uv+0.129
Neg logit contributions
 mathemat-0.125
��-0.116
ADRA-0.116
ingly-0.113
MAP-0.111
Token Bac
Token ID35583
Feature activation+2.07e-01

Pos logit contributions
 mathemat+0.675
ADRA+0.629
��+0.629
ingly+0.620
MAP+0.607
Neg logit contributions
idth-0.752
esters-0.633
ammy-0.581
ilyn-0.573
uv-0.565
Tokenile
Token ID576
Feature activation+1.31e-01

Pos logit contributions
 mathemat+0.764
ADRA+0.725
��+0.715
MAP+0.699
WARN+0.694
Neg logit contributions
idth-0.519
esters-0.411
ammy-0.368
ilyn-0.366
uv-0.365
Token would
Token ID561
Feature activation-2.46e-01

Pos logit contributions
 mathemat+0.360
ADRA+0.338
��+0.337
ingly+0.328
MAP+0.323
Neg logit contributions
idth-0.447
esters-0.382
ammy-0.352
ilyn-0.346
 Rated-0.344
Token be
Token ID307
Feature activation-3.07e-01

Pos logit contributions
idth+0.836
esters+0.713
ammy+0.657
 Rated+0.645
uv+0.644
Neg logit contributions
 mathemat-0.682
��-0.643
ADRA-0.641
WARN-0.614
MAP-0.614
Token arrested
Token ID5169
Feature activation-4.63e-01

Pos logit contributions
idth+0.964
esters+0.838
ammy+0.770
 Rated+0.764
uv+0.758
Neg logit contributions
 mathemat-0.901
��-0.854
ADRA-0.846
WARN-0.816
ingly-0.815
Token"
Token ID1
Feature activation+2.65e-01

Pos logit contributions
idth+1.354
esters+1.133
ammy+1.023
ilyn+1.009
 Rated+1.006
Neg logit contributions
 mathemat-1.495
ADRA-1.408
��-1.406
WARN-1.363
ingly-1.358
Token drew
Token ID9859
Feature activation+1.44e-01

Pos logit contributions
 mathemat+0.785
ADRA+0.739
��+0.736
ingly+0.716
MAP+0.709
Neg logit contributions
idth-0.842
esters-0.712
ammy-0.653
ilyn-0.643
uv-0.636
Token wide
Token ID3094
Feature activation+1.57e-01

Pos logit contributions
 mathemat+0.430
ADRA+0.402
��+0.401
ingly+0.392
MAP+0.389
Neg logit contributions
idth-0.459
esters-0.386
ammy-0.355
ilyn-0.350
uv-0.346
Token attention
Token ID3241
Feature activation+3.73e-01

Pos logit contributions
 mathemat+0.439
ADRA+0.410
��+0.409
ingly+0.399
MAP+0.395
Neg logit contributions
idth-0.527
esters-0.447
ammy-0.410
ilyn-0.408
uv-0.404
Token right
Token ID826
Feature activation-1.39e-01

Pos logit contributions
 mathemat+1.983
ADRA+1.836
��+1.833
ingly+1.822
MAP+1.775
Neg logit contributions
idth-2.354
esters-1.992
ammy-1.830
ilyn-1.811
uv-1.778
Token now
Token ID783
Feature activation+7.35e-02

Pos logit contributions
idth+0.473
esters+0.406
ammy+0.372
ilyn+0.369
uv+0.367
Neg logit contributions
 mathemat-0.386
��-0.361
ADRA-0.360
MAP-0.344
ingly-0.343
Token,
Token ID11
Feature activation+2.85e-01

Pos logit contributions
 mathemat+0.224
��+0.211
ADRA+0.209
ingly+0.205
MAP+0.203
Neg logit contributions
idth-0.229
esters-0.192
ammy-0.175
ilyn-0.173
uv-0.171
Token you
Token ID345
Feature activation+5.79e-04

Pos logit contributions
 mathemat+0.821
��+0.769
ADRA+0.765
ingly+0.745
MAP+0.736
Neg logit contributions
idth-0.939
esters-0.797
ammy-0.731
ilyn-0.721
uv-0.713
Token really
Token ID1107
Feature activation+7.05e-01

Pos logit contributions
 mathemat+0.002
��+0.002
ADRA+0.002
MAP+0.002
WARN+0.001
Neg logit contributions
idth-0.002
esters-0.002
ammy-0.001
ilyn-0.001
 Rated-0.001
Token feel
Token ID1254
Feature activation-3.78e-01

Pos logit contributions
 mathemat+1.737
��+1.602
ADRA+1.602
ingly+1.562
MAP+1.538
Neg logit contributions
idth-2.606
esters-2.248
ammy-2.088
ilyn-2.063
uv-2.046
Token as
Token ID355
Feature activation+3.02e-01

Pos logit contributions
idth+1.102
esters+0.922
uv+0.840
ilyn+0.830
ammy+0.829
Neg logit contributions
 mathemat-1.190
��-1.142
ADRA-1.135
MAP-1.095
WARN-1.083
Token if
Token ID611
Feature activation+5.25e-01

Pos logit contributions
 mathemat+0.859
��+0.803
ADRA+0.800
ingly+0.776
MAP+0.773
Neg logit contributions
idth-0.999
esters-0.845
ammy-0.779
uv-0.770
ilyn-0.769
Token you
Token ID345
Feature activation+1.17e-01

Pos logit contributions
 mathemat+1.542
��+1.432
ADRA+1.432
ingly+1.422
MAP+1.388
Neg logit contributions
idth-1.694
esters-1.425
ammy-1.305
ilyn-1.289
uv-1.269
Token
Token ID447
Feature activation+1.72e-01

Pos logit contributions
 mathemat+0.364
ADRA+0.345
��+0.344
ingly+0.334
MAP+0.333
Neg logit contributions
idth-0.354
esters-0.297
ammy-0.270
ilyn-0.265
uv-0.264
Token
Token ID247
Feature activation-2.27e-02

Pos logit contributions
 mathemat+0.398
ADRA+0.349
��+0.336
ingly+0.336
MAP+0.333
Neg logit contributions
idth-0.674
esters-0.591
ammy-0.556
ples-0.545
uv-0.544
Token Stadium
Token ID10499
Feature activation-2.87e-01

Pos logit contributions
 mathemat+1.535
ADRA+1.432
��+1.430
ingly+1.410
MAP+1.384
Neg logit contributions
idth-1.654
esters-1.393
ammy-1.277
ilyn-1.259
uv-1.245
Token,
Token ID11
Feature activation+5.85e-02

Pos logit contributions
idth+0.848
esters+0.717
ammy+0.655
uv+0.645
 Rated+0.645
Neg logit contributions
 mathemat-0.901
ADRA-0.849
��-0.842
ingly-0.820
MAP-0.812
Token and
Token ID290
Feature activation-8.99e-03

Pos logit contributions
 mathemat+0.172
��+0.162
ADRA+0.162
ingly+0.158
WARN+0.156
Neg logit contributions
idth-0.186
esters-0.158
ammy-0.145
 Rated-0.143
ilyn-0.142
Token forced
Token ID4137
Feature activation+2.85e-01

Pos logit contributions
idth+0.028
esters+0.023
ammy+0.021
 Rated+0.021
ilyn+0.021
Neg logit contributions
 mathemat-0.028
ADRA-0.026
��-0.026
ingly-0.025
MAP-0.025
Token to
Token ID284
Feature activation-2.45e-01

Pos logit contributions
 mathemat+0.923
ingly+0.870
ADRA+0.863
��+0.862
MAP+0.836
Neg logit contributions
idth-0.838
esters-0.684
ammy-0.628
ilyn-0.619
uv-0.599
Token watch
Token ID2342
Feature activation-2.53e-01

Pos logit contributions
idth+0.795
esters+0.673
ammy+0.620
uv+0.612
ples+0.608
Neg logit contributions
 mathemat-0.708
��-0.667
ADRA-0.664
MAP-0.640
ingly-0.636
Token eight
Token ID3624
Feature activation-3.07e-02

Pos logit contributions
idth+0.761
esters+0.634
ammy+0.576
 Rated+0.575
uv+0.570
Neg logit contributions
 mathemat-0.784
��-0.744
ADRA-0.742
WARN-0.716
MAP-0.715
Token people
Token ID661
Feature activation-4.86e-01

Pos logit contributions
idth+0.104
esters+0.089
ammy+0.082
ilyn+0.081
uv+0.081
Neg logit contributions
 mathemat-0.084
��-0.078
ADRA-0.078
ingly-0.075
MAP-0.075
Token,
Token ID11
Feature activation-1.72e-01

Pos logit contributions
idth+1.662
esters+1.453
ammy+1.337
uv+1.322
 Rated+1.316
Neg logit contributions
 mathemat-1.286
ADRA-1.203
��-1.198
MAP-1.153
WARN-1.144
Token who
Token ID508
Feature activation-2.10e-02

Pos logit contributions
idth+0.510
esters+0.430
ammy+0.394
 Rated+0.385
ilyn+0.383
Neg logit contributions
 mathemat-0.540
ADRA-0.517
��-0.511
WARN-0.494
MAP-0.492
Token were
Token ID547
Feature activation+1.14e-01

Pos logit contributions
idth+0.068
esters+0.059
ammy+0.054
ilyn+0.053
 Rated+0.053
Neg logit contributions
 mathemat-0.060
��-0.056
ADRA-0.056
MAP-0.054
ingly-0.054
Token
Token ID198
Feature activation-8.39e-02

Pos logit contributions
idth+0.606
esters+0.523
ammy+0.482
uv+0.477
ilyn+0.476
Neg logit contributions
 mathemat-0.459
ADRA-0.418
��-0.416
ingly-0.401
MAP-0.394
Token
Token ID198
Feature activation-7.40e-02

Pos logit contributions
idth+0.291
esters+0.251
ammy+0.231
uv+0.230
 Rated+0.229
Neg logit contributions
 mathemat-0.235
ADRA-0.212
��-0.210
ingly-0.199
 juven-0.197
Token1996
Token ID22288
Feature activation-1.44e-01

Pos logit contributions
idth+0.174
esters+0.139
ammy+0.119
uv+0.118
ilyn+0.118
Neg logit contributions
 mathemat-0.285
��-0.266
ADRA-0.266
ingly-0.262
MAP-0.257
Token Jerry
Token ID13075
Feature activation+3.64e-02

Pos logit contributions
idth+0.428
esters+0.355
ammy+0.324
ilyn+0.319
 Rated+0.315
Neg logit contributions
 mathemat-0.459
ADRA-0.429
��-0.428
ingly-0.419
MAP-0.412
Token Se
Token ID1001
Feature activation-1.28e-01

Pos logit contributions
 mathemat+0.104
��+0.097
ADRA+0.097
ingly+0.096
MAP+0.094
Neg logit contributions
idth-0.120
esters-0.101
ammy-0.093
ilyn-0.092
uv-0.091
Tokeninfeld
Token ID47187
Feature activation-1.51e+00

Pos logit contributions
idth+0.211
esters+0.147
ammy+0.116
ilyn+0.112
uv+0.110
Neg logit contributions
 mathemat-0.583
��-0.556
ADRA-0.555
ingly-0.548
MAP-0.543
Token
Token ID198
Feature activation-1.11e-01

Pos logit contributions
idth+5.147
esters+4.433
ammy+4.040
uv+4.012
ilyn+3.999
Neg logit contributions
 mathemat-4.124
ADRA-3.799
��-3.791
ingly-3.705
WARN-3.651
Token
Token ID198
Feature activation-8.31e-02

Pos logit contributions
idth+0.381
esters+0.329
ammy+0.302
uv+0.302
 Rated+0.300
Neg logit contributions
 mathemat-0.315
ADRA-0.284
��-0.280
ingly-0.264
 juven-0.264
Token1996
Token ID22288
Feature activation-1.65e-01

Pos logit contributions
idth+0.220
esters+0.172
ammy+0.159
ilyn+0.154
 Rated+0.148
Neg logit contributions
 mathemat-0.292
ADRA-0.279
��-0.279
ingly-0.277
MAP-0.274
Token Garry
Token ID45354
Feature activation-4.05e-01

Pos logit contributions
idth+0.567
esters+0.484
ammy+0.448
ilyn+0.442
 Rated+0.438
Neg logit contributions
 mathemat-0.452
ADRA-0.420
��-0.418
ingly-0.406
MAP-0.399
Token Sh
Token ID911
Feature activation-3.41e-02

Pos logit contributions
idth+1.054
esters+0.851
ammy+0.761
ilyn+0.748
uv+0.738
Neg logit contributions
 mathemat-1.449
ADRA-1.361
��-1.359
ingly-1.330
MAP-1.315
Token ACLU
Token ID24381
Feature activation+4.04e-02

Pos logit contributions
idth+0.814
esters+0.688
ammy+0.634
ilyn+0.623
 Rated+0.619
Neg logit contributions
 mathemat-0.722
��-0.674
ADRA-0.670
ingly-0.657
WARN-0.644
Token of
Token ID286
Feature activation+1.55e-01

Pos logit contributions
 mathemat+0.119
ADRA+0.109
��+0.109
ingly+0.105
MAP+0.103
Neg logit contributions
idth-0.133
esters-0.112
ammy-0.103
uv-0.102
ilyn-0.102
Token Oregon
Token ID8819
Feature activation+3.38e-01

Pos logit contributions
 mathemat+0.334
ADRA+0.301
��+0.301
ingly+0.293
MAP+0.286
Neg logit contributions
idth-0.624
esters-0.545
ammy-0.510
ilyn-0.505
uv-0.500
Token (@
Token ID4275
Feature activation-2.24e-01

Pos logit contributions
 mathemat+0.952
ADRA+0.880
��+0.879
ingly+0.864
MAP+0.846
Neg logit contributions
idth-1.138
esters-0.967
ammy-0.889
ilyn-0.878
uv-0.868
TokenAC
Token ID2246
Feature activation-6.61e-01

Pos logit contributions
idth+0.638
esters+0.535
ammy+0.482
uv+0.479
 Rated+0.474
Neg logit contributions
 mathemat-0.783
��-0.698
ADRA-0.689
 juven-0.675
ingly-0.662
TokenLU
Token ID41596
Feature activation-1.14e+00

Pos logit contributions
idth+2.929
esters+2.662
uv+2.523
ammy+2.494
 Rated+2.484
Neg logit contributions
 mathemat-1.206
��-0.945
ADRA-0.926
ingly-0.876
 juven-0.874
Token_
Token ID62
Feature activation-6.26e-01

Pos logit contributions
idth+2.307
esters+1.821
uv+1.610
ples+1.517
ammy+1.512
Neg logit contributions
 mathemat-5.053
��-4.477
ADRA-4.445
 juven-4.421
 Leopard-4.147
TokenOR
Token ID1581
Feature activation+1.10e-01

Pos logit contributions
idth+1.748
esters+1.470
uv+1.341
 Rated+1.310
ples+1.308
Neg logit contributions
 mathemat-2.210
��-1.926
ADRA-1.925
 juven-1.905
ingly-1.876
Token)
Token ID8
Feature activation-5.58e-01

Pos logit contributions
 mathemat+0.356
ADRA+0.327
��+0.327
ingly+0.317
MAP+0.313
Neg logit contributions
idth-0.329
esters-0.274
ammy-0.249
ilyn-0.245
uv-0.244
Token May
Token ID1737
Feature activation-2.28e-01

Pos logit contributions
idth+1.141
esters+0.877
 Rated+0.752
ammy+0.744
ilyn+0.739
Neg logit contributions
 mathemat-2.361
ADRA-2.202
��-2.182
ingly-2.112
MAP-2.092
Token 29
Token ID2808
Feature activation+2.36e-01

Pos logit contributions
idth+0.703
esters+0.587
ammy+0.535
ilyn+0.528
uv+0.521
Neg logit contributions
 mathemat-0.704
��-0.658
ADRA-0.658
ingly-0.649
MAP-0.637
Token.
Token ID13
Feature activation+1.87e-01

Pos logit contributions
 mathemat+0.278
ADRA+0.262
��+0.262
ingly+0.261
MAP+0.255
Neg logit contributions
idth-0.278
esters-0.231
ammy-0.213
ilyn-0.209
 Rated-0.206
Token (
Token ID357
Feature activation+2.50e-02

Pos logit contributions
 mathemat+0.544
ADRA+0.501
��+0.501
ingly+0.486
MAP+0.477
Neg logit contributions
idth-0.619
esters-0.523
ammy-0.479
ilyn-0.476
 Rated-0.473
Token 1981
Token ID14745
Feature activation-4.39e-01

Pos logit contributions
 mathemat+0.093
��+0.087
ADRA+0.087
ingly+0.085
MAP+0.084
Neg logit contributions
idth-0.062
esters-0.050
ammy-0.044
ilyn-0.043
uv-0.043
Token ).
Token ID6739
Feature activation-3.24e-01

Pos logit contributions
idth+1.528
esters+1.306
ammy+1.206
ilyn+1.190
uv+1.175
Neg logit contributions
 mathemat-1.177
��-1.089
ADRA-1.089
ingly-1.074
MAP-1.050
Token A
Token ID317
Feature activation-1.02e-01

Pos logit contributions
idth+1.017
esters+0.856
 Rated+0.781
ammy+0.778
ilyn+0.770
Neg logit contributions
 mathemat-0.988
��-0.918
ADRA-0.910
ingly-0.893
WARN-0.877
Token psych
Token ID3795
Feature activation-1.18e+00

Pos logit contributions
idth+0.342
esters+0.291
ammy+0.266
ilyn+0.265
uv+0.261
Neg logit contributions
 mathemat-0.285
��-0.266
ADRA-0.265
ingly-0.260
WARN-0.255
Tokenometric
Token ID16996
Feature activation+8.49e-02

Pos logit contributions
idth+3.378
esters+2.791
ilyn+2.503
ammy+2.491
uv+2.425
Neg logit contributions
 mathemat-3.841
��-3.683
ADRA-3.647
ingly-3.515
MAP-3.480
Token investigation
Token ID3645
Feature activation-1.79e-01

Pos logit contributions
 mathemat+0.245
��+0.230
ADRA+0.228
ingly+0.222
MAP+0.220
Neg logit contributions
idth-0.277
esters-0.234
ammy-0.213
ilyn-0.213
 Rated-0.210
Token of
Token ID286
Feature activation+2.47e-01

Pos logit contributions
idth+0.601
esters+0.507
ammy+0.465
ilyn+0.461
uv+0.456
Neg logit contributions
 mathemat-0.505
ADRA-0.472
��-0.471
ingly-0.460
MAP-0.452
Token the
Token ID262
Feature activation+1.02e-01

Pos logit contributions
 mathemat+0.713
��+0.666
ADRA+0.666
ingly+0.653
MAP+0.642
Neg logit contributions
idth-0.808
esters-0.683
ammy-0.625
ilyn-0.618
uv-0.610
Token standard
Token ID3210
Feature activation-2.17e-01

Pos logit contributions
 mathemat+0.294
��+0.277
ADRA+0.277
ingly+0.270
WARN+0.266
Neg logit contributions
idth-0.333
esters-0.280
ammy-0.258
ilyn-0.254
 Rated-0.252
Tokent
Token ID83
Feature activation-6.28e-01

Pos logit contributions
idth+0.973
esters+0.810
ammy+0.737
uv+0.735
ilyn+0.727
Neg logit contributions
 mathemat-1.042
ADRA-0.958
��-0.940
 juven-0.907
ingly-0.901
Token.
Token ID13
Feature activation-7.66e-01

Pos logit contributions
idth+1.821
esters+1.508
ammy+1.376
uv+1.362
ples+1.349
Neg logit contributions
 mathemat-2.139
ADRA-1.967
��-1.950
 juven-1.863
ingly-1.834
Tokenco
Token ID1073
Feature activation-5.52e-01

Pos logit contributions
idth+2.021
esters+1.638
uv+1.480
ammy+1.470
ilyn+1.467
Neg logit contributions
 mathemat-2.820
ADRA-2.617
��-2.551
 juven-2.466
ingly-2.441
Token/
Token ID14
Feature activation-7.79e-01

Pos logit contributions
idth+1.321
esters+1.046
uv+0.916
ammy+0.911
ilyn+0.903
Neg logit contributions
 mathemat-2.155
ADRA-1.994
��-1.982
 juven-1.911
ingly-1.908
TokenX
Token ID55
Feature activation-1.10e+00

Pos logit contributions
idth+2.428
esters+2.029
uv+1.913
ammy+1.878
ilyn+1.849
Neg logit contributions
 mathemat-2.405
��-2.211
ADRA-2.156
ingly-2.135
 Leopard-2.091
Tokenh
Token ID71
Feature activation-1.14e+00

Pos logit contributions
idth+3.393
esters+2.852
uv+2.717
ammy+2.644
ilyn+2.600
Neg logit contributions
 mathemat-3.382
��-3.108
ADRA-3.006
ingly-2.960
 Leopard-2.924
Token7
Token ID22
Feature activation-9.26e-01

Pos logit contributions
idth+3.531
esters+2.935
uv+2.797
ammy+2.732
ples+2.685
Neg logit contributions
 mathemat-3.492
��-3.228
ADRA-3.108
ingly-3.091
 Leopard-3.025
TokenJ
Token ID41
Feature activation-1.05e+00

Pos logit contributions
idth+2.962
esters+2.484
uv+2.365
ammy+2.313
ilyn+2.274
Neg logit contributions
 mathemat-2.777
��-2.549
ADRA-2.469
ingly-2.439
 Leopard-2.389
Tokenm
Token ID76
Feature activation-1.08e+00

Pos logit contributions
idth+3.231
esters+2.694
uv+2.552
ammy+2.501
ilyn+2.456
Neg logit contributions
 mathemat-3.287
��-3.038
ADRA-2.937
ingly-2.887
 Leopard-2.852
TokenB
Token ID33
Feature activation-1.05e+00

Pos logit contributions
idth+3.337
esters+2.761
uv+2.642
ammy+2.580
ilyn+2.536
Neg logit contributions
 mathemat-3.336
��-3.069
ADRA-2.948
ingly-2.942
 Leopard-2.883
Tokenz
Token ID89
Feature activation-1.03e+00

Pos logit contributions
idth+3.148
esters+2.592
uv+2.457
ammy+2.390
ilyn+2.358
Neg logit contributions
 mathemat-3.381
��-3.138
ADRA-3.035
ingly-2.974
 juven-2.948
Tokenwww
Token ID2503
Feature activation+4.70e-02

Pos logit contributions
idth+2.061
esters+1.725
ammy+1.538
uv+1.537
ples+1.515
Neg logit contributions
 mathemat-2.522
ADRA-2.354
��-2.295
 juven-2.206
MAP-2.200
Token.
Token ID13
Feature activation-4.45e-01

Pos logit contributions
 mathemat+0.143
ADRA+0.133
��+0.133
ingly+0.131
MAP+0.128
Neg logit contributions
idth-0.147
esters-0.124
ammy-0.113
ilyn-0.111
uv-0.110
Tokenbel
Token ID6667
Feature activation-8.18e-01

Pos logit contributions
idth+1.423
esters+1.198
ammy+1.084
uv+1.070
ilyn+1.060
Neg logit contributions
 mathemat-1.373
ADRA-1.283
��-1.261
MAP-1.222
 Leopard-1.207
Tokenfast
Token ID7217
Feature activation-8.94e-01

Pos logit contributions
idth+2.668
esters+2.263
uv+2.069
ples+2.064
ammy+2.041
Neg logit contributions
 mathemat-2.555
ADRA-2.301
��-2.265
 juven-2.199
MAP-2.153
Tokente
Token ID660
Feature activation-5.97e-01

Pos logit contributions
idth+2.834
esters+2.411
ammy+2.198
uv+2.181
ples+2.165
Neg logit contributions
 mathemat-2.792
ADRA-2.623
��-2.552
 juven-2.416
MAP-2.387
Tokenlegraph
Token ID16606
Feature activation-1.36e+00

Pos logit contributions
idth+1.589
esters+1.352
uv+1.185
ammy+1.185
ples+1.150
Neg logit contributions
 mathemat-2.191
ADRA-2.023
��-1.990
 juven-1.940
MAP-1.908
Token.
Token ID13
Feature activation-6.51e-01

Pos logit contributions
idth+3.413
esters+2.860
uv+2.592
 Rated+2.523
ples+2.519
Neg logit contributions
 mathemat-5.333
ADRA-4.746
��-4.691
 juven-4.610
MAP-4.220
Tokenco
Token ID1073
Feature activation-4.69e-01

Pos logit contributions
idth+1.641
esters+1.362
uv+1.211
ammy+1.206
ples+1.171
Neg logit contributions
 mathemat-2.506
ADRA-2.298
��-2.245
 juven-2.198
MAP-2.132
Token.
Token ID13
Feature activation-7.80e-01

Pos logit contributions
idth+1.335
esters+1.122
uv+1.014
ammy+1.005
 Rated+0.992
Neg logit contributions
 mathemat-1.634
ADRA-1.489
��-1.476
 juven-1.413
MAP-1.376
Tokenuk
Token ID2724
Feature activation-8.06e-01

Pos logit contributions
idth+2.167
esters+1.877
uv+1.756
ammy+1.702
ples+1.634
Neg logit contributions
 mathemat-2.802
ADRA-2.500
��-2.446
 juven-2.437
MAP-2.279
Token/
Token ID14
Feature activation-1.22e+00

Pos logit contributions
idth+1.779
esters+1.414
 Rated+1.230
uv+1.226
ammy+1.210
Neg logit contributions
 mathemat-3.343
ADRA-3.051
��-3.032
 juven-2.953
ingly-2.861