TOP ACTIVATIONS
MAX = 1.310

x . x . 164 103 15 24 x . x
x . x . 151 103 30 29 x . x
x . x . 171 102 30 86 x . x
x . x . 95 103 16 53 x . x
in form , in moving , how express an admirable
Kn ob which appears rec essed into a hill
in moving , how express an admirable ! in action how
est ical roof fre tted with golden fire ,
x . x . 135 106 19 41 x . x
by bi otechnology critic Jeremy R if kin , protest the
favorite favorite favorite favorite favorite - December 5 , 2009
x . x . 91 106 99 63 x . x
8 341 c 4 fe 353 ef 015 390 b 0
brute grew hard panels of imp ossibly thick bone all over
x . x . 173 107 10 52 x . x
x . x . 80 106 27 66 x . x
+ Natural leader , Bon v ive ur , Char
the National Council of La R aza and the American Friends
x . x . 123 104 66 29 x . x
keeps adding fuel to the bon fire . His comment to

MIN ACTIVATIONS
MIN = -1.174

unused work from between 1985 and 1990 . Check out more
three meaningful actions you can take each week .
x . x . x . 176 98 37 00 x
at http :// www - n rd . n h ts
more movie news , follow EW s Anthony Bre
were other top targets of consumer wrath . In
C FP B launched a consumer help center where Americans can
lling . OB B Par ce lling
( which , it should be noted , came two months
Nobody in their right mind would say that
planet , no matter what our backgrounds may be ,
invoke it locally with sl s invoke local - f test
20 21 22 23 24 25 26 27 # loop over
. Dem o Application Architecture All the
of contributors are their own and are not the views of
is just that and must be taken with a grain of
receive occasional updates and special offers for The New York Times
ll send you three meaningful actions you can take each week
policy has now been well te leg rap hed and will
, B aid u and Y and ex . When you

INTERVAL 0.689 - 1.310
CONTAINS 0.193%

. x . 104 10 446 01 x . x .
high - level of intellectual rig or . But they surrounded
said the size of the bonus pool each year has historically
. should be clone able , and those clones
it time for us to go ? Ell en

INTERVAL 0.068 - 0.689
CONTAINS 36.446%

He beats opponents physically and gets under their skin between plays
[ since the trade ], going back there , but I
against Lenin and Trotsky , for worldwide anti -
, which is his welcoming song to the kids
, the Store ! There s a

INTERVAL -0.553 - 0.068
CONTAINS 62.596%

does not solely pay attention to polls and fundraising , but
back in June . Talk To employs a large number of
use themselves , giving rise to an appeal to the U
be ). Disclaimer : These tips are based on
curl , a comeback , a nine , or a sl

INTERVAL -1.174 - -0.553
CONTAINS 0.765%

[ 10 : 48 ] < min ion 13 37 >
, Louis H . Feldman , James L . K ug
. Stephan opol ous will be in Boston on Tuesday morning
is even scar ier than her appearance now .
Black President - White Vs Black - Jer kin Off On
Tokenx
Token ID87
Feature activation+1.17e-01

Pos logit contributions
theless+4.229
ammu+3.261
emo+3.051
ellen+2.998
ayer+2.989
Neg logit contributions
ourke-4.996
versions-4.559
acters-4.245
onda-4.177
ggle-4.107
Token.
Token ID13
Feature activation-2.46e-01

Pos logit contributions
ourke+1.169
versions+1.071
acters+0.936
eca+0.914
onda+0.912
Neg logit contributions
theless-1.768
ーテ-1.360
ammu-1.345
 Lauder-1.290
 Gors-1.286
Tokenx
Token ID87
Feature activation+1.22e-01

Pos logit contributions
theless+2.901
ammu+2.217
emo+2.064
ayer+2.047
ellen+2.041
Neg logit contributions
ourke-3.055
versions-2.798
acters-2.573
onda-2.522
ggle-2.478
Token.
Token ID13
Feature activation-2.62e-01

Pos logit contributions
ourke+1.228
versions+1.130
acters+0.973
eca+0.966
onda+0.961
Neg logit contributions
theless-1.836
ーテ-1.406
ammu-1.394
 Lauder-1.344
 Gors-1.332
Token164
Token ID23237
Feature activation+6.63e-02

Pos logit contributions
theless+2.978
ammu+2.214
emo+2.042
ーテ+2.016
ayer+2.014
Neg logit contributions
ourke-3.429
versions-3.114
acters-2.941
onda-2.850
ggle-2.791
Token 103
Token ID15349
Feature activation+1.31e+00

Pos logit contributions
ourke+0.758
versions+0.700
acters+0.625
onda+0.616
eca+0.610
Neg logit contributions
theless-0.876
ammu-0.667
ーテ-0.645
 Lauder-0.626
ayer-0.626
Token15
Token ID1314
Feature activation+1.47e-01

Pos logit contributions
ourke+7.241
versions+6.757
eca+6.291
ington+5.917
acters+5.892
Neg logit contributions
theless-19.382
etheless-16.763
 eleph-16.692
 subur-16.362
 pione-15.780
Token24
Token ID1731
Feature activation-5.55e-01

Pos logit contributions
ourke+1.597
versions+1.467
acters+1.299
eca+1.286
onda+1.271
Neg logit contributions
theless-2.037
ammu-1.545
ーテ-1.536
 Lauder-1.466
ayer-1.460
Token x
Token ID2124
Feature activation+1.11e-01

Pos logit contributions
theless+3.545
ammu+2.109
 Lauder+1.937
 Gors+1.866
ellen+1.848
Neg logit contributions
ourke-9.426
versions-8.912
acters-8.344
onda-8.272
ggle-8.182
Token.
Token ID13
Feature activation-4.20e-01

Pos logit contributions
ourke+1.103
versions+1.010
acters+0.882
eca+0.862
onda+0.858
Neg logit contributions
theless-1.678
ーテ-1.286
ammu-1.278
 Lauder-1.224
 Gors-1.224
Tokenx
Token ID87
Feature activation+1.31e-01

Pos logit contributions
theless+4.451
ammu+3.443
emo+3.223
ellen+3.167
ayer+3.145
Neg logit contributions
ourke-5.436
versions-4.966
acters-4.628
onda-4.555
 defe-4.484
Tokenx
Token ID87
Feature activation+1.64e-01

Pos logit contributions
theless+3.894
ammu+2.994
emo+2.800
ellen+2.764
ayer+2.753
Neg logit contributions
ourke-4.530
versions-4.144
acters-3.840
onda-3.780
ggle-3.712
Token.
Token ID13
Feature activation-1.89e-01

Pos logit contributions
ourke+1.602
versions+1.470
acters+1.277
eca+1.253
onda+1.243
Neg logit contributions
theless-2.519
ーテ-1.954
ammu-1.912
 Lauder-1.840
 Gors-1.836
Tokenx
Token ID87
Feature activation-4.17e-03

Pos logit contributions
theless+2.293
ammu+1.738
emo+1.621
ーテ+1.613
ayer+1.613
Neg logit contributions
ourke-2.320
versions-2.130
acters-1.942
onda-1.909
eca-1.877
Token.
Token ID13
Feature activation-4.28e-01

Pos logit contributions
theless+0.059
ammu+0.046
ーテ+0.045
 Lauder+0.043
 Gors+0.043
Neg logit contributions
ourke-0.044
versions-0.041
acters-0.036
onda-0.035
eca-0.035
Token151
Token ID24309
Feature activation+3.39e-02

Pos logit contributions
theless+4.724
ammu+3.643
ellen+3.263
ayer+3.166
mere+3.118
Neg logit contributions
ourke-5.501
versions-4.916
acters-4.800
ggle-4.592
yg-4.563
Token 103
Token ID15349
Feature activation+1.28e+00

Pos logit contributions
ourke+0.398
versions+0.367
acters+0.330
onda+0.325
eca+0.321
Neg logit contributions
theless-0.438
ammu-0.331
ーテ-0.319
 Lauder-0.310
ayer-0.310
Token30
Token ID1270
Feature activation+1.15e-01

Pos logit contributions
ourke+6.849
versions+6.478
eca+5.895
(-+5.659
ington+5.601
Neg logit contributions
theless-19.246
 eleph-17.073
etheless-16.945
 subur-16.774
 pione-16.191
Token29
Token ID1959
Feature activation-5.25e-01

Pos logit contributions
ourke+1.309
versions+1.205
acters+1.076
eca+1.060
onda+1.054
Neg logit contributions
theless-1.540
ammu-1.162
ーテ-1.138
 Lauder-1.096
ayer-1.092
Token x
Token ID2124
Feature activation+5.97e-02

Pos logit contributions
theless+3.559
ammu+2.111
 Lauder+1.927
 Gors+1.838
emo+1.833
Neg logit contributions
ourke-8.878
versions-8.383
acters-7.823
onda-7.751
ggle-7.666
Token.
Token ID13
Feature activation-3.36e-01

Pos logit contributions
ourke+0.610
versions+0.559
acters+0.490
onda+0.478
eca+0.478
Neg logit contributions
theless-0.883
ammu-0.675
ーテ-0.672
 Lauder-0.645
 Gors-0.643
Tokenx
Token ID87
Feature activation+4.64e-02

Pos logit contributions
theless+3.785
ammu+2.888
emo+2.695
ellen+2.663
ayer+2.658
Neg logit contributions
ourke-4.269
versions-3.898
acters-3.612
onda-3.556
ggle-3.499
Tokenx
Token ID87
Feature activation+1.31e-01

Pos logit contributions
theless+4.140
ammu+3.192
emo+2.981
ellen+2.937
ayer+2.928
Neg logit contributions
ourke-4.957
versions-4.538
acters-4.211
onda-4.147
ggle-4.075
Token.
Token ID13
Feature activation-2.79e-01

Pos logit contributions
ourke+1.300
versions+1.192
acters+1.039
eca+1.018
onda+1.013
Neg logit contributions
theless-1.986
ーテ-1.533
ammu-1.507
 Lauder-1.448
 Gors-1.445
Tokenx
Token ID87
Feature activation+9.69e-03

Pos logit contributions
theless+3.203
ammu+2.452
emo+2.288
ayer+2.261
ellen+2.257
Neg logit contributions
ourke-3.512
versions-3.220
acters-2.964
onda-2.908
ggle-2.859
Token.
Token ID13
Feature activation-8.82e-01

Pos logit contributions
ourke+0.102
versions+0.093
acters+0.082
onda+0.081
eca+0.080
Neg logit contributions
theless-0.139
ammu-0.107
ーテ-0.105
 Lauder-0.102
 Gors-0.101
Token171
Token ID27192
Feature activation+5.97e-02

Pos logit contributions
theless+6.862
ammu+5.573
wolves+4.699
ayer+4.509
ivably+4.408
Neg logit contributions
ourke-12.964
acters-11.443
versions-11.086
onde-10.895
eca-10.859
Token 102
Token ID15143
Feature activation+1.26e+00

Pos logit contributions
ourke+0.680
versions+0.628
acters+0.560
onda+0.551
eca+0.546
Neg logit contributions
theless-0.794
ammu-0.605
ーテ-0.585
 Lauder-0.569
ayer-0.568
Token30
Token ID1270
Feature activation+1.54e-01

Pos logit contributions
ourke+6.898
versions+6.597
eca+6.138
ington+5.722
(-+5.640
Neg logit contributions
theless-18.782
etheless-16.074
 eleph-15.809
 subur-15.649
ーテ-15.268
Token86
Token ID4521
Feature activation-3.29e-01

Pos logit contributions
ourke+1.689
versions+1.556
acters+1.371
eca+1.362
onda+1.345
Neg logit contributions
theless-2.136
ーテ-1.615
ammu-1.613
 Lauder-1.533
ayer-1.528
Token x
Token ID2124
Feature activation+3.93e-02

Pos logit contributions
theless+2.532
ammu+1.579
 Lauder+1.435
emo+1.409
ーテ+1.400
Neg logit contributions
ourke-5.441
versions-5.113
acters-4.761
onda-4.711
ggle-4.673
Token.
Token ID13
Feature activation-4.27e-01

Pos logit contributions
ourke+0.405
versions+0.371
acters+0.326
onda+0.318
eca+0.318
Neg logit contributions
theless-0.576
ammu-0.440
ーテ-0.437
 Lauder-0.419
 Gors-0.418
Tokenx
Token ID87
Feature activation+6.82e-02

Pos logit contributions
theless+4.494
ammu+3.473
emo+3.244
ellen+3.191
ayer+3.167
Neg logit contributions
ourke-5.542
versions-5.059
acters-4.726
onda-4.652
 defe-4.590
Tokenx
Token ID87
Feature activation+1.49e-01

Pos logit contributions
theless+4.144
ammu+3.185
emo+2.974
ellen+2.932
ayer+2.923
Neg logit contributions
ourke-4.919
versions-4.485
acters-4.188
onda-4.103
ggle-4.025
Token.
Token ID13
Feature activation-2.03e-01

Pos logit contributions
ourke+1.468
versions+1.346
acters+1.169
eca+1.150
onda+1.142
Neg logit contributions
theless-2.280
ーテ-1.760
ammu-1.732
 Lauder-1.663
 Gors-1.657
Tokenx
Token ID87
Feature activation+2.08e-01

Pos logit contributions
theless+2.452
ammu+1.864
emo+1.741
ayer+1.729
ーテ+1.724
Neg logit contributions
ourke-2.509
versions-2.302
acters-2.106
onda-2.063
eca-2.028
Token.
Token ID13
Feature activation-2.37e-01

Pos logit contributions
ourke+1.974
versions+1.817
acters+1.567
eca+1.552
onda+1.535
Neg logit contributions
theless-3.249
ーテ-2.527
ammu-2.470
 Lauder-2.374
 Gors-2.370
Token95
Token ID3865
Feature activation-6.63e-02

Pos logit contributions
theless+2.666
ammu+1.928
emo+1.838
ellen+1.810
ーテ+1.777
Neg logit contributions
ourke-3.182
versions-2.898
acters-2.679
onda-2.596
yg-2.582
Token 103
Token ID15349
Feature activation+1.22e+00

Pos logit contributions
theless+0.818
ammu+0.606
ーテ+0.579
 Lauder+0.570
 Gors+0.564
Neg logit contributions
ourke-0.822
versions-0.757
acters-0.684
onda-0.674
eca-0.668
Token16
Token ID1433
Feature activation+2.82e-01

Pos logit contributions
ourke+7.246
versions+6.857
eca+6.099
acters+5.893
ington+5.891
Neg logit contributions
theless-17.937
etheless-15.420
 eleph-15.389
 subur-15.237
 pione-14.730
Token53
Token ID4310
Feature activation-3.72e-01

Pos logit contributions
ourke+3.010
versions+2.768
acters+2.433
eca+2.429
onda+2.380
Neg logit contributions
theless-3.920
ーテ-3.002
ammu-2.981
 Lauder-2.825
 Gors-2.817
Token x
Token ID2124
Feature activation+1.24e-01

Pos logit contributions
theless+2.890
ammu+1.837
 Lauder+1.683
emo+1.617
ーテ+1.605
Neg logit contributions
ourke-6.101
versions-5.728
acters-5.323
onda-5.273
ggle-5.220
Token.
Token ID13
Feature activation-2.73e-01

Pos logit contributions
ourke+1.226
versions+1.121
acters+0.979
eca+0.958
onda+0.951
Neg logit contributions
theless-1.882
ーテ-1.443
ammu-1.435
 Lauder-1.373
 Gors-1.373
Tokenx
Token ID87
Feature activation+1.45e-01

Pos logit contributions
theless+3.174
ammu+2.419
emo+2.258
ayer+2.233
ellen+2.227
Neg logit contributions
ourke-3.425
versions-3.132
acters-2.883
onda-2.833
ggle-2.779
Token
Token ID198
Feature activation+5.30e-01

Pos logit contributions
ourke+3.345
versions+3.136
ington+2.650
acters+2.611
eca+2.593
Neg logit contributions
theless-5.761
ーテ-4.615
etheless-4.416
ammu-4.353
 Gors-4.198
Tokenin
Token ID259
Feature activation+8.33e-01

Pos logit contributions
ourke+5.670
versions+5.191
eca+4.517
ggle+4.455
ington+4.452
Neg logit contributions
theless-7.128
ーテ-5.491
ammu-5.267
 Gors-5.179
 Lauder-5.157
Token form
Token ID1296
Feature activation+4.06e-01

Pos logit contributions
ourke+8.362
versions+7.843
acters+6.788
ggle+6.592
yg+6.550
Neg logit contributions
theless-10.472
ammu-8.087
ーテ-8.031
 Gors-7.819
ellen-7.780
Token,
Token ID11
Feature activation+4.00e-01

Pos logit contributions
ourke+3.936
versions+3.594
eca+3.101
acters+3.087
ggle+3.068
Neg logit contributions
theless-5.773
ammu-4.568
ーテ-4.564
emo-4.373
 Gors-4.372
Token in
Token ID287
Feature activation+6.49e-01

Pos logit contributions
ourke+4.130
versions+3.777
acters+3.298
ggle+3.249
yg+3.201
Neg logit contributions
theless-5.525
ammu-4.300
ーテ-4.282
emo-4.117
ayer-4.050
Token moving
Token ID3867
Feature activation+1.22e+00

Pos logit contributions
ourke+6.323
versions+5.982
acters+5.167
ggle+5.051
yg+4.985
Neg logit contributions
theless-8.785
ammu-6.902
ーテ-6.894
emo-6.566
Berry-6.516
Token,
Token ID11
Feature activation+7.63e-01

Pos logit contributions
ourke+9.900
versions+9.835
ggle+8.539
 marrow+8.485
 pane+8.403
Neg logit contributions
theless-14.759
ーテ-12.330
ammu-11.720
emo-11.532
ellen-11.466
Token how
Token ID703
Feature activation+8.96e-01

Pos logit contributions
ourke+6.986
versions+6.364
ggle+5.462
acters+5.353
eca+5.318
Neg logit contributions
theless-10.461
ーテ-8.348
ammu-8.277
emo-8.017
Berry-7.993
Token express
Token ID4911
Feature activation+6.36e-01

Pos logit contributions
ourke+8.096
versions+7.870
ggle+6.922
 pane+6.754
ington+6.601
Neg logit contributions
theless-11.140
ーテ-9.226
ammu-9.042
ellen-8.508
ayer-8.496
Token an
Token ID281
Feature activation+1.13e+00

Pos logit contributions
ourke+6.109
versions+5.648
eca+4.832
ggle+4.828
acters+4.747
Neg logit contributions
theless-8.683
ーテ-6.951
ammu-6.879
emo-6.657
ellen-6.602
Token admirable
Token ID37959
Feature activation+8.43e-01

Pos logit contributions
ourke+10.406
versions+9.507
acters+8.326
eca+8.319
onda+8.315
Neg logit contributions
theless-14.052
ーテ-10.999
Berry-10.728
 Gors-10.704
 Lauder-10.620
Token Kn
Token ID6102
Feature activation+8.43e-02

Pos logit contributions
ourke+2.108
versions+1.929
onda+1.714
eca+1.705
acters+1.705
Neg logit contributions
theless-2.500
ammu-1.884
ーテ-1.826
 Lauder-1.777
ayer-1.771
Tokenob
Token ID672
Feature activation+8.14e-02

Pos logit contributions
ourke+0.728
versions+0.646
onda+0.552
acters+0.548
eca+0.532
Neg logit contributions
theless-1.378
ーテ-1.093
ammu-1.081
 Lauder-1.055
 Gors-1.043
Token
Token ID851
Feature activation+4.17e-02

Pos logit contributions
ourke+0.885
versions+0.808
acters+0.716
onda+0.708
eca+0.701
Neg logit contributions
theless-1.129
ammu-0.867
ーテ-0.839
 Lauder-0.820
ayer-0.816
Token which
Token ID543
Feature activation-4.33e-02

Pos logit contributions
ourke+0.471
versions+0.433
acters+0.385
onda+0.380
eca+0.376
Neg logit contributions
theless-0.558
ammu-0.426
ーテ-0.410
 Lauder-0.401
emo-0.400
Token appears
Token ID3568
Feature activation-8.81e-03

Pos logit contributions
theless+0.575
ammu+0.437
ーテ+0.420
 Lauder+0.413
ayer+0.410
Neg logit contributions
ourke-0.497
versions-0.455
acters-0.407
onda-0.400
eca-0.396
Token rec
Token ID664
Feature activation+1.18e+00

Pos logit contributions
theless+0.118
ammu+0.090
ーテ+0.087
 Lauder+0.085
ayer+0.085
Neg logit contributions
ourke-0.100
versions-0.091
acters-0.081
onda-0.080
eca-0.079
Tokenessed
Token ID6676
Feature activation+8.10e-02

Pos logit contributions
ourke+9.421
versions+8.736
onda+8.290
onde+7.932
yg+7.668
Neg logit contributions
theless-16.472
ーテ-13.481
ammu-12.768
Berry-12.709
 Gors-12.372
Token into
Token ID656
Feature activation+3.14e-01

Pos logit contributions
ourke+0.877
versions+0.805
acters+0.712
onda+0.700
eca+0.694
Neg logit contributions
theless-1.124
ammu-0.865
ーテ-0.835
 Lauder-0.818
ayer-0.812
Token a
Token ID257
Feature activation+3.10e-01

Pos logit contributions
ourke+3.187
versions+2.998
onda+2.546
acters+2.535
eca+2.534
Neg logit contributions
theless-4.465
ammu-3.463
ーテ-3.371
emo-3.291
ellen-3.256
Token hill
Token ID12788
Feature activation-1.16e-01

Pos logit contributions
ourke+3.371
versions+3.161
onda+2.705
eca+2.702
acters+2.682
Neg logit contributions
theless-4.177
ammu-3.230
ーテ-3.135
emo-3.033
ayer-3.005
Token
Token ID851
Feature activation+3.58e-01

Pos logit contributions
theless+1.655
ammu+1.295
ーテ+1.246
ayer+1.225
 Lauder+1.221
Neg logit contributions
ourke-1.209
versions-1.094
acters-0.970
onda-0.946
eca-0.940
Token in
Token ID287
Feature activation+6.49e-01

Pos logit contributions
ourke+4.130
versions+3.777
acters+3.298
ggle+3.249
yg+3.201
Neg logit contributions
theless-5.525
ammu-4.300
ーテ-4.282
emo-4.117
ayer-4.050
Token moving
Token ID3867
Feature activation+1.22e+00

Pos logit contributions
ourke+6.323
versions+5.982
acters+5.167
ggle+5.051
yg+4.985
Neg logit contributions
theless-8.785
ammu-6.902
ーテ-6.894
emo-6.566
Berry-6.516
Token,
Token ID11
Feature activation+7.63e-01

Pos logit contributions
ourke+9.900
versions+9.835
ggle+8.539
 marrow+8.485
 pane+8.403
Neg logit contributions
theless-14.759
ーテ-12.330
ammu-11.720
emo-11.532
ellen-11.466
Token how
Token ID703
Feature activation+8.96e-01

Pos logit contributions
ourke+6.986
versions+6.364
ggle+5.462
acters+5.353
eca+5.318
Neg logit contributions
theless-10.461
ーテ-8.348
ammu-8.277
emo-8.017
Berry-7.993
Token express
Token ID4911
Feature activation+6.36e-01

Pos logit contributions
ourke+8.096
versions+7.870
ggle+6.922
 pane+6.754
ington+6.601
Neg logit contributions
theless-11.140
ーテ-9.226
ammu-9.042
ellen-8.508
ayer-8.496
Token an
Token ID281
Feature activation+1.13e+00

Pos logit contributions
ourke+6.109
versions+5.648
eca+4.832
ggle+4.828
acters+4.747
Neg logit contributions
theless-8.683
ーテ-6.951
ammu-6.879
emo-6.657
ellen-6.602
Token admirable
Token ID37959
Feature activation+8.43e-01

Pos logit contributions
ourke+10.406
versions+9.507
acters+8.326
eca+8.319
onda+8.315
Neg logit contributions
theless-14.052
ーテ-10.999
Berry-10.728
 Gors-10.704
 Lauder-10.620
Token!
Token ID0
Feature activation+5.10e-01

Pos logit contributions
ourke+7.979
versions+7.602
acters+6.504
ggle+6.450
 pane+6.429
Neg logit contributions
theless-10.895
ーテ-8.776
ammu-8.455
emo-8.196
ellen-8.096
Token in
Token ID287
Feature activation+7.79e-01

Pos logit contributions
ourke+5.260
versions+4.748
acters+4.148
ggle+4.137
eca+4.136
Neg logit contributions
theless-7.008
ammu-5.400
ーテ-5.336
emo-5.189
ayer-5.149
Token action
Token ID2223
Feature activation+3.52e-01

Pos logit contributions
ourke+7.462
versions+7.117
acters+6.081
ggle+6.034
 marrow+5.894
Neg logit contributions
theless-10.239
ーテ-8.108
ammu-7.976
emo-7.715
Berry-7.678
Token how
Token ID703
Feature activation+9.91e-01

Pos logit contributions
ourke+3.389
versions+3.102
acters+2.678
ggle+2.675
eca+2.656
Neg logit contributions
theless-5.088
ammu-4.032
ーテ-4.017
emo-3.859
 Gors-3.827
Tokenest
Token ID395
Feature activation-3.78e-01

Pos logit contributions
ourke+6.475
versions+5.713
eca+5.229
onda+5.057
ington+4.978
Neg logit contributions
theless-10.690
ーテ-8.826
 Lauder-8.362
ammu-8.151
 Gors-8.123
Tokenical
Token ID605
Feature activation+4.96e-01

Pos logit contributions
theless+3.608
ammu+2.476
ayer+2.398
emo+2.357
ellen+2.304
Neg logit contributions
ourke-5.390
versions-5.173
acters-4.769
onda-4.648
ggle-4.550
Token roof
Token ID9753
Feature activation+3.69e-01

Pos logit contributions
ourke+5.269
versions+4.851
onda+4.313
eca+4.242
ggle+4.199
Neg logit contributions
theless-6.528
ammu-5.000
ーテ-4.967
emo-4.734
ellen-4.732
Token fre
Token ID2030
Feature activation+4.88e-01

Pos logit contributions
ourke+3.701
versions+3.361
ggle+2.926
eca+2.910
onda+2.893
Neg logit contributions
theless-5.266
ammu-4.094
ーテ-4.072
 Lauder-3.957
 Gors-3.917
Tokentted
Token ID28734
Feature activation+6.87e-01

Pos logit contributions
ourke+5.596
versions+5.251
ggle+4.692
onda+4.637
eca+4.593
Neg logit contributions
theless-6.179
ーテ-4.827
ammu-4.652
 Lauder-4.490
 Gors-4.430
Token with
Token ID351
Feature activation+1.13e+00

Pos logit contributions
ourke+6.295
versions+5.704
ggle+5.083
yg+5.001
ington+4.938
Neg logit contributions
theless-9.470
ーテ-7.646
ammu-7.453
ellen-7.120
 Lauder-7.105
Token golden
Token ID10861
Feature activation+9.20e-01

Pos logit contributions
ourke+10.419
versions+10.020
 marrow+8.724
 pane+8.703
yg+8.572
Neg logit contributions
theless-13.474
ーテ-10.658
ammu-10.474
ellen-10.012
emo-9.688
Token fire
Token ID2046
Feature activation+7.11e-01

Pos logit contributions
ourke+8.730
versions+8.246
 pane+7.333
onda+7.240
acters+7.229
Neg logit contributions
theless-11.545
ammu-8.854
ーテ-8.751
emo-8.382
ellen-8.341
Token,
Token ID11
Feature activation+3.31e-01

Pos logit contributions
ourke+6.458
versions+6.151
onda+5.303
acters+5.270
ggle+5.234
Neg logit contributions
theless-9.944
ーテ-7.921
ammu-7.740
emo-7.433
 Lauder-7.404
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
ourke+3.586
versions+3.251
acters+2.866
onda+2.843
ggle+2.837
Neg logit contributions
theless-4.540
ammu-3.479
ーテ-3.409
emo-3.298
 Lauder-3.297
Token
Token ID198
Feature activation+1.65e-01

Pos logit contributions
ourke+3.275
versions+3.071
ington+2.591
acters+2.569
eca+2.545
Neg logit contributions
theless-5.601
ーテ-4.521
etheless-4.281
ammu-4.245
 Gors-4.098
Tokenx
Token ID87
Feature activation+7.79e-02

Pos logit contributions
theless+4.227
ammu+3.250
emo+3.051
ellen+3.015
ayer+2.995
Neg logit contributions
ourke-5.058
versions-4.622
acters-4.302
onda-4.232
ggle-4.148
Token.
Token ID13
Feature activation-3.28e-01

Pos logit contributions
ourke+0.793
versions+0.727
acters+0.637
onda+0.622
eca+0.622
Neg logit contributions
theless-1.155
ーテ-0.883
ammu-0.880
 Lauder-0.843
 Gors-0.839
Tokenx
Token ID87
Feature activation-8.67e-03

Pos logit contributions
theless+3.679
ammu+2.830
emo+2.651
ayer+2.623
ellen+2.613
Neg logit contributions
ourke-4.163
versions-3.804
acters-3.533
onda-3.456
ggle-3.392
Token.
Token ID13
Feature activation-3.98e-01

Pos logit contributions
theless+0.125
ammu+0.095
ーテ+0.094
 Lauder+0.091
 Gors+0.090
Neg logit contributions
ourke-0.091
versions-0.084
acters-0.074
onda-0.072
eca-0.072
Token135
Token ID17059
Feature activation-9.09e-03

Pos logit contributions
theless+4.731
ammu+3.652
emo+3.451
ellen+3.311
ーテ+3.295
Neg logit contributions
ourke-4.933
versions-4.392
acters-4.102
onda-4.001
eca-3.918
Token 106
Token ID15696
Feature activation+1.12e+00

Pos logit contributions
theless+0.116
ammu+0.088
ーテ+0.084
 Lauder+0.082
emo+0.082
Neg logit contributions
ourke-0.108
versions-0.100
acters-0.090
onda-0.088
eca-0.087
Token19
Token ID1129
Feature activation+2.22e-01

Pos logit contributions
ourke+6.460
versions+6.350
eca+5.608
ington+5.321
yg+5.171
Neg logit contributions
theless-17.273
etheless-14.720
 eleph-14.567
 subur-14.451
 pione-13.937
Token41
Token ID3901
Feature activation-4.96e-01

Pos logit contributions
ourke+2.293
versions+2.094
acters+1.839
eca+1.824
onda+1.799
Neg logit contributions
theless-3.187
ーテ-2.453
ammu-2.438
 Lauder-2.325
ayer-2.320
Token x
Token ID2124
Feature activation-3.45e-03

Pos logit contributions
theless+3.366
ammu+2.071
 Lauder+1.889
emo+1.825
 Gors+1.812
Neg logit contributions
ourke-8.356
versions-7.894
acters-7.377
onda-7.280
ggle-7.204
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
theless+0.050
ammu+0.038
ーテ+0.037
 Lauder+0.036
 Gors+0.036
Neg logit contributions
ourke-0.036
versions-0.033
acters-0.029
onda-0.029
eca-0.028
Tokenx
Token ID87
Feature activation+1.21e-01

Pos logit contributions
theless+3.697
ammu+2.833
emo+2.647
ellen+2.614
ayer+2.610
Neg logit contributions
ourke-4.214
versions-3.857
acters-3.567
onda-3.510
ggle-3.443
Token by
Token ID416
Feature activation+4.91e-02

Pos logit contributions
ourke+1.702
versions+1.559
acters+1.393
onda+1.370
eca+1.354
Neg logit contributions
theless-2.048
ammu-1.565
ーテ-1.504
 Lauder-1.473
emo-1.466
Token bi
Token ID3182
Feature activation+2.27e-01

Pos logit contributions
ourke+0.567
versions+0.520
acters+0.465
onda+0.459
eca+0.454
Neg logit contributions
theless-0.646
ammu-0.491
ーテ-0.473
emo-0.460
 Lauder-0.460
Tokenotechnology
Token ID31201
Feature activation+2.69e-01

Pos logit contributions
ourke+2.861
versions+2.663
onda+2.413
acters+2.400
eca+2.381
Neg logit contributions
theless-2.752
ammu-1.981
ーテ-1.926
 Lauder-1.847
 Gors-1.840
Token critic
Token ID4014
Feature activation+2.72e-01

Pos logit contributions
ourke+2.922
versions+2.683
acters+2.371
onda+2.351
eca+2.322
Neg logit contributions
theless-3.697
ammu-2.805
ーテ-2.704
emo-2.654
ellen-2.626
Token Jeremy
Token ID11753
Feature activation+1.74e-01

Pos logit contributions
ourke+3.265
versions+3.031
onda+2.719
eca+2.699
acters+2.687
Neg logit contributions
theless-3.425
ammu-2.550
ーテ-2.461
emo-2.360
 Lauder-2.322
Token R
Token ID371
Feature activation+1.11e+00

Pos logit contributions
ourke+2.136
versions+1.982
acters+1.770
onda+1.759
eca+1.753
Neg logit contributions
theless-2.165
ammu-1.595
ーテ-1.533
emo-1.483
ayer-1.467
Tokenif
Token ID361
Feature activation+2.62e-01

Pos logit contributions
ourke+12.687
versions+10.278
onda+9.994
onde+9.767
ington+9.571
Neg logit contributions
theless-13.940
ーテ-11.041
ammu-9.585
 Lauder-9.562
etheless-9.544
Tokenkin
Token ID5116
Feature activation+2.99e-01

Pos logit contributions
ourke+2.515
versions+2.241
onda+1.950
acters+1.945
eca+1.928
Neg logit contributions
theless-3.964
ammu-3.114
ーテ-3.088
 Lauder-2.981
 Gors-2.964
Token,
Token ID11
Feature activation+1.33e-01

Pos logit contributions
ourke+3.063
versions+2.761
eca+2.410
onda+2.407
acters+2.398
Neg logit contributions
theless-4.298
ammu-3.327
ーテ-3.238
emo-3.135
 Lauder-3.133
Token protest
Token ID5402
Feature activation+1.09e-01

Pos logit contributions
ourke+1.513
versions+1.394
acters+1.235
onda+1.227
eca+1.216
Neg logit contributions
theless-1.772
ammu-1.346
ーテ-1.314
emo-1.269
 Lauder-1.264
Token the
Token ID262
Feature activation-1.39e-02

Pos logit contributions
ourke+1.177
versions+1.077
acters+0.951
onda+0.941
eca+0.932
Neg logit contributions
theless-1.529
ammu-1.179
ーテ-1.142
 Lauder-1.109
emo-1.108
Token favorite
Token ID4004
Feature activation+6.20e-03

Pos logit contributions
ourke+4.324
versions+4.064
onda+3.396
ggle+3.166
eca+3.103
Neg logit contributions
theless-8.816
ーテ-7.162
ammu-6.787
etheless-6.587
 Lauder-6.520
Token favorite
Token ID4004
Feature activation+6.79e-03

Pos logit contributions
ourke+0.065
versions+0.060
acters+0.053
onda+0.052
eca+0.051
Neg logit contributions
theless-0.088
ammu-0.068
ーテ-0.066
 Lauder-0.064
emo-0.064
Token favorite
Token ID4004
Feature activation-1.06e-01

Pos logit contributions
ourke+0.072
versions+0.066
acters+0.058
onda+0.057
eca+0.056
Neg logit contributions
theless-0.096
ammu-0.074
ーテ-0.073
 Lauder-0.070
emo-0.070
Token favorite
Token ID4004
Feature activation-1.63e-01

Pos logit contributions
theless+1.434
ammu+1.101
ーテ+1.054
 Lauder+1.040
emo+1.035
Neg logit contributions
ourke-1.175
versions-1.070
acters-0.960
onda-0.937
eca-0.931
Token favorite
Token ID4004
Feature activation-2.12e-01

Pos logit contributions
theless+2.023
ammu+1.524
ーテ+1.441
ayer+1.439
emo+1.431
Neg logit contributions
ourke-1.969
versions-1.792
acters-1.639
eca-1.592
ggle-1.585
Token -
Token ID532
Feature activation+1.11e+00

Pos logit contributions
theless+2.393
ammu+1.771
ーテ+1.672
ayer+1.670
ellen+1.656
Neg logit contributions
ourke-2.771
versions-2.524
acters-2.319
eca-2.269
ggle-2.263
Token December
Token ID3426
Feature activation+5.18e-01

Pos logit contributions
versions+5.641
ourke+5.535
ington+4.643
 ASC+4.408
zu+4.112
Neg logit contributions
theless-17.415
ーテ-14.708
ammu-14.572
etheless-14.226
emo-13.992
Token 5
Token ID642
Feature activation-1.46e-01

Pos logit contributions
ourke+4.317
versions+4.159
ington+3.513
acters+3.419
eca+3.399
Neg logit contributions
theless-7.986
ーテ-6.387
etheless-6.231
ammu-6.194
emo-5.959
Token,
Token ID11
Feature activation+3.07e-01

Pos logit contributions
theless+1.990
ammu+1.540
ーテ+1.467
 Lauder+1.453
emo+1.448
Neg logit contributions
ourke-1.613
versions-1.470
acters-1.311
onda-1.289
ggle-1.271
Token 2009
Token ID3717
Feature activation-2.05e-01

Pos logit contributions
ourke+3.471
versions+3.196
acters+2.864
onda+2.848
eca+2.780
Neg logit contributions
theless-4.085
ammu-3.086
ーテ-2.983
 Lauder-2.931
ellen-2.894
Token
Token ID198
Feature activation+1.86e-01

Pos logit contributions
theless+2.537
ammu+1.922
 Lauder+1.799
ーテ+1.793
ayer+1.788
Neg logit contributions
ourke-2.501
versions-2.287
acters-2.078
onda-2.049
eca-2.015
Tokenx
Token ID87
Feature activation+1.65e-01

Pos logit contributions
theless+4.077
ammu+3.124
emo+2.924
ellen+2.884
ayer+2.873
Neg logit contributions
ourke-4.713
versions-4.295
acters-4.000
onda-3.931
ggle-3.848
Token.
Token ID13
Feature activation-2.95e-01

Pos logit contributions
ourke+1.606
versions+1.474
acters+1.282
eca+1.260
onda+1.248
Neg logit contributions
theless-2.529
ーテ-1.955
ammu-1.922
 Lauder-1.846
 Gors-1.841
Tokenx
Token ID87
Feature activation+1.50e-01

Pos logit contributions
theless+3.416
ammu+2.638
emo+2.484
ellen+2.452
ayer+2.449
Neg logit contributions
ourke-3.667
versions-3.355
acters-3.099
onda-3.013
ggle-2.957
Token.
Token ID13
Feature activation-1.60e-01

Pos logit contributions
ourke+1.482
versions+1.359
acters+1.190
onda+1.171
eca+1.164
Neg logit contributions
theless-2.285
ーテ-1.758
ammu-1.733
 Lauder-1.675
 Gors-1.670
Token91
Token ID6420
Feature activation-2.67e-01

Pos logit contributions
theless+1.905
ammu+1.433
emo+1.334
ayer+1.323
ーテ+1.321
Neg logit contributions
ourke-2.029
versions-1.856
acters-1.689
onda-1.649
eca-1.643
Token 106
Token ID15696
Feature activation+1.09e+00

Pos logit contributions
theless+3.248
ammu+2.389
ーテ+2.269
 Lauder+2.252
 Gors+2.235
Neg logit contributions
ourke-3.333
versions-3.039
acters-2.757
onda-2.707
ggle-2.701
Token99
Token ID2079
Feature activation+8.47e-02

Pos logit contributions
ourke+6.900
versions+6.691
eca+5.886
ington+5.598
acters+5.494
Neg logit contributions
theless-16.212
etheless-13.379
 eleph-12.982
ayer-12.953
ammu-12.944
Token63
Token ID5066
Feature activation-2.90e-01

Pos logit contributions
ourke+0.945
versions+0.864
acters+0.773
eca+0.757
onda+0.756
Neg logit contributions
theless-1.149
ammu-0.877
ーテ-0.852
 Lauder-0.826
ayer-0.823
Token x
Token ID2124
Feature activation+6.01e-02

Pos logit contributions
theless+2.354
ammu+1.500
 Lauder+1.378
ーテ+1.352
 Gors+1.332
Neg logit contributions
ourke-4.701
versions-4.417
acters-4.113
onda-4.064
ggle-4.019
Token.
Token ID13
Feature activation-2.90e-01

Pos logit contributions
ourke+0.613
versions+0.561
acters+0.492
eca+0.481
onda+0.480
Neg logit contributions
theless-0.889
ammu-0.680
ーテ-0.676
 Lauder-0.650
 Gors-0.648
Tokenx
Token ID87
Feature activation+1.15e-01

Pos logit contributions
theless+3.353
ammu+2.553
emo+2.385
ayer+2.352
ellen+2.351
Neg logit contributions
ourke-3.651
versions-3.341
acters-3.081
onda-3.024
ggle-2.971
Token8
Token ID23
Feature activation+1.49e-01

Pos logit contributions
ourke+0.200
versions+0.184
acters+0.164
onda+0.161
eca+0.160
Neg logit contributions
theless-0.240
ammu-0.183
ーテ-0.177
 Lauder-0.174
 Gors-0.172
Token341
Token ID33660
Feature activation+1.04e+00

Pos logit contributions
ourke+1.573
versions+1.442
acters+1.279
eca+1.267
onda+1.259
Neg logit contributions
theless-2.092
ammu-1.616
ーテ-1.584
 Lauder-1.546
 Gors-1.525
Tokenc
Token ID66
Feature activation+2.49e-01

Pos logit contributions
ourke+8.995
versions+8.574
eca+8.329
onda+7.628
acters+7.600
Neg logit contributions
theless-13.390
ーテ-10.820
 Lauder-10.465
ammu-10.231
 Gors-9.948
Token4
Token ID19
Feature activation+3.85e-01

Pos logit contributions
ourke+2.728
versions+2.516
acters+2.264
eca+2.253
onda+2.234
Neg logit contributions
theless-3.369
ammu-2.562
ーテ-2.528
 Lauder-2.473
 Gors-2.444
Tokenfe
Token ID5036
Feature activation+5.16e-01

Pos logit contributions
ourke+3.914
versions+3.634
eca+3.298
acters+3.210
onda+3.191
Neg logit contributions
theless-5.343
ammu-4.146
ーテ-4.133
 Lauder-4.031
 Gors-3.928
Token353
Token ID33319
Feature activation+1.09e+00

Pos logit contributions
ourke+5.372
versions+4.987
eca+4.633
acters+4.498
onda+4.463
Neg logit contributions
theless-6.880
ーテ-5.315
ammu-5.267
 Lauder-5.109
 Gors-5.095
Tokenef
Token ID891
Feature activation+6.70e-01

Pos logit contributions
ourke+9.281
versions+8.940
eca+8.747
onda+7.966
acters+7.821
Neg logit contributions
theless-13.607
ーテ-11.345
 Lauder-10.895
ammu-10.567
 Gors-10.412
Token015
Token ID25150
Feature activation+9.20e-01

Pos logit contributions
ourke+6.761
versions+6.309
eca+5.972
onda+5.707
acters+5.702
Neg logit contributions
theless-8.750
ーテ-6.879
ammu-6.675
 Lauder-6.560
 Gors-6.508
Token390
Token ID25964
Feature activation+6.78e-01

Pos logit contributions
ourke+8.457
versions+8.000
eca+7.714
onda+7.106
acters+7.059
Neg logit contributions
theless-11.881
ーテ-9.625
 Lauder-9.201
ammu-9.118
 Gors-8.959
Tokenb
Token ID65
Feature activation+4.35e-01

Pos logit contributions
ourke+6.430
versions+6.070
eca+5.645
acters+5.367
onda+5.307
Neg logit contributions
theless-9.238
ーテ-7.388
 Lauder-7.112
ammu-7.109
 Gors-6.916
Token0
Token ID15
Feature activation+5.86e-01

Pos logit contributions
ourke+4.637
versions+4.318
eca+3.917
acters+3.884
onda+3.817
Neg logit contributions
theless-5.817
ーテ-4.473
ammu-4.439
 Lauder-4.348
 Gors-4.277
Token brute
Token ID33908
Feature activation-1.74e-01

Pos logit contributions
ourke+3.248
versions+3.027
onda+2.660
acters+2.639
eca+2.611
Neg logit contributions
theless-4.046
ammu-3.061
ーテ-3.052
emo-2.880
ayer-2.879
Token grew
Token ID6348
Feature activation+4.15e-01

Pos logit contributions
theless+2.268
ammu+1.719
 Lauder+1.632
ーテ+1.629
 Gors+1.622
Neg logit contributions
ourke-2.035
versions-1.853
acters-1.676
onda-1.636
eca-1.614
Token hard
Token ID1327
Feature activation+2.18e-01

Pos logit contributions
ourke+4.029
versions+3.754
ggle+3.291
onda+3.249
acters+3.211
Neg logit contributions
theless-5.927
ーテ-4.623
ammu-4.601
ellen-4.354
ayer-4.312
Token panels
Token ID13043
Feature activation+1.21e-01

Pos logit contributions
ourke+2.258
versions+2.058
acters+1.804
onda+1.789
ggle+1.768
Neg logit contributions
theless-3.098
ammu-2.417
ーテ-2.355
 Lauder-2.292
emo-2.276
Token of
Token ID286
Feature activation+3.80e-01

Pos logit contributions
ourke+1.262
versions+1.151
acters+1.014
onda+1.000
eca+0.991
Neg logit contributions
theless-1.712
ammu-1.326
ーテ-1.293
 Lauder-1.264
emo-1.247
Token imp
Token ID848
Feature activation+1.08e+00

Pos logit contributions
ourke+4.050
versions+3.714
onda+3.244
ggle+3.235
yg+3.225
Neg logit contributions
theless-5.183
ammu-3.980
ーテ-3.919
ellen-3.743
emo-3.735
Tokenossibly
Token ID20846
Feature activation+2.44e-01

Pos logit contributions
ourke+9.022
versions+8.203
usable+7.560
eca+7.558
onda+7.478
Neg logit contributions
theless-14.385
ーテ-11.899
ammu-11.272
 Gors-11.152
-10.983
Token thick
Token ID6546
Feature activation+2.80e-01

Pos logit contributions
ourke+2.640
versions+2.454
onda+2.145
acters+2.140
ggle+2.124
Neg logit contributions
theless-3.318
ammu-2.546
ーテ-2.489
 Lauder-2.395
emo-2.391
Token bone
Token ID9970
Feature activation+3.29e-01

Pos logit contributions
ourke+3.084
versions+2.829
acters+2.506
onda+2.494
eca+2.468
Neg logit contributions
theless-3.767
ammu-2.868
ーテ-2.784
 Lauder-2.712
ellen-2.697
Token all
Token ID477
Feature activation+2.29e-01

Pos logit contributions
ourke+3.313
versions+3.020
acters+2.669
onda+2.592
ggle+2.581
Neg logit contributions
theless-4.717
ammu-3.640
ーテ-3.586
 Lauder-3.497
emo-3.455
Token over
Token ID625
Feature activation-3.12e-01

Pos logit contributions
ourke+2.251
versions+2.076
acters+1.784
onda+1.755
ggle+1.755
Neg logit contributions
theless-3.338
ammu-2.631
ーテ-2.546
 Lauder-2.503
ellen-2.480
Tokenx
Token ID87
Feature activation+1.18e-01

Pos logit contributions
theless+4.483
ammu+3.476
emo+3.250
ellen+3.204
ayer+3.187
Neg logit contributions
ourke-5.527
versions-5.045
acters-4.705
onda-4.633
 defe-4.571
Token.
Token ID13
Feature activation-2.45e-01

Pos logit contributions
ourke+1.179
versions+1.080
acters+0.941
eca+0.923
onda+0.921
Neg logit contributions
theless-1.782
ーテ-1.373
ammu-1.355
 Lauder-1.301
 Gors-1.296
Tokenx
Token ID87
Feature activation-5.49e-02

Pos logit contributions
theless+2.871
ammu+2.190
emo+2.043
ayer+2.031
ellen+2.024
Neg logit contributions
ourke-3.055
versions-2.801
acters-2.573
onda-2.522
ggle-2.481
Token.
Token ID13
Feature activation-5.94e-01

Pos logit contributions
theless+0.766
ammu+0.589
ーテ+0.571
 Lauder+0.558
emo+0.554
Neg logit contributions
ourke-0.594
versions-0.543
acters-0.481
onda-0.473
eca-0.468
Token173
Token ID25399
Feature activation+5.07e-02

Pos logit contributions
theless+5.635
ammu+4.226
ayer+3.894
emo+3.620
ellen+3.614
Neg logit contributions
ourke-8.311
versions-7.417
acters-7.375
onda-7.030
ggle-6.967
Token 107
Token ID16226
Feature activation+1.08e+00

Pos logit contributions
ourke+0.579
versions+0.534
acters+0.477
onda+0.470
eca+0.465
Neg logit contributions
theless-0.672
ammu-0.511
ーテ-0.494
ayer-0.480
 Lauder-0.479
Token10
Token ID940
Feature activation+1.42e-01

Pos logit contributions
ourke+7.201
versions+6.820
eca+6.169
acters+5.993
ington+5.934
Neg logit contributions
theless-16.004
etheless-13.506
 eleph-13.235
 subur-13.027
ーテ-12.732
Token52
Token ID4309
Feature activation-5.49e-01

Pos logit contributions
ourke+1.578
versions+1.453
acters+1.289
eca+1.276
onda+1.262
Neg logit contributions
theless-1.937
ammu-1.459
ーテ-1.447
 Lauder-1.387
ayer-1.378
Token x
Token ID2124
Feature activation+8.30e-02

Pos logit contributions
theless+3.492
ammu+2.128
 Lauder+1.928
ellen+1.834
emo+1.833
Neg logit contributions
ourke-9.338
versions-8.775
acters-8.215
onda-8.160
ggle-8.088
Token.
Token ID13
Feature activation-3.15e-01

Pos logit contributions
ourke+0.839
versions+0.768
acters+0.673
eca+0.656
onda+0.655
Neg logit contributions
theless-1.238
ammu-0.945
ーテ-0.944
 Lauder-0.903
 Gors-0.901
Tokenx
Token ID87
Feature activation+9.91e-02

Pos logit contributions
theless+3.551
ammu+2.720
emo+2.538
ellen+2.500
ayer+2.495
Neg logit contributions
ourke-3.996
versions-3.658
acters-3.376
onda-3.325
ggle-3.272
Tokenx
Token ID87
Feature activation-1.77e-02

Pos logit contributions
theless+2.267
ammu+1.710
ーテ+1.598
emo+1.571
 Lauder+1.568
Neg logit contributions
ourke-2.414
versions-2.233
acters-2.034
onda-1.991
ggle-1.960
Token.
Token ID13
Feature activation-2.21e-01

Pos logit contributions
theless+0.253
ammu+0.194
ーテ+0.190
 Lauder+0.184
 Gors+0.183
Neg logit contributions
ourke-0.187
versions-0.171
acters-0.152
onda-0.148
eca-0.148
Tokenx
Token ID87
Feature activation+7.24e-02

Pos logit contributions
theless+2.567
ammu+1.940
ーテ+1.826
emo+1.797
ellen+1.761
Neg logit contributions
ourke-2.794
versions-2.577
acters-2.358
onda-2.294
ggle-2.266
Token.
Token ID13
Feature activation-1.52e-01

Pos logit contributions
ourke+0.738
versions+0.675
acters+0.595
eca+0.584
onda+0.582
Neg logit contributions
theless-1.066
ammu-0.817
ーテ-0.804
 Lauder-0.779
ayer-0.777
Token80
Token ID1795
Feature activation-1.47e-01

Pos logit contributions
theless+2.009
ammu+1.541
ーテ+1.480
emo+1.465
ellen+1.437
Neg logit contributions
ourke-1.731
versions-1.577
acters-1.425
onda-1.389
ggle-1.370
Token 106
Token ID15696
Feature activation+1.07e+00

Pos logit contributions
theless+1.775
ammu+1.318
ーテ+1.244
 Lauder+1.232
 Gors+1.225
Neg logit contributions
ourke-1.848
versions-1.694
acters-1.530
onda-1.505
ggle-1.498
Token27
Token ID1983
Feature activation+1.29e-01

Pos logit contributions
ourke+7.609
versions+7.147
eca+6.483
acters+6.153
ington+6.060
Neg logit contributions
theless-15.321
ammu-12.076
etheless-12.033
ayer-11.904
ーテ-11.698
Token66
Token ID2791
Feature activation-9.13e-02

Pos logit contributions
ourke+1.423
versions+1.302
acters+1.164
eca+1.142
onda+1.138
Neg logit contributions
theless-1.767
ammu-1.354
ーテ-1.312
 Lauder-1.271
ayer-1.270
Token x
Token ID2124
Feature activation+7.50e-02

Pos logit contributions
theless+0.778
ammu+0.498
ーテ+0.463
 Lauder+0.451
 Gors+0.443
Neg logit contributions
ourke-1.470
versions-1.383
acters-1.279
onda-1.261
ggle-1.249
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
ourke+0.758
versions+0.691
acters+0.609
eca+0.594
onda+0.592
Neg logit contributions
theless-1.119
ammu-0.856
ーテ-0.849
 Lauder-0.817
 Gors-0.816
Tokenx
Token ID87
Feature activation+2.31e-02

Pos logit contributions
theless+3.156
ammu+2.399
emo+2.235
ayer+2.215
ellen+2.205
Neg logit contributions
ourke-3.269
versions-2.995
acters-2.740
onda-2.690
ggle-2.645
Token
Token ID198
Feature activation-2.00e-01

Pos logit contributions
theless+1.810
ammu+1.373
ーテ+1.298
 Lauder+1.287
emo+1.282
Neg logit contributions
ourke-1.700
versions-1.558
acters-1.405
onda-1.387
eca-1.367
Token+
Token ID10
Feature activation-1.34e-01

Pos logit contributions
theless+2.562
ammu+2.017
ーテ+1.885
emo+1.863
 Lauder+1.848
Neg logit contributions
ourke-2.292
versions-2.095
acters-1.882
onda-1.870
eca-1.828
Token Natural
Token ID12068
Feature activation-3.02e-01

Pos logit contributions
theless+1.723
ammu+1.306
ーテ+1.249
 Lauder+1.239
 Gors+1.227
Neg logit contributions
ourke-1.579
versions-1.440
acters-1.307
onda-1.283
eca-1.261
Token leader
Token ID3554
Feature activation+6.80e-02

Pos logit contributions
theless+4.025
ammu+3.108
 Lauder+2.948
ーテ+2.932
ayer+2.926
Neg logit contributions
ourke-3.371
versions-3.053
acters-2.790
onda-2.712
ggle-2.658
Token,
Token ID11
Feature activation-3.13e-01

Pos logit contributions
ourke+0.736
versions+0.672
acters+0.597
onda+0.586
eca+0.580
Neg logit contributions
theless-0.946
ammu-0.730
ーテ-0.704
 Lauder-0.690
emo-0.686
Token Bon
Token ID7979
Feature activation+1.06e+00

Pos logit contributions
theless+4.093
ammu+3.125
 Lauder+2.979
ーテ+2.963
 Gors+2.950
Neg logit contributions
ourke-3.575
versions-3.233
acters-2.977
onda-2.889
ggle-2.840
Token v
Token ID410
Feature activation-1.50e-03

Pos logit contributions
ourke+11.085
versions+10.977
ington+9.914
yg+9.899
zie+9.765
Neg logit contributions
theless-12.398
ーテ-8.936
 Lauder-8.493
 Gors-8.367
-8.327
Tokenive
Token ID425
Feature activation-2.52e-01

Pos logit contributions
theless+0.014
ammu+0.009
ーテ+0.008
 Lauder+0.008
emo+0.008
Neg logit contributions
ourke-0.024
versions-0.022
acters-0.021
onda-0.020
eca-0.020
Tokenur
Token ID333
Feature activation-3.92e-01

Pos logit contributions
theless+3.319
ammu+2.476
 Lauder+2.380
ーテ+2.356
ayer+2.337
Neg logit contributions
ourke-2.910
versions-2.674
acters-2.428
eca-2.368
onda-2.354
Token,
Token ID11
Feature activation-3.09e-01

Pos logit contributions
theless+5.003
ammu+3.787
 Lauder+3.591
ayer+3.590
ーテ+3.523
Neg logit contributions
ourke-4.503
versions-4.140
acters-3.764
onda-3.668
ggle-3.637
Token Char
Token ID3178
Feature activation-1.03e-01

Pos logit contributions
theless+4.001
ammu+3.053
 Lauder+2.916
 Gors+2.889
ーテ+2.887
Neg logit contributions
ourke-3.550
versions-3.216
acters-2.965
onda-2.872
ggle-2.834
Token the
Token ID262
Feature activation-2.32e-01

Pos logit contributions
ourke+1.458
versions+1.342
acters+1.185
onda+1.177
eca+1.167
Neg logit contributions
theless-1.751
ammu-1.341
ーテ-1.297
emo-1.260
ellen-1.248
Token National
Token ID2351
Feature activation-5.61e-01

Pos logit contributions
theless+2.948
ammu+2.215
 Lauder+2.150
ayer+2.109
ーテ+2.102
Neg logit contributions
ourke-2.757
versions-2.518
acters-2.317
onda-2.242
ggle-2.216
Token Council
Token ID4281
Feature activation-2.42e-01

Pos logit contributions
theless+6.739
 Lauder+5.258
ammu+5.210
 Gors+5.124
ayer+4.995
Neg logit contributions
ourke-6.473
versions-5.996
acters-5.398
onda-5.196
ggle-5.152
Token of
Token ID286
Feature activation+1.16e-01

Pos logit contributions
theless+3.068
ammu+2.359
 Lauder+2.251
ayer+2.204
ーテ+2.194
Neg logit contributions
ourke-2.817
versions-2.609
acters-2.376
onda-2.331
ggle-2.298
Token La
Token ID4689
Feature activation-1.09e-01

Pos logit contributions
ourke+1.394
versions+1.304
acters+1.159
onda+1.153
eca+1.141
Neg logit contributions
theless-1.477
ammu-1.086
ーテ-1.055
emo-1.025
ellen-1.010
Token R
Token ID371
Feature activation+1.06e+00

Pos logit contributions
theless+1.353
ammu+1.028
ーテ+0.983
emo+0.965
ayer+0.964
Neg logit contributions
ourke-1.317
versions-1.207
acters-1.096
ggle-1.068
onda-1.067
Tokenaza
Token ID7056
Feature activation-7.80e-02

Pos logit contributions
ourke+5.752
onda+4.331
onde+3.968
eca+3.469
versions+3.411
Neg logit contributions
theless-20.409
ーテ-17.492
 Lauder-16.411
etheless-16.272
 Gors-15.659
Token and
Token ID290
Feature activation+2.65e-02

Pos logit contributions
theless+1.050
ammu+0.811
ーテ+0.776
 Lauder+0.773
ayer+0.764
Neg logit contributions
ourke-0.869
versions-0.796
acters-0.714
onda-0.694
ggle-0.686
Token the
Token ID262
Feature activation-3.21e-01

Pos logit contributions
ourke+0.303
versions+0.278
acters+0.249
onda+0.245
eca+0.242
Neg logit contributions
theless-0.352
ammu-0.268
ーテ-0.258
 Lauder-0.253
emo-0.251
Token American
Token ID1605
Feature activation-3.93e-01

Pos logit contributions
theless+3.995
ammu+3.011
 Lauder+2.972
ayer+2.872
 Gors+2.869
Neg logit contributions
ourke-3.819
versions-3.500
acters-3.239
onda-3.110
ggle-3.087
Token Friends
Token ID14213
Feature activation-1.43e-01

Pos logit contributions
theless+4.499
ammu+3.354
 Gors+3.243
 Lauder+3.220
ーテ+3.216
Neg logit contributions
ourke-4.983
versions-4.627
acters-4.188
onda-4.127
ggle-4.067
Tokenx
Token ID87
Feature activation+1.41e-01

Pos logit contributions
theless+3.895
ammu+2.988
emo+2.789
ellen+2.750
ayer+2.744
Neg logit contributions
ourke-4.519
versions-4.133
acters-3.831
onda-3.770
ggle-3.697
Token.
Token ID13
Feature activation-1.67e-01

Pos logit contributions
ourke+1.385
versions+1.269
acters+1.106
eca+1.083
onda+1.079
Neg logit contributions
theless-2.139
ーテ-1.653
ammu-1.627
 Lauder-1.560
 Gors-1.557
Tokenx
Token ID87
Feature activation+6.28e-01

Pos logit contributions
theless+2.060
ammu+1.558
ーテ+1.460
emo+1.454
ayer+1.448
Neg logit contributions
ourke-2.046
versions-1.879
acters-1.709
onda-1.678
eca-1.654
Token.
Token ID13
Feature activation+6.01e-01

Pos logit contributions
ourke+4.921
versions+4.409
eca+3.780
acters+3.662
ington+3.615
Neg logit contributions
theless-10.497
ーテ-8.703
 pione-8.190
etheless-8.061
 eleph-8.028
Token123
Token ID10163
Feature activation-3.83e-02

Pos logit contributions
ourke+5.086
versions+4.612
eca+4.084
acters+3.720
onda+3.717
Neg logit contributions
theless-9.415
ーテ-7.915
 Gors-7.395
ammu-7.394
 Lauder-7.314
Token 104
Token ID14436
Feature activation+1.05e+00

Pos logit contributions
theless+0.481
ammu+0.360
ーテ+0.345
 Lauder+0.338
emo+0.335
Neg logit contributions
ourke-0.466
versions-0.430
acters-0.387
onda-0.382
eca-0.378
Token66
Token ID2791
Feature activation+1.21e-01

Pos logit contributions
ourke+6.594
versions+6.355
eca+5.476
ington+5.339
acters+5.225
Neg logit contributions
theless-16.100
 eleph-13.642
etheless-13.600
 subur-13.530
 pione-13.093
Token29
Token ID1959
Feature activation-5.03e-01

Pos logit contributions
ourke+1.332
versions+1.224
acters+1.086
eca+1.072
onda+1.063
Neg logit contributions
theless-1.655
ammu-1.254
ーテ-1.235
 Lauder-1.186
ayer-1.182
Token x
Token ID2124
Feature activation+1.17e-01

Pos logit contributions
theless+3.437
ammu+2.068
 Lauder+1.891
emo+1.815
 Gors+1.809
Neg logit contributions
ourke-8.487
versions-8.026
acters-7.475
onda-7.393
ggle-7.318
Token.
Token ID13
Feature activation-3.41e-01

Pos logit contributions
ourke+1.154
versions+1.057
acters+0.923
eca+0.902
onda+0.896
Neg logit contributions
theless-1.767
ーテ-1.356
ammu-1.347
 Lauder-1.291
 Gors-1.290
Tokenx
Token ID87
Feature activation+1.99e-01

Pos logit contributions
theless+3.790
ammu+2.909
emo+2.717
ellen+2.683
ayer+2.673
Neg logit contributions
ourke-4.348
versions-3.973
acters-3.682
onda-3.619
ggle-3.551
Token keeps
Token ID7622
Feature activation-2.99e-01

Pos logit contributions
theless+0.848
ammu+0.644
ーテ+0.619
 Lauder+0.608
emo+0.604
Neg logit contributions
ourke-0.732
versions-0.671
acters-0.601
onda-0.591
eca-0.585
Token adding
Token ID4375
Feature activation+1.96e-02

Pos logit contributions
theless+3.920
ammu+2.973
emo+2.840
ーテ+2.831
 Lauder+2.829
Neg logit contributions
ourke-3.427
versions-3.108
acters-2.838
onda-2.755
eca-2.730
Token fuel
Token ID5252
Feature activation+2.52e-01

Pos logit contributions
ourke+0.217
versions+0.198
acters+0.176
onda+0.173
eca+0.171
Neg logit contributions
theless-0.268
ammu-0.206
ーテ-0.198
 Lauder-0.194
emo-0.193
Token to
Token ID284
Feature activation+8.94e-02

Pos logit contributions
ourke+2.705
versions+2.482
acters+2.204
onda+2.180
eca+2.110
Neg logit contributions
theless-3.511
ammu-2.684
ーテ-2.579
 Lauder-2.526
ayer-2.524
Token the
Token ID262
Feature activation+1.84e-01

Pos logit contributions
ourke+0.970
versions+0.889
acters+0.787
onda+0.775
eca+0.764
Neg logit contributions
theless-1.238
ammu-0.953
ーテ-0.921
 Lauder-0.900
ayer-0.897
Token bon
Token ID5351
Feature activation+1.04e+00

Pos logit contributions
ourke+2.005
versions+1.850
acters+1.630
onda+1.623
eca+1.578
Neg logit contributions
theless-2.528
ammu-1.942
ーテ-1.898
ayer-1.834
 Lauder-1.830
Tokenfire
Token ID6495
Feature activation+4.82e-01

Pos logit contributions
ourke+7.210
versions+6.901
acters+6.087
ington+5.939
yg+5.907
Neg logit contributions
theless-15.136
 Lauder-12.456
ーテ-12.354
ammu-11.893
Berry-11.765
Token.
Token ID13
Feature activation-2.00e-01

Pos logit contributions
ourke+4.568
versions+4.185
acters+3.657
onda+3.614
ggle+3.523
Neg logit contributions
theless-7.017
ammu-5.459
ーテ-5.451
 Lauder-5.194
ayer-5.176
Token His
Token ID2399
Feature activation-1.90e-01

Pos logit contributions
theless+2.527
ammu+1.903
ーテ+1.835
 Lauder+1.822
 Gors+1.807
Neg logit contributions
ourke-2.392
versions-2.200
acters-1.981
onda-1.961
ggle-1.932
Token comment
Token ID2912
Feature activation+2.52e-03

Pos logit contributions
theless+2.533
ammu+1.931
ーテ+1.839
 Lauder+1.829
emo+1.820
Neg logit contributions
ourke-2.151
versions-1.944
acters-1.765
onda-1.729
eca-1.710
Token to
Token ID284
Feature activation+1.95e-01

Pos logit contributions
ourke+0.027
versions+0.024
acters+0.022
onda+0.021
eca+0.021
Neg logit contributions
theless-0.036
ammu-0.028
ーテ-0.027
 Lauder-0.026
emo-0.026
Token unused
Token ID21958
Feature activation-5.43e-01

Pos logit contributions
theless+5.554
ammu+4.167
 Lauder+4.078
 Gors+4.051
ーテ+4.049
Neg logit contributions
ourke-4.801
versions-4.332
acters-3.950
onda-3.938
ggle-3.895
Token work
Token ID670
Feature activation-4.10e-03

Pos logit contributions
theless+6.605
ammu+4.868
ーテ+4.774
 Gors+4.728
 Lauder+4.715
Neg logit contributions
ourke-6.369
versions-5.671
ggle-5.248
acters-5.232
onda-5.211
Token from
Token ID422
Feature activation-1.12e-01

Pos logit contributions
theless+0.056
ammu+0.043
ーテ+0.042
 Lauder+0.041
emo+0.041
Neg logit contributions
ourke-0.045
versions-0.041
acters-0.037
onda-0.036
eca-0.036
Token between
Token ID1022
Feature activation-4.38e-02

Pos logit contributions
theless+1.482
ammu+1.124
ーテ+1.085
 Lauder+1.074
 Gors+1.064
Neg logit contributions
ourke-1.269
versions-1.160
acters-1.048
onda-1.028
ggle-1.021
Token 1985
Token ID12863
Feature activation-2.81e-01

Pos logit contributions
theless+0.572
ammu+0.433
ーテ+0.416
 Lauder+0.409
emo+0.405
Neg logit contributions
ourke-0.511
versions-0.468
acters-0.422
onda-0.415
eca-0.411
Token and
Token ID290
Feature activation-1.17e+00

Pos logit contributions
theless+3.491
ammu+2.683
 Lauder+2.537
ayer+2.526
ーテ+2.518
Neg logit contributions
ourke-3.319
versions-3.019
acters-2.759
ggle-2.682
eca-2.680
Token 1990
Token ID6303
Feature activation-1.77e-01

Pos logit contributions
theless+9.202
 Lauder+6.604
 Gors+6.500
 Messenger+6.087
 Gaga+6.056
Neg logit contributions
ourke-15.618
versions-14.194
acters-13.881
yg-13.227
ggle-13.163
Token.
Token ID13
Feature activation-1.68e-01

Pos logit contributions
theless+2.324
ammu+1.761
ーテ+1.689
 Lauder+1.669
 Gors+1.656
Neg logit contributions
ourke-2.034
versions-1.873
acters-1.678
onda-1.650
ggle-1.644
Token Check
Token ID6822
Feature activation-6.61e-02

Pos logit contributions
theless+2.148
ammu+1.611
ーテ+1.557
 Lauder+1.540
 Gors+1.527
Neg logit contributions
ourke-1.998
versions-1.825
acters-1.658
onda-1.639
ggle-1.616
Token out
Token ID503
Feature activation+7.69e-02

Pos logit contributions
theless+0.814
ammu+0.607
ーテ+0.579
 Lauder+0.567
emo+0.564
Neg logit contributions
ourke-0.820
versions-0.756
acters-0.685
onda-0.674
eca-0.668
Token more
Token ID517
Feature activation+1.08e-01

Pos logit contributions
ourke+0.847
versions+0.777
acters+0.688
onda+0.678
eca+0.669
Neg logit contributions
theless-1.053
ammu-0.808
ーテ-0.779
 Lauder-0.762
emo-0.761
Token three
Token ID1115
Feature activation+2.10e-01

Pos logit contributions
theless+2.850
ammu+2.178
ーテ+2.091
 Gors+2.067
 Lauder+2.066
Neg logit contributions
ourke-2.318
versions-2.111
acters-1.906
onda-1.878
eca-1.858
Token meaningful
Token ID11570
Feature activation-3.55e-01

Pos logit contributions
ourke+2.482
versions+2.295
acters+2.076
onda+2.033
ington+1.989
Neg logit contributions
theless-2.700
ammu-2.050
ーテ-2.024
 Lauder-1.927
emo-1.908
Token actions
Token ID4028
Feature activation-9.94e-01

Pos logit contributions
theless+4.191
ammu+3.143
ayer+3.033
 Gors+2.948
emo+2.940
Neg logit contributions
ourke-4.299
versions-3.902
onda-3.690
eca-3.473
ggle-3.472
Token you
Token ID345
Feature activation-1.95e-01

Pos logit contributions
theless+7.100
ammu+5.385
ayer+4.820
emo+4.744
 Gors+4.691
Neg logit contributions
ourke-14.140
versions-13.027
escription-12.882
onda-12.112
milo-12.090
Token can
Token ID460
Feature activation-3.67e-01

Pos logit contributions
theless+2.486
ammu+1.912
ーテ+1.800
ayer+1.789
emo+1.785
Neg logit contributions
ourke-2.256
versions-2.085
acters-1.852
onda-1.847
eca-1.808
Token take
Token ID1011
Feature activation-1.16e+00

Pos logit contributions
theless+4.730
ammu+3.761
ayer+3.578
emo+3.488
 Gors+3.468
Neg logit contributions
ourke-3.960
versions-3.640
acters-3.245
onda-3.191
eca-3.176
Token each
Token ID1123
Feature activation-4.42e-01

Pos logit contributions
theless+9.331
ammu+7.013
ayer+6.657
 Gors+6.471
ellen+6.342
Neg logit contributions
ourke-14.057
versions-12.934
escription-12.920
orem-12.604
onde-12.587
Token week
Token ID1285
Feature activation-4.79e-01

Pos logit contributions
theless+5.265
ammu+4.209
ayer+3.979
emo+3.927
 Gors+3.779
Neg logit contributions
ourke-5.111
versions-4.612
acters-4.321
ggle-4.194
eca-4.189
Token.
Token ID13
Feature activation-6.14e-01

Pos logit contributions
theless+5.173
ammu+4.138
ayer+3.870
ellen+3.786
emo+3.783
Neg logit contributions
ourke-5.981
versions-5.447
onda-4.983
acters-4.975
ggle-4.866
Token
Token ID198
Feature activation-3.36e-01

Pos logit contributions
theless+5.905
ammu+4.607
ayer+4.195
emo+4.185
Berry+4.149
Neg logit contributions
ourke-8.197
versions-7.492
onda-6.921
acters-6.843
ggle-6.787
Token
Token ID198
Feature activation-5.58e-01

Pos logit contributions
theless+3.739
ammu+2.858
emo+2.654
ayer+2.648
Berry+2.600
Neg logit contributions
ourke-4.313
versions-3.929
acters-3.609
onda-3.600
ggle-3.511
Token x
Token ID2124
Feature activation+6.95e-03

Pos logit contributions
theless+2.271
ammu+1.442
 Lauder+1.301
ーテ+1.296
emo+1.275
Neg logit contributions
ourke-4.558
versions-4.299
acters-3.990
onda-3.936
ggle-3.906
Token.
Token ID13
Feature activation-4.12e-01

Pos logit contributions
ourke+0.073
versions+0.066
acters+0.059
onda+0.057
eca+0.057
Neg logit contributions
theless-0.100
ammu-0.077
ーテ-0.076
 Lauder-0.073
 Gors-0.073
Tokenx
Token ID87
Feature activation+3.47e-02

Pos logit contributions
theless+4.400
ammu+3.405
emo+3.183
ellen+3.127
ayer+3.115
Neg logit contributions
ourke-5.316
versions-4.859
acters-4.521
onda-4.451
 defe-4.376
Token.
Token ID13
Feature activation-3.80e-01

Pos logit contributions
ourke+0.361
versions+0.330
acters+0.291
onda+0.285
eca+0.283
Neg logit contributions
theless-0.505
ammu-0.386
ーテ-0.383
 Lauder-0.368
 Gors-0.366
Tokenx
Token ID87
Feature activation-6.19e-01

Pos logit contributions
theless+4.120
ammu+3.191
emo+2.965
ayer+2.948
ellen+2.923
Neg logit contributions
ourke-4.872
versions-4.454
acters-4.160
onda-4.066
 defe-4.004
Token.
Token ID13
Feature activation-1.16e+00

Pos logit contributions
theless+6.265
ammu+5.118
ayer+4.787
emo+4.589
ellen+4.577
Neg logit contributions
ourke-7.817
versions-6.984
 defe-6.589
acters-6.515
onda-6.456
Token176
Token ID24096
Feature activation+1.86e-01

Pos logit contributions
theless+5.978
ammu+3.795
ayer+3.316
mere+3.113
Berry+2.562
Neg logit contributions
ourke-19.129
acters-17.015
onde-16.716
 defe-16.495
onda-16.398
Token 98
Token ID9661
Feature activation+2.08e-01

Pos logit contributions
ourke+2.017
versions+1.879
acters+1.655
onda+1.640
eca+1.622
Neg logit contributions
theless-2.541
ammu-1.958
ーテ-1.904
ayer-1.849
emo-1.835
Token37
Token ID2718
Feature activation+1.31e-01

Pos logit contributions
ourke+2.132
versions+1.957
acters+1.729
eca+1.714
onda+1.674
Neg logit contributions
theless-2.992
ammu-2.300
ーテ-2.273
ayer-2.194
 Lauder-2.189
Token00
Token ID405
Feature activation-4.47e-01

Pos logit contributions
ourke+1.420
versions+1.304
acters+1.154
eca+1.141
onda+1.130
Neg logit contributions
theless-1.816
ammu-1.375
ーテ-1.369
 Lauder-1.306
ayer-1.301
Token x
Token ID2124
Feature activation+6.43e-02

Pos logit contributions
theless+3.191
ammu+1.982
 Lauder+1.811
emo+1.717
 Gors+1.711
Neg logit contributions
ourke-7.464
versions-7.069
acters-6.613
onda-6.477
ggle-6.444
Token at
Token ID379
Feature activation-7.19e-02

Pos logit contributions
theless+4.292
ammu+3.233
 Lauder+3.073
ayer+3.072
ellen+3.047
Neg logit contributions
ourke-4.330
versions-3.973
acters-3.610
onda-3.534
ggle-3.522
Token http
Token ID2638
Feature activation-1.33e-02

Pos logit contributions
theless+0.863
ammu+0.642
ーテ+0.607
ayer+0.599
 Gors+0.596
Neg logit contributions
ourke-0.910
versions-0.841
acters-0.763
onda-0.751
eca-0.746
Token://
Token ID1378
Feature activation-1.18e-01

Pos logit contributions
theless+0.141
ammu+0.097
ーテ+0.094
 Lauder+0.090
 Gors+0.089
Neg logit contributions
ourke-0.188
versions-0.176
acters-0.161
onda-0.159
eca-0.158
Tokenwww
Token ID2503
Feature activation+2.22e-01

Pos logit contributions
theless+1.310
ammu+0.943
emo+0.871
ayer+0.866
ーテ+0.858
Neg logit contributions
ourke-1.598
versions-1.469
acters-1.363
onda-1.326
eca-1.320
Token-
Token ID12
Feature activation-3.14e-01

Pos logit contributions
ourke+2.092
versions+1.942
acters+1.660
eca+1.628
ington+1.610
Neg logit contributions
theless-3.469
ーテ-2.731
ammu-2.626
 Lauder-2.567
 Gors-2.545
Tokenn
Token ID77
Feature activation-1.14e+00

Pos logit contributions
theless+4.255
ammu+3.291
emo+3.130
ayer+3.072
ellen+3.049
Neg logit contributions
ourke-3.443
versions-3.074
acters-2.836
ggle-2.709
eca-2.704
Tokenrd
Token ID4372
Feature activation-5.03e-01

Pos logit contributions
theless+11.906
emo+8.960
ayer+8.948
ammu+8.846
wire+8.121
Neg logit contributions
ourke-13.222
versions-11.172
acters-10.709
 defe-10.613
 guiActiveUnfocused-10.456
Token.
Token ID13
Feature activation-8.52e-03

Pos logit contributions
theless+6.393
ammu+4.982
emo+4.855
ayer+4.834
ivably+4.525
Neg logit contributions
ourke-5.443
versions-4.806
acters-4.726
ington-4.293
ggle-4.271
Tokenn
Token ID77
Feature activation-9.76e-01

Pos logit contributions
theless+0.110
ammu+0.084
ーテ+0.079
emo+0.078
ayer+0.078
Neg logit contributions
ourke-0.100
versions-0.092
acters-0.083
onda-0.081
eca-0.081
Tokenh
Token ID71
Feature activation-5.89e-01

Pos logit contributions
theless+10.024
emo+8.109
ammu+7.974
ayer+7.961
ellen+7.250
Neg logit contributions
ourke-11.925
versions-10.473
acters-9.563
 defe-9.482
 guiActiveUnfocused-9.200
Tokents
Token ID912
Feature activation-5.96e-01

Pos logit contributions
theless+5.780
ammu+4.210
ayer+3.860
emo+3.747
ilon+3.618
Neg logit contributions
ourke-8.291
versions-7.388
acters-7.088
 defe-6.925
yg-6.676
Token more
Token ID517
Feature activation+1.91e-01

Pos logit contributions
theless+0.088
ammu+0.068
ーテ+0.065
 Lauder+0.064
emo+0.063
Neg logit contributions
ourke-0.071
versions-0.065
acters-0.058
onda-0.057
eca-0.057
Token movie
Token ID3807
Feature activation+1.88e-01

Pos logit contributions
ourke+2.088
versions+1.914
acters+1.692
onda+1.683
eca+1.649
Neg logit contributions
theless-2.617
ammu-2.017
ーテ-1.955
 Lauder-1.902
emo-1.899
Token news
Token ID1705
Feature activation-2.29e-01

Pos logit contributions
ourke+1.951
versions+1.795
acters+1.591
onda+1.557
eca+1.538
Neg logit contributions
theless-2.679
ammu-2.067
ーテ-2.004
 Lauder-1.967
emo-1.958
Token,
Token ID11
Feature activation-1.11e-01

Pos logit contributions
theless+2.983
ammu+2.285
ーテ+2.192
ayer+2.157
 Gors+2.142
Neg logit contributions
ourke-2.614
versions-2.390
acters-2.128
onda-2.086
eca-2.076
Token follow
Token ID1061
Feature activation+4.44e-01

Pos logit contributions
theless+1.443
ammu+1.101
ーテ+1.044
 Lauder+1.033
ayer+1.030
Neg logit contributions
ourke-1.301
versions-1.191
acters-1.072
eca-1.045
onda-1.042
Token EW
Token ID43255
Feature activation-1.11e+00

Pos logit contributions
ourke+4.730
versions+4.323
onda+3.932
acters+3.810
eca+3.735
Neg logit contributions
theless-6.007
ammu-4.635
ーテ-4.585
 Lauder-4.377
emo-4.350
Token
Token ID447
Feature activation-7.71e-02

Pos logit contributions
theless+9.884
wire+7.885
ammu+7.811
 Messenger+7.457
emo+7.336
Neg logit contributions
ourke-12.558
versions-12.177
ggle-11.476
acters-11.304
milo-11.263
Token
Token ID247
Feature activation-1.41e-01

Pos logit contributions
theless+1.075
ammu+0.827
ーテ+0.798
 Lauder+0.782
emo+0.778
Neg logit contributions
ourke-0.834
versions-0.761
acters-0.677
onda-0.665
eca-0.657
Tokens
Token ID82
Feature activation-3.43e-01

Pos logit contributions
theless+1.855
ammu+1.419
ーテ+1.351
ayer+1.335
emo+1.333
Neg logit contributions
ourke-1.605
versions-1.484
acters-1.318
onda-1.299
eca-1.289
Token Anthony
Token ID9953
Feature activation+2.10e-01

Pos logit contributions
theless+4.248
ammu+3.220
ーテ+3.033
ayer+3.025
 Lauder+2.983
Neg logit contributions
ourke-4.051
versions-3.726
acters-3.357
ggle-3.313
eca-3.276
Token Bre
Token ID3719
Feature activation+4.16e-02

Pos logit contributions
ourke+2.470
versions+2.238
onda+2.027
acters+2.013
eca+1.997
Neg logit contributions
theless-2.733
ammu-2.018
ーテ-1.972
ayer-1.901
emo-1.898
Token were
Token ID547
Feature activation-2.83e-02

Pos logit contributions
ourke+0.069
versions+0.064
acters+0.057
onda+0.056
eca+0.055
Neg logit contributions
theless-0.078
ammu-0.059
ーテ-0.057
 Lauder-0.055
emo-0.055
Token other
Token ID584
Feature activation+1.37e-01

Pos logit contributions
theless+0.399
ammu+0.309
ーテ+0.298
 Lauder+0.293
emo+0.291
Neg logit contributions
ourke-0.300
versions-0.274
acters-0.242
onda-0.238
eca-0.235
Token top
Token ID1353
Feature activation+2.53e-01

Pos logit contributions
ourke+1.522
versions+1.431
acters+1.263
onda+1.247
eca+1.238
Neg logit contributions
theless-1.832
ammu-1.401
ーテ-1.358
 Lauder-1.313
 Gors-1.304
Token targets
Token ID6670
Feature activation+1.54e-01

Pos logit contributions
ourke+2.627
versions+2.511
acters+2.207
onda+2.166
eca+2.164
Neg logit contributions
theless-3.489
ammu-2.675
ーテ-2.625
 Lauder-2.562
 Gors-2.527
Token of
Token ID286
Feature activation-5.03e-02

Pos logit contributions
ourke+1.615
versions+1.475
acters+1.307
onda+1.294
eca+1.276
Neg logit contributions
theless-2.178
ammu-1.687
ーテ-1.634
 Lauder-1.596
emo-1.586
Token consumer
Token ID7172
Feature activation-1.10e+00

Pos logit contributions
theless+0.670
ammu+0.511
ーテ+0.490
 Lauder+0.483
ayer+0.481
Neg logit contributions
ourke-0.574
versions-0.522
acters-0.469
onda-0.460
eca-0.455
Token wrath
Token ID26485
Feature activation-6.52e-04

Pos logit contributions
theless+11.531
ammu+8.546
 Gors+8.494
 Lauder+8.421
wire+8.299
Neg logit contributions
ourke-12.533
versions-10.838
acters-10.210
milo-10.014
orem-10.013
Token.
Token ID13
Feature activation-3.69e-02

Pos logit contributions
theless+0.009
ammu+0.007
ーテ+0.007
 Lauder+0.007
emo+0.007
Neg logit contributions
ourke-0.007
versions-0.007
acters-0.006
onda-0.006
eca-0.006
Token
Token ID198
Feature activation-8.90e-03

Pos logit contributions
theless+0.483
ammu+0.366
ーテ+0.352
 Lauder+0.344
emo+0.342
Neg logit contributions
ourke-0.430
versions-0.395
acters-0.354
onda-0.349
eca-0.345
Token
Token ID198
Feature activation-9.71e-02

Pos logit contributions
theless+0.121
ammu+0.092
ーテ+0.090
 Lauder+0.087
 Gors+0.086
Neg logit contributions
ourke-0.100
versions-0.092
acters-0.082
onda-0.081
eca-0.080
TokenIn
Token ID818
Feature activation+5.28e-02

Pos logit contributions
theless+1.216
ammu+0.920
ーテ+0.867
emo+0.860
 Lauder+0.857
Neg logit contributions
ourke-1.176
versions-1.073
acters-0.977
onda-0.961
ggle-0.948
Token C
Token ID327
Feature activation-1.57e-01

Pos logit contributions
theless+2.495
ammu+1.813
 Lauder+1.706
ayer+1.691
ーテ+1.689
Neg logit contributions
ourke-2.699
versions-2.471
acters-2.266
onda-2.227
ggle-2.210
TokenFP
Token ID5837
Feature activation-4.53e-01

Pos logit contributions
theless+1.939
ammu+1.463
ーテ+1.374
ayer+1.363
emo+1.362
Neg logit contributions
ourke-1.921
versions-1.760
acters-1.595
onda-1.557
eca-1.550
TokenB
Token ID33
Feature activation-4.82e-02

Pos logit contributions
theless+4.641
ammu+3.338
ayer+3.287
Berry+3.177
emo+3.136
Neg logit contributions
ourke-6.106
versions-5.797
acters-5.295
onda-5.181
yg-5.054
Token launched
Token ID5611
Feature activation-2.95e-01

Pos logit contributions
theless+0.632
ammu+0.478
ーテ+0.458
 Lauder+0.450
emo+0.448
Neg logit contributions
ourke-0.562
versions-0.515
acters-0.462
onda-0.455
eca-0.449
Token a
Token ID257
Feature activation-3.73e-02

Pos logit contributions
theless+3.686
ammu+2.759
 Lauder+2.639
ayer+2.620
 Gors+2.612
Neg logit contributions
ourke-3.535
versions-3.194
acters-2.914
onda-2.828
eca-2.823
Token consumer
Token ID7172
Feature activation-1.08e+00

Pos logit contributions
theless+0.504
ammu+0.386
ーテ+0.371
 Lauder+0.365
emo+0.363
Neg logit contributions
ourke-0.417
versions-0.381
acters-0.341
onda-0.335
eca-0.331
Token help
Token ID1037
Feature activation-4.94e-02

Pos logit contributions
theless+11.660
ammu+8.773
 Gors+8.444
emo+8.272
 Lauder+8.188
Neg logit contributions
ourke-11.893
versions-10.674
acters-10.070
onda-9.828
eca-9.612
Token center
Token ID3641
Feature activation+6.86e-02

Pos logit contributions
theless+0.615
ammu+0.459
ーテ+0.440
 Lauder+0.430
ayer+0.428
Neg logit contributions
ourke-0.606
versions-0.559
acters-0.505
onda-0.497
eca-0.492
Token where
Token ID810
Feature activation-1.98e-01

Pos logit contributions
ourke+0.763
versions+0.698
acters+0.621
onda+0.611
eca+0.605
Neg logit contributions
theless-0.936
ammu-0.716
ーテ-0.692
 Lauder-0.676
emo-0.670
Token Americans
Token ID3399
Feature activation-2.26e-01

Pos logit contributions
theless+2.556
ammu+1.927
 Lauder+1.829
ayer+1.826
ーテ+1.814
Neg logit contributions
ourke-2.335
versions-2.105
acters-1.907
onda-1.860
ggle-1.859
Token can
Token ID460
Feature activation-2.23e-01

Pos logit contributions
theless+2.941
ammu+2.226
ーテ+2.135
ayer+2.115
 Lauder+2.098
Neg logit contributions
ourke-2.630
versions-2.416
acters-2.173
onda-2.111
eca-2.082
Tokenlling
Token ID2680
Feature activation-4.73e-01

Pos logit contributions
theless+2.556
ammu+1.994
ーテ+1.882
ayer+1.878
 Lauder+1.867
Neg logit contributions
ourke-1.913
versions-1.731
acters-1.535
onda-1.514
eca-1.501
Token.
Token ID13
Feature activation-2.92e-01

Pos logit contributions
theless+5.753
ammu+4.335
ayer+4.076
emo+4.028
 Gors+3.965
Neg logit contributions
ourke-5.647
versions-5.109
acters-4.605
onda-4.579
eca-4.534
Token
Token ID198
Feature activation-5.84e-03

Pos logit contributions
theless+3.585
ammu+2.639
ーテ+2.541
 Gors+2.531
emo+2.505
Neg logit contributions
ourke-3.582
versions-3.259
acters-2.989
onda-2.938
ggle-2.908
Token
Token ID198
Feature activation-2.58e-01

Pos logit contributions
theless+0.079
ammu+0.060
ーテ+0.059
 Lauder+0.057
 Gors+0.057
Neg logit contributions
ourke-0.066
versions-0.061
acters-0.054
onda-0.053
eca-0.052
TokenOB
Token ID9864
Feature activation-3.58e-02

Pos logit contributions
theless+3.049
ammu+2.308
emo+2.179
ayer+2.145
Berry+2.125
Neg logit contributions
ourke-3.199
versions-2.904
acters-2.704
onda-2.651
ggle-2.613
TokenB
Token ID33
Feature activation-1.06e+00

Pos logit contributions
theless+0.452
ammu+0.339
ーテ+0.325
 Lauder+0.317
emo+0.317
Neg logit contributions
ourke-0.430
versions-0.397
acters-0.358
onda-0.353
eca-0.348
Token Par
Token ID2547
Feature activation+1.39e-01

Pos logit contributions
theless+9.941
ammu+7.214
ayer+6.801
Berry+6.409
 Messenger+6.385
Neg logit contributions
ourke-12.535
versions-11.779
yg-11.288
ggle-11.055
acters-10.834
Tokence
Token ID344
Feature activation-2.85e-01

Pos logit contributions
ourke+0.923
versions+0.800
acters+0.649
onda+0.616
eca+0.610
Neg logit contributions
theless-2.554
ーテ-2.072
ammu-2.064
 Lauder-2.027
 Gors-2.013
Tokenlling
Token ID2680
Feature activation-8.09e-01

Pos logit contributions
theless+3.925
ammu+3.074
ayer+2.903
 Lauder+2.865
emo+2.860
Neg logit contributions
ourke-3.046
versions-2.757
acters-2.464
onda-2.418
eca-2.407
Token
Token ID198
Feature activation-1.93e-02

Pos logit contributions
theless+8.741
ammu+6.490
ayer+6.124
 Gors+6.043
 Lauder+5.912
Neg logit contributions
ourke-9.723
versions-8.741
acters-8.129
ggle-8.061
onda-8.019
Token
Token ID198
Feature activation-1.48e-01

Pos logit contributions
theless+0.260
ammu+0.197
ーテ+0.192
 Lauder+0.186
 Gors+0.184
Neg logit contributions
ourke-0.219
versions-0.201
acters-0.179
onda-0.176
eca-0.175
Token (
Token ID357
Feature activation-2.86e-01

Pos logit contributions
theless+0.968
ammu+0.738
ーテ+0.707
 Lauder+0.696
ayer+0.692
Neg logit contributions
ourke-0.822
versions-0.753
acters-0.673
onda-0.661
eca-0.655
Tokenwhich
Token ID4758
Feature activation-3.80e-01

Pos logit contributions
theless+3.793
ammu+2.934
emo+2.754
ayer+2.738
ーテ+2.708
Neg logit contributions
ourke-3.211
versions-2.879
acters-2.687
ggle-2.570
eca-2.565
Token,
Token ID11
Feature activation-6.31e-02

Pos logit contributions
theless+4.873
ammu+3.620
 Gors+3.482
 Lauder+3.454
ーテ+3.444
Neg logit contributions
ourke-4.509
versions-4.117
acters-3.706
onda-3.570
ggle-3.569
Token it
Token ID340
Feature activation-2.51e-01

Pos logit contributions
theless+0.822
ammu+0.620
ーテ+0.594
 Lauder+0.584
 Gors+0.580
Neg logit contributions
ourke-0.741
versions-0.681
acters-0.612
onda-0.600
eca-0.595
Token should
Token ID815
Feature activation-5.87e-01

Pos logit contributions
theless+3.101
ammu+2.269
 Lauder+2.146
ayer+2.118
emo+2.113
Neg logit contributions
ourke-3.162
versions-2.911
acters-2.642
eca-2.554
onda-2.551
Token be
Token ID307
Feature activation-1.05e+00

Pos logit contributions
theless+6.234
ammu+4.621
emo+4.329
ayer+4.303
 Lauder+4.048
Neg logit contributions
ourke-7.857
versions-7.364
acters-6.606
eca-6.320
milo-6.231
Token noted
Token ID4367
Feature activation+3.76e-02

Pos logit contributions
theless+8.749
ammu+6.106
emo+6.050
 Gors+5.922
 Lauder+5.592
Neg logit contributions
ourke-14.295
versions-13.725
acters-12.881
milo-12.147
eca-12.041
Token,
Token ID11
Feature activation-3.40e-02

Pos logit contributions
ourke+0.414
versions+0.378
acters+0.336
onda+0.329
eca+0.328
Neg logit contributions
theless-0.515
ammu-0.397
ーテ-0.381
 Lauder-0.375
ayer-0.374
Token came
Token ID1625
Feature activation-3.62e-01

Pos logit contributions
theless+0.458
ammu+0.350
ーテ+0.337
 Lauder+0.330
emo+0.328
Neg logit contributions
ourke-0.382
versions-0.350
acters-0.313
onda-0.308
eca-0.304
Token two
Token ID734
Feature activation-2.92e-01

Pos logit contributions
theless+4.578
ammu+3.509
ーテ+3.310
ayer+3.308
 Lauder+3.262
Neg logit contributions
ourke-4.277
versions-3.916
acters-3.539
eca-3.435
ggle-3.363
Token months
Token ID1933
Feature activation-3.42e-01

Pos logit contributions
theless+4.141
ammu+3.263
 Lauder+3.088
ーテ+3.076
ayer+3.061
Neg logit contributions
ourke-3.020
versions-2.670
acters-2.433
eca-2.327
onda-2.317
Token
Token ID198
Feature activation-1.28e-01

Pos logit contributions
theless+2.048
ammu+1.556
ーテ+1.452
 Lauder+1.452
ayer+1.451
Neg logit contributions
ourke-1.985
versions-1.818
acters-1.648
onda-1.626
eca-1.601
Token
Token ID447
Feature activation+9.45e-02

Pos logit contributions
theless+1.594
ammu+1.212
ーテ+1.138
emo+1.131
ellen+1.128
Neg logit contributions
ourke-1.541
versions-1.418
acters-1.291
onda-1.268
eca-1.257
Token
Token ID250
Feature activation-1.91e-01

Pos logit contributions
ourke+0.829
versions+0.738
acters+0.636
onda+0.622
eca+0.613
Neg logit contributions
theless-1.508
ammu-1.204
ーテ-1.163
 Lauder-1.152
 Gors-1.144
TokenNobody
Token ID24795
Feature activation-3.79e-02

Pos logit contributions
theless+2.396
ammu+1.800
ーテ+1.720
 Lauder+1.680
ayer+1.680
Neg logit contributions
ourke-2.292
versions-2.115
acters-1.917
onda-1.883
eca-1.872
Token in
Token ID287
Feature activation-4.89e-02

Pos logit contributions
theless+0.501
ammu+0.380
ーテ+0.366
 Lauder+0.358
emo+0.356
Neg logit contributions
ourke-0.435
versions-0.399
acters-0.358
onda-0.352
eca-0.348
Token their
Token ID511
Feature activation-1.03e+00

Pos logit contributions
theless+0.599
ammu+0.444
ーテ+0.426
 Lauder+0.418
ayer+0.414
Neg logit contributions
ourke-0.608
versions-0.559
acters-0.510
onda-0.500
eca-0.497
Token right
Token ID826
Feature activation-2.11e-01

Pos logit contributions
theless+7.461
 Lauder+5.326
ammu+5.308
 Gors+4.932
ellen+4.903
Neg logit contributions
ourke-14.878
versions-12.906
acters-12.814
eca-12.336
milo-12.231
Token mind
Token ID2000
Feature activation-4.34e-02

Pos logit contributions
theless+2.597
ammu+1.953
 Lauder+1.842
ーテ+1.842
 Gors+1.829
Neg logit contributions
ourke-2.572
versions-2.352
acters-2.170
eca-2.117
onda-2.071
Token would
Token ID561
Feature activation-1.61e-01

Pos logit contributions
theless+0.573
ammu+0.435
ーテ+0.418
 Lauder+0.410
emo+0.407
Neg logit contributions
ourke-0.501
versions-0.460
acters-0.412
onda-0.405
eca-0.401
Token say
Token ID910
Feature activation+1.60e-01

Pos logit contributions
theless+2.209
ammu+1.706
ーテ+1.626
emo+1.603
ayer+1.602
Neg logit contributions
ourke-1.765
versions-1.616
acters-1.433
onda-1.412
eca-1.400
Token that
Token ID326
Feature activation-4.55e-02

Pos logit contributions
ourke+1.732
versions+1.578
acters+1.397
onda+1.381
eca+1.357
Neg logit contributions
theless-2.227
ammu-1.710
ーテ-1.657
 Lauder-1.616
emo-1.608
Token planet
Token ID5440
Feature activation-1.64e-01

Pos logit contributions
theless+5.026
ammu+3.881
ayer+3.674
 Gors+3.640
 Lauder+3.627
Neg logit contributions
ourke-4.728
versions-4.191
acters-3.928
onda-3.789
eca-3.710
Token,
Token ID11
Feature activation+7.35e-02

Pos logit contributions
theless+2.127
ammu+1.613
ーテ+1.533
ayer+1.514
 Lauder+1.506
Neg logit contributions
ourke-1.920
versions-1.761
acters-1.582
onda-1.550
eca-1.528
Token no
Token ID645
Feature activation-4.50e-01

Pos logit contributions
ourke+1.009
versions+0.938
acters+0.858
onda+0.847
eca+0.840
Neg logit contributions
theless-0.807
ammu-0.572
ーテ-0.545
 Lauder-0.530
emo-0.530
Token matter
Token ID2300
Feature activation-5.01e-01

Pos logit contributions
theless+4.672
ammu+3.393
 Lauder+3.143
ayer+3.124
ーテ+3.105
Neg logit contributions
ourke-6.158
versions-5.612
acters-5.250
onda-5.156
eca-5.092
Token what
Token ID644
Feature activation-7.58e-01

Pos logit contributions
theless+6.398
ammu+4.811
 Lauder+4.542
 Gors+4.542
ayer+4.480
Neg logit contributions
ourke-5.807
versions-5.139
acters-4.673
onda-4.654
eca-4.591
Token our
Token ID674
Feature activation-1.03e+00

Pos logit contributions
theless+8.758
ammu+6.589
 Gors+6.252
 Lauder+6.251
ayer+6.182
Neg logit contributions
ourke-8.914
versions-7.770
onda-7.260
acters-7.215
eca-7.166
Token backgrounds
Token ID19063
Feature activation-3.43e-01

Pos logit contributions
theless+10.729
ammu+8.126
 Gors+7.997
 Lauder+7.754
 Messenger+7.482
Neg logit contributions
ourke-12.065
versions-10.103
onda-10.057
eca-9.715
acters-9.640
Token may
Token ID743
Feature activation-1.89e-01

Pos logit contributions
theless+4.219
ammu+3.256
ayer+3.100
ーテ+3.078
 Lauder+3.042
Neg logit contributions
ourke-4.018
versions-3.700
acters-3.368
onda-3.240
eca-3.228
Token be
Token ID307
Feature activation-3.40e-01

Pos logit contributions
theless+2.356
ammu+1.788
ーテ+1.687
emo+1.672
ayer+1.658
Neg logit contributions
ourke-2.295
versions-2.095
acters-1.893
onda-1.854
eca-1.842
Token,
Token ID11
Feature activation-2.68e-01

Pos logit contributions
theless+4.197
ammu+3.185
ーテ+3.084
ayer+3.034
 Lauder+2.998
Neg logit contributions
ourke-4.004
versions-3.675
acters-3.335
onda-3.263
eca-3.218
Token
Token ID447
Feature activation-3.39e-02

Pos logit contributions
theless+2.956
ammu+2.138
ーテ+2.031
ayer+1.977
 Lauder+1.973
Neg logit contributions
ourke-3.601
versions-3.350
acters-3.061
onda-3.017
eca-2.984
Token invoke
Token ID26342
Feature activation-4.29e-01

Pos logit contributions
theless+2.347
ammu+1.798
ーテ+1.700
ayer+1.672
emo+1.670
Neg logit contributions
ourke-2.138
versions-1.940
acters-1.738
eca-1.717
onda-1.699
Token it
Token ID340
Feature activation-4.00e-01

Pos logit contributions
theless+5.137
ammu+3.885
ーテ+3.724
 Gors+3.666
emo+3.636
Neg logit contributions
ourke-5.228
versions-4.705
acters-4.269
eca-4.238
onda-4.194
Token locally
Token ID15726
Feature activation-1.84e-02

Pos logit contributions
theless+4.796
ammu+3.576
ayer+3.366
emo+3.335
ーテ+3.328
Neg logit contributions
ourke-4.871
versions-4.503
acters-4.056
eca-4.022
onda-3.874
Token with
Token ID351
Feature activation-6.20e-01

Pos logit contributions
theless+0.245
ammu+0.186
ーテ+0.179
 Lauder+0.176
emo+0.175
Neg logit contributions
ourke-0.211
versions-0.193
acters-0.173
onda-0.170
eca-0.168
Token sl
Token ID1017
Feature activation-4.88e-01

Pos logit contributions
theless+7.149
ammu+5.337
ーテ+5.116
 Gors+5.083
emo+5.062
Neg logit contributions
ourke-7.429
versions-6.699
acters-6.079
eca-6.050
ggle-5.987
Tokens
Token ID82
Feature activation-1.03e+00

Pos logit contributions
theless+6.416
ammu+5.166
ayer+4.871
ーテ+4.759
emo+4.712
Neg logit contributions
ourke-5.193
versions-4.515
acters-4.115
eca-4.072
ggle-4.061
Token invoke
Token ID26342
Feature activation-6.06e-01

Pos logit contributions
theless+11.645
ammu+9.233
ayer+8.605
emo+8.542
ーテ+8.379
Neg logit contributions
ourke-11.037
versions-9.399
acters-8.785
eca-8.695
ggle-8.457
Token local
Token ID1957
Feature activation-2.64e-01

Pos logit contributions
theless+7.044
ammu+5.447
ーテ+5.055
emo+5.017
ayer+4.982
Neg logit contributions
ourke-7.263
versions-6.439
acters-5.910
eca-5.790
ggle-5.747
Token -
Token ID532
Feature activation-2.57e-01

Pos logit contributions
theless+3.473
ammu+2.718
ーテ+2.597
emo+2.581
ayer+2.553
Neg logit contributions
ourke-2.960
versions-2.642
acters-2.427
eca-2.328
ggle-2.323
Tokenf
Token ID69
Feature activation-7.44e-01

Pos logit contributions
theless+3.501
ammu+2.708
emo+2.584
ーテ+2.565
ayer+2.545
Neg logit contributions
ourke-2.803
versions-2.515
acters-2.290
onda-2.205
eca-2.203
Token test
Token ID1332
Feature activation-6.73e-01

Pos logit contributions
theless+8.590
ammu+6.698
emo+6.284
ayer+6.177
ーテ+6.164
Neg logit contributions
ourke-8.605
versions-7.544
eca-6.889
acters-6.876
ggle-6.788
Token 20
Token ID1160
Feature activation-4.96e-01

Pos logit contributions
theless+6.835
ammu+5.232
ayer+4.916
ーテ+4.855
 Lauder+4.830
Neg logit contributions
ourke-6.884
versions-6.033
eca-5.544
acters-5.526
onda-5.472
Token 21
Token ID2310
Feature activation-3.56e-01

Pos logit contributions
theless+5.311
ammu+4.097
ayer+3.811
ーテ+3.688
 Gors+3.663
Neg logit contributions
ourke-6.450
versions-5.775
eca-5.420
ggle-5.332
yg-5.261
Token 22
Token ID2534
Feature activation-3.04e-01

Pos logit contributions
theless+4.313
ammu+3.302
ayer+3.192
 Lauder+3.079
ーテ+3.076
Neg logit contributions
ourke-4.150
versions-3.841
eca-3.494
acters-3.485
onda-3.412
Token 23
Token ID2242
Feature activation-5.26e-01

Pos logit contributions
theless+3.963
ammu+3.054
ーテ+2.880
ayer+2.870
 Gors+2.867
Neg logit contributions
ourke-3.402
versions-3.112
onda-2.748
eca-2.735
ggle-2.714
Token 24
Token ID1987
Feature activation-7.02e-01

Pos logit contributions
theless+5.586
ammu+4.010
ayer+3.815
 Gors+3.779
emo+3.705
Neg logit contributions
ourke-6.850
versions-6.202
eca-5.734
acters-5.707
onda-5.658
Token 25
Token ID1679
Feature activation-1.03e+00

Pos logit contributions
theless+7.767
ammu+5.735
 Gors+5.421
ivably+5.378
Berry+5.316
Neg logit contributions
ourke-8.620
versions-7.581
eca-7.142
yg-6.854
acters-6.842
Token 26
Token ID2608
Feature activation-5.70e-01

Pos logit contributions
theless+7.948
ammu+5.602
 Gors+5.369
 Messenger+5.039
ayer+5.036
Neg logit contributions
ourke-14.818
eca-12.735
versions-12.716
milo-12.108
onde-11.836
Token 27
Token ID2681
Feature activation-4.98e-01

Pos logit contributions
theless+6.055
ammu+4.462
 Gors+4.210
ーテ+4.127
 Lauder+3.997
Neg logit contributions
ourke-7.394
versions-6.695
eca-6.252
milo-6.233
yg-6.165
Token #
Token ID1303
Feature activation-3.03e-01

Pos logit contributions
theless+5.583
ammu+4.090
 Gors+3.933
ーテ+3.928
ayer+3.912
Neg logit contributions
ourke-6.323
versions-5.616
eca-5.213
onda-5.153
ington-5.150
Token loop
Token ID9052
Feature activation-3.03e-01

Pos logit contributions
theless+3.994
ammu+3.113
ーテ+2.929
ayer+2.917
 Lauder+2.897
Neg logit contributions
ourke-3.409
versions-3.033
acters-2.732
eca-2.700
ggle-2.668
Token over
Token ID625
Feature activation-2.23e-01

Pos logit contributions
theless+3.793
ammu+2.891
ーテ+2.743
ayer+2.705
emo+2.686
Neg logit contributions
ourke-3.606
versions-3.257
acters-2.909
onda-2.892
eca-2.886
Token.
Token ID764
Feature activation-3.89e-01

Pos logit contributions
ourke+0.283
versions+0.259
acters+0.230
onda+0.226
eca+0.224
Neg logit contributions
theless-0.355
ammu-0.273
ーテ-0.264
 Lauder-0.258
emo-0.257
Token
Token ID198
Feature activation-3.24e-01

Pos logit contributions
theless+4.630
ammu+3.437
emo+3.313
ayer+3.201
 Gors+3.197
Neg logit contributions
ourke-4.867
versions-4.399
acters-4.028
onda-3.946
ggle-3.888
Token
Token ID198
Feature activation-3.75e-01

Pos logit contributions
theless+3.702
ammu+2.819
emo+2.612
ayer+2.610
 Lauder+2.576
Neg logit contributions
ourke-4.113
versions-3.740
acters-3.446
onda-3.417
ggle-3.343
TokenDem
Token ID11522
Feature activation-4.87e-01

Pos logit contributions
theless+4.515
ammu+3.559
emo+3.414
ayer+3.298
ellen+3.259
Neg logit contributions
ourke-4.470
versions-3.959
acters-3.712
onda-3.616
ggle-3.544
Tokeno
Token ID78
Feature activation-4.81e-01

Pos logit contributions
theless+5.875
ammu+4.527
emo+4.393
ayer+4.234
 Gors+4.067
Neg logit contributions
ourke-5.695
versions-5.225
acters-4.747
ggle-4.532
eca-4.493
Token Application
Token ID15678
Feature activation-1.02e+00

Pos logit contributions
theless+5.843
ammu+4.543
emo+4.404
ayer+4.252
ーテ+4.223
Neg logit contributions
ourke-5.584
versions-4.954
acters-4.546
onda-4.487
ggle-4.390
Token Architecture
Token ID29778
Feature activation-2.29e-01

Pos logit contributions
theless+10.125
ammu+7.995
emo+7.794
ellen+7.409
 Gors+7.374
Neg logit contributions
ourke-11.795
versions-10.360
acters-9.607
ggle-9.572
onda-9.546
Token
Token ID198
Feature activation-8.45e-02

Pos logit contributions
theless+2.780
ammu+2.118
emo+2.001
ーテ+1.990
ayer+1.970
Neg logit contributions
ourke-2.813
versions-2.571
acters-2.352
onda-2.288
ggle-2.267
Token
Token ID198
Feature activation-5.72e-01

Pos logit contributions
theless+1.101
ammu+0.835
ーテ+0.800
 Lauder+0.785
emo+0.781
Neg logit contributions
ourke-0.987
versions-0.906
acters-0.813
onda-0.801
eca-0.791
TokenAll
Token ID3237
Feature activation-2.96e-01

Pos logit contributions
theless+6.058
emo+4.768
ammu+4.754
ayer+4.406
Berry+4.391
Neg logit contributions
ourke-7.231
versions-6.389
acters-6.079
onda-5.926
ggle-5.775
Token the
Token ID262
Feature activation-5.03e-01

Pos logit contributions
theless+3.794
ammu+2.883
emo+2.788
ーテ+2.765
ayer+2.731
Neg logit contributions
ourke-3.461
versions-3.110
acters-2.780
ggle-2.743
onda-2.741
Token of
Token ID286
Feature activation-6.42e-01

Pos logit contributions
theless+3.873
ammu+2.777
ayer+2.605
 Gors+2.592
ーテ+2.538
Neg logit contributions
ourke-5.508
versions-5.084
acters-4.698
ggle-4.632
eca-4.617
Token contributors
Token ID20420
Feature activation-6.15e-01

Pos logit contributions
theless+7.009
 Gors+5.214
ammu+5.157
ーテ+5.057
ayer+4.864
Neg logit contributions
ourke-8.466
versions-7.459
acters-7.015
eca-6.910
onda-6.763
Token are
Token ID389
Feature activation-5.21e-01

Pos logit contributions
theless+5.786
ammu+4.419
ayer+4.107
 Gors+4.066
emo+4.009
Neg logit contributions
ourke-8.537
versions-7.667
ggle-7.383
onda-7.198
acters-7.156
Token their
Token ID511
Feature activation-3.54e-01

Pos logit contributions
theless+5.140
ammu+3.792
 Gors+3.640
 Lauder+3.501
ellen+3.406
Neg logit contributions
ourke-6.984
versions-6.421
acters-6.205
ggle-6.078
onda-5.915
Token own
Token ID898
Feature activation-1.80e-01

Pos logit contributions
theless+3.329
ammu+2.729
ayer+2.425
 Gors+2.419
mere+2.357
Neg logit contributions
ourke-4.956
versions-4.490
acters-4.251
onda-4.083
yg-4.004
Token and
Token ID290
Feature activation-1.02e+00

Pos logit contributions
theless+2.254
ammu+1.781
 Gors+1.657
ーテ+1.655
ayer+1.643
Neg logit contributions
ourke-2.136
versions-1.938
acters-1.787
onda-1.700
ggle-1.681
Token are
Token ID389
Feature activation-8.49e-01

Pos logit contributions
theless+8.647
ammu+6.484
 Gors+6.289
ayer+6.146
ーテ+5.972
Neg logit contributions
ourke-13.526
acters-12.037
versions-11.926
ggle-11.155
onda-11.056
Token not
Token ID407
Feature activation-6.76e-01

Pos logit contributions
theless+7.311
ammu+5.059
 Gors+4.865
ayer+4.796
ーテ+4.749
Neg logit contributions
ourke-11.878
versions-11.158
acters-10.661
ggle-10.519
eca-10.506
Token the
Token ID262
Feature activation-4.37e-01

Pos logit contributions
theless+7.162
 Gors+4.990
ammu+4.979
ayer+4.690
 Lauder+4.555
Neg logit contributions
ourke-8.792
versions-7.855
acters-7.679
ggle-7.631
onda-7.519
Token views
Token ID5009
Feature activation-7.10e-01

Pos logit contributions
theless+5.518
ammu+4.552
 Gors+4.281
ayer+4.213
 Messenger+4.171
Neg logit contributions
ourke-4.762
versions-4.225
acters-3.880
ggle-3.755
onda-3.741
Token of
Token ID286
Feature activation-6.83e-01

Pos logit contributions
theless+7.548
ammu+5.701
ellen+5.407
ayer+5.382
 Gors+5.254
Neg logit contributions
ourke-8.947
versions-8.153
acters-7.421
onda-7.307
ggle-7.263
Token is
Token ID318
Feature activation-4.38e-01

Pos logit contributions
ourke+0.550
versions+0.504
acters+0.448
onda+0.441
eca+0.436
Neg logit contributions
theless-0.685
ammu-0.526
ーテ-0.508
 Lauder-0.497
emo-0.495
Token just
Token ID655
Feature activation-5.64e-01

Pos logit contributions
theless+5.485
ammu+4.193
 Gors+4.066
 Lauder+4.021
emo+3.949
Neg logit contributions
ourke-5.048
versions-4.604
acters-4.246
eca-4.125
onda-4.092
Token that
Token ID326
Feature activation-5.52e-01

Pos logit contributions
theless+6.958
ammu+5.289
 Gors+5.163
 Lauder+5.090
ayer+5.044
Neg logit contributions
ourke-6.340
versions-5.809
acters-5.381
onda-5.152
ggle-5.142
Token and
Token ID290
Feature activation-4.07e-01

Pos logit contributions
theless+6.279
ammu+4.770
ayer+4.655
ellen+4.485
 Gors+4.480
Neg logit contributions
ourke-6.729
versions-6.109
acters-5.715
eca-5.482
onda-5.441
Token must
Token ID1276
Feature activation-2.80e-01

Pos logit contributions
theless+5.265
ammu+3.955
 Gors+3.896
ayer+3.795
ーテ+3.794
Neg logit contributions
ourke-4.558
versions-4.228
acters-3.844
onda-3.758
eca-3.750
Token be
Token ID307
Feature activation-9.99e-01

Pos logit contributions
theless+3.470
ammu+2.602
ayer+2.447
emo+2.421
ーテ+2.416
Neg logit contributions
ourke-3.406
versions-3.110
acters-2.787
onda-2.764
eca-2.711
Token taken
Token ID2077
Feature activation-6.15e-01

Pos logit contributions
theless+11.428
 Gors+8.518
ammu+8.272
ayer+8.081
wolves+7.868
Neg logit contributions
ourke-10.941
versions-9.953
acters-9.360
onda-9.091
eca-8.964
Token with
Token ID351
Feature activation-8.76e-02

Pos logit contributions
theless+6.601
ammu+5.007
ayer+4.835
 Gors+4.753
 Lauder+4.612
Neg logit contributions
ourke-7.885
versions-7.263
acters-6.721
onda-6.425
eca-6.337
Token a
Token ID257
Feature activation-5.26e-02

Pos logit contributions
theless+1.184
ammu+0.917
ーテ+0.870
 Gors+0.863
ayer+0.862
Neg logit contributions
ourke-0.976
versions-0.890
acters-0.800
onda-0.782
eca-0.768
Token grain
Token ID13020
Feature activation-2.87e-01

Pos logit contributions
theless+0.716
ammu+0.549
ーテ+0.529
 Lauder+0.518
emo+0.515
Neg logit contributions
ourke-0.586
versions-0.536
acters-0.478
onda-0.470
eca-0.465
Token of
Token ID286
Feature activation-1.65e-01

Pos logit contributions
theless+3.744
ammu+2.908
ayer+2.752
ーテ+2.742
 Lauder+2.713
Neg logit contributions
ourke-3.252
versions-2.989
acters-2.697
onda-2.609
ggle-2.588
Token receive
Token ID3328
Feature activation-8.09e-01

Pos logit contributions
theless+5.874
ammu+4.319
 Gors+4.124
Berry+4.038
ayer+3.937
Neg logit contributions
ourke-8.399
versions-7.796
onda-7.291
acters-7.189
eca-7.005
Token occasional
Token ID12209
Feature activation-7.20e-01

Pos logit contributions
theless+7.597
ammu+6.028
 Gors+5.653
emo+5.634
wolves+5.487
Neg logit contributions
ourke-10.162
versions-9.209
onda-8.945
acters-8.830
orem-8.585
Token updates
Token ID5992
Feature activation-6.21e-01

Pos logit contributions
theless+7.697
ammu+6.032
ayer+5.459
 Gors+5.372
emo+5.318
Neg logit contributions
ourke-9.038
versions-8.165
onda-7.647
eca-7.599
acters-7.497
Token and
Token ID290
Feature activation-5.76e-01

Pos logit contributions
theless+6.602
ammu+5.223
 Lauder+4.640
 Gors+4.590
ellen+4.545
Neg logit contributions
ourke-7.880
versions-7.187
acters-6.664
ggle-6.626
onda-6.605
Token special
Token ID2041
Feature activation-3.93e-01

Pos logit contributions
theless+5.462
ammu+4.066
emo+3.650
ayer+3.601
 Gors+3.520
Neg logit contributions
ourke-7.994
versions-7.356
onda-6.877
ggle-6.720
acters-6.704
Token offers
Token ID4394
Feature activation-9.94e-01

Pos logit contributions
theless+4.362
ammu+3.279
 Gors+3.020
emo+3.003
ayer+2.994
Neg logit contributions
ourke-5.081
versions-4.605
onda-4.254
acters-4.205
eca-4.184
Token for
Token ID329
Feature activation-7.44e-01

Pos logit contributions
theless+8.109
ammu+6.630
ayer+5.661
 Gors+5.485
emo+5.393
Neg logit contributions
ourke-13.224
versions-12.223
onda-11.965
escription-11.791
acters-11.437
Token The
Token ID383
Feature activation-7.60e-01

Pos logit contributions
theless+7.274
ammu+5.600
ayer+5.152
emo+5.136
Berry+5.021
Neg logit contributions
ourke-9.527
versions-8.823
acters-8.173
eca-8.153
onda-8.066
Token New
Token ID968
Feature activation-3.92e-01

Pos logit contributions
theless+7.146
ammu+5.628
 Lauder+5.427
 Gors+5.357
 Messenger+5.283
Neg logit contributions
ourke-9.882
versions-9.158
milo-8.442
ggle-8.429
onda-8.393
Token York
Token ID1971
Feature activation-4.08e-01

Pos logit contributions
theless+4.388
ammu+3.376
 Lauder+3.138
 Gors+3.134
ayer+3.090
Neg logit contributions
ourke-4.943
versions-4.576
onda-4.157
acters-4.125
ggle-4.060
Token Times
Token ID3782
Feature activation-6.73e-01

Pos logit contributions
theless+3.956
ammu+2.868
 Lauder+2.604
 Gors+2.588
ーテ+2.566
Neg logit contributions
ourke-5.849
versions-5.415
acters-5.006
onda-4.971
ggle-4.872
Tokenll
Token ID297
Feature activation-7.88e-02

Pos logit contributions
theless+2.116
ammu+1.686
ayer+1.615
emo+1.601
ellen+1.587
Neg logit contributions
ourke-1.367
versions-1.233
acters-1.090
onda-1.061
eca-1.042
Token send
Token ID3758
Feature activation-3.86e-01

Pos logit contributions
theless+0.950
ammu+0.704
ーテ+0.663
emo+0.652
ayer+0.652
Neg logit contributions
ourke-0.994
versions-0.920
acters-0.833
onda-0.819
eca-0.814
Token you
Token ID345
Feature activation-2.10e-01

Pos logit contributions
theless+4.152
ammu+3.195
ayer+2.945
emo+2.920
 Gors+2.902
Neg logit contributions
ourke-5.057
versions-4.683
acters-4.233
onda-4.177
eca-4.153
Token three
Token ID1115
Feature activation+2.10e-01

Pos logit contributions
theless+2.850
ammu+2.178
ーテ+2.091
 Gors+2.067
 Lauder+2.066
Neg logit contributions
ourke-2.318
versions-2.111
acters-1.906
onda-1.878
eca-1.858
Token meaningful
Token ID11570
Feature activation-3.55e-01

Pos logit contributions
ourke+2.482
versions+2.295
acters+2.076
onda+2.033
ington+1.989
Neg logit contributions
theless-2.700
ammu-2.050
ーテ-2.024
 Lauder-1.927
emo-1.908
Token actions
Token ID4028
Feature activation-9.94e-01

Pos logit contributions
theless+4.191
ammu+3.143
ayer+3.033
 Gors+2.948
emo+2.940
Neg logit contributions
ourke-4.299
versions-3.902
onda-3.690
eca-3.473
ggle-3.472
Token you
Token ID345
Feature activation-1.95e-01

Pos logit contributions
theless+7.100
ammu+5.385
ayer+4.820
emo+4.744
 Gors+4.691
Neg logit contributions
ourke-14.140
versions-13.027
escription-12.882
onda-12.112
milo-12.090
Token can
Token ID460
Feature activation-3.67e-01

Pos logit contributions
theless+2.486
ammu+1.912
ーテ+1.800
ayer+1.789
emo+1.785
Neg logit contributions
ourke-2.256
versions-2.085
acters-1.852
onda-1.847
eca-1.808
Token take
Token ID1011
Feature activation-1.16e+00

Pos logit contributions
theless+4.730
ammu+3.761
ayer+3.578
emo+3.488
 Gors+3.468
Neg logit contributions
ourke-3.960
versions-3.640
acters-3.245
onda-3.191
eca-3.176
Token each
Token ID1123
Feature activation-4.42e-01

Pos logit contributions
theless+9.331
ammu+7.013
ayer+6.657
 Gors+6.471
ellen+6.342
Neg logit contributions
ourke-14.057
versions-12.934
escription-12.920
orem-12.604
onde-12.587
Token week
Token ID1285
Feature activation-4.79e-01

Pos logit contributions
theless+5.265
ammu+4.209
ayer+3.979
emo+3.927
 Gors+3.779
Neg logit contributions
ourke-5.111
versions-4.612
acters-4.321
ggle-4.194
eca-4.189
Token policy
Token ID2450
Feature activation-2.37e-01

Pos logit contributions
theless+2.155
ammu+1.649
ーテ+1.583
 Lauder+1.562
 Gors+1.553
Neg logit contributions
ourke-1.818
versions-1.643
acters-1.495
onda-1.458
eca-1.449
Token has
Token ID468
Feature activation-1.25e-02

Pos logit contributions
theless+3.030
ammu+2.260
ーテ+2.160
 Lauder+2.133
emo+2.130
Neg logit contributions
ourke-2.826
versions-2.562
acters-2.339
onda-2.284
eca-2.273
Token now
Token ID783
Feature activation+1.68e-01

Pos logit contributions
theless+0.173
ammu+0.133
ーテ+0.129
 Lauder+0.126
emo+0.126
Neg logit contributions
ourke-0.136
versions-0.124
acters-0.110
onda-0.108
eca-0.107
Token been
Token ID587
Feature activation-1.11e-01

Pos logit contributions
ourke+1.719
versions+1.594
acters+1.395
onda+1.374
eca+1.366
Neg logit contributions
theless-2.373
ammu-1.865
ーテ-1.812
ayer-1.753
 Lauder-1.752
Token well
Token ID880
Feature activation-2.75e-01

Pos logit contributions
theless+1.423
ammu+1.071
ーテ+1.024
 Lauder+1.004
ayer+1.001
Neg logit contributions
ourke-1.320
versions-1.211
acters-1.096
onda-1.076
eca-1.065
Token te
Token ID573
Feature activation-9.88e-01

Pos logit contributions
theless+3.575
ammu+2.699
ーテ+2.603
ayer+2.569
ellen+2.548
Neg logit contributions
ourke-3.232
versions-2.916
acters-2.666
onda-2.605
eca-2.566
Tokenleg
Token ID1455
Feature activation-2.27e-01

Pos logit contributions
theless+7.673
emo+6.051
ammu+5.887
ayer+5.381
ween+5.151
Neg logit contributions
ourke-13.774
versions-12.590
acters-11.714
onda-11.569
ggle-11.287
Tokenrap
Token ID2416
Feature activation-1.95e-01

Pos logit contributions
theless+2.646
ammu+1.952
emo+1.815
ayer+1.793
ーテ+1.789
Neg logit contributions
ourke-2.954
versions-2.738
acters-2.521
ggle-2.432
onda-2.424
Tokenhed
Token ID704
Feature activation+1.86e-03

Pos logit contributions
theless+2.867
ammu+2.263
ーテ+2.152
emo+2.151
ayer+2.143
Neg logit contributions
ourke-1.931
versions-1.748
acters-1.538
onda-1.487
ggle-1.485
Token and
Token ID290
Feature activation-1.06e-01

Pos logit contributions
ourke+0.021
versions+0.019
acters+0.017
onda+0.016
eca+0.016
Neg logit contributions
theless-0.026
ammu-0.020
ーテ-0.019
 Lauder-0.019
emo-0.018
Token will
Token ID481
Feature activation-1.31e-01

Pos logit contributions
theless+1.394
ammu+1.050
ーテ+1.011
 Lauder+0.997
ayer+0.990
Neg logit contributions
ourke-1.223
versions-1.115
acters-1.007
onda-0.985
eca-0.980
Token,
Token ID11
Feature activation-1.42e-01

Pos logit contributions
ourke+1.399
versions+1.277
acters+1.132
onda+1.115
eca+1.101
Neg logit contributions
theless-1.819
ammu-1.403
ーテ-1.355
 Lauder-1.327
emo-1.319
Token B
Token ID347
Feature activation-3.24e-01

Pos logit contributions
theless+1.834
ammu+1.391
ーテ+1.323
 Lauder+1.319
 Gors+1.314
Neg logit contributions
ourke-1.658
versions-1.504
acters-1.376
onda-1.336
eca-1.323
Tokenaid
Token ID1698
Feature activation-2.43e-01

Pos logit contributions
theless+3.791
ammu+2.973
ayer+2.809
emo+2.741
ellen+2.665
Neg logit contributions
ourke-4.000
versions-3.725
acters-3.357
onda-3.257
eca-3.236
Tokenu
Token ID84
Feature activation+1.65e-01

Pos logit contributions
theless+2.695
ammu+1.960
emo+1.866
ayer+1.853
ーテ+1.802
Neg logit contributions
ourke-3.294
versions-3.085
acters-2.745
ggle-2.703
onda-2.685
Token and
Token ID290
Feature activation-1.79e-01

Pos logit contributions
ourke+1.754
versions+1.593
acters+1.409
onda+1.389
eca+1.374
Neg logit contributions
theless-2.342
ammu-1.809
ーテ-1.752
 Lauder-1.711
 Gors-1.701
Token Y
Token ID575
Feature activation-9.87e-01

Pos logit contributions
theless+2.303
ammu+1.729
 Lauder+1.648
ーテ+1.646
 Gors+1.641
Neg logit contributions
ourke-2.115
versions-1.921
acters-1.756
onda-1.709
eca-1.687
Tokenand
Token ID392
Feature activation+1.35e-01

Pos logit contributions
theless+7.989
ammu+6.175
ellen+6.132
ayer+5.944
Berry+5.511
Neg logit contributions
ourke-13.569
versions-12.869
acters-11.961
eca-11.610
milo-11.560
Tokenex
Token ID1069
Feature activation+4.33e-02

Pos logit contributions
ourke+1.336
versions+1.201
eca+1.090
onda+1.073
acters+1.062
Neg logit contributions
theless-1.966
ammu-1.542
ーテ-1.528
 Lauder-1.485
 Gors-1.463
Token.
Token ID13
Feature activation+2.39e-01

Pos logit contributions
ourke+0.470
versions+0.429
acters+0.381
onda+0.375
eca+0.371
Neg logit contributions
theless-0.602
ammu-0.464
ーテ-0.448
 Lauder-0.438
emo-0.436
Token When
Token ID1649
Feature activation+2.71e-02

Pos logit contributions
ourke+2.580
versions+2.416
acters+2.096
onda+2.075
eca+2.062
Neg logit contributions
theless-3.285
ammu-2.514
ーテ-2.413
ellen-2.357
emo-2.335
Token you
Token ID345
Feature activation+2.80e-02

Pos logit contributions
ourke+0.306
versions+0.280
acters+0.250
onda+0.246
eca+0.243
Neg logit contributions
theless-0.364
ammu-0.278
ーテ-0.268
 Lauder-0.262
emo-0.261
Token.
Token ID13
Feature activation-1.73e-01

Pos logit contributions
ourke+1.123
versions+1.030
acters+0.898
eca+0.879
onda+0.878
Neg logit contributions
theless-1.693
ーテ-1.302
ammu-1.288
 Lauder-1.235
 Gors-1.231
Tokenx
Token ID87
Feature activation+1.26e-01

Pos logit contributions
theless+2.129
ammu+1.611
ーテ+1.506
emo+1.504
ayer+1.497
Neg logit contributions
ourke-2.116
versions-1.942
acters-1.768
onda-1.735
eca-1.711
Token.
Token ID13
Feature activation-7.49e-02

Pos logit contributions
ourke+1.261
versions+1.158
acters+1.001
onda+0.997
eca+0.988
Neg logit contributions
theless-1.900
ーテ-1.466
ammu-1.448
 Lauder-1.394
 Gors-1.383
Token104
Token ID13464
Feature activation-1.52e-01

Pos logit contributions
theless+1.015
ammu+0.778
ーテ+0.740
emo+0.737
 Lauder+0.732
Neg logit contributions
ourke-0.838
versions-0.762
acters-0.688
onda-0.669
eca-0.660
Token 10
Token ID838
Feature activation+1.85e-01

Pos logit contributions
theless+1.877
ammu+1.395
ーテ+1.329
 Lauder+1.317
 Gors+1.308
Neg logit contributions
ourke-1.870
versions-1.711
acters-1.550
onda-1.524
eca-1.513
Token446
Token ID27260
Feature activation+6.91e-01

Pos logit contributions
ourke+2.108
versions+1.952
acters+1.746
eca+1.744
onda+1.711
Neg logit contributions
theless-2.457
ammu-1.857
ーテ-1.809
ayer-1.749
 Lauder-1.743
Token01
Token ID486
Feature activation-5.19e-01

Pos logit contributions
ourke+5.245
versions+4.760
eca+4.105
acters+4.005
ington+3.944
Neg logit contributions
theless-10.961
ーテ-8.959
etheless-8.702
ammu-8.600
ayer-8.336
Token x
Token ID2124
Feature activation+1.95e-01

Pos logit contributions
theless+3.507
ammu+2.135
 Lauder+1.926
 Gors+1.867
emo+1.843
Neg logit contributions
ourke-8.749
versions-8.259
acters-7.734
onda-7.622
ggle-7.569
Token.
Token ID13
Feature activation-2.25e-01

Pos logit contributions
ourke+1.855
versions+1.699
acters+1.471
eca+1.443
yg+1.429
Neg logit contributions
theless-3.028
ーテ-2.350
ammu-2.304
etheless-2.225
 Gors-2.217
Tokenx
Token ID87
Feature activation+1.94e-01

Pos logit contributions
theless+2.695
ammu+2.045
emo+1.912
ayer+1.896
ellen+1.888
Neg logit contributions
ourke-2.791
versions-2.557
acters-2.340
onda-2.300
ggle-2.258
Token.
Token ID13
Feature activation-6.67e-02

Pos logit contributions
ourke+1.859
versions+1.706
acters+1.472
eca+1.449
onda+1.448
Neg logit contributions
theless-3.001
ーテ-2.338
ammu-2.274
 Lauder-2.201
etheless-2.191
Token high
Token ID1029
Feature activation-1.11e-01

Pos logit contributions
theless+2.163
ammu+1.652
ーテ+1.570
 Lauder+1.553
 Gors+1.547
Neg logit contributions
ourke-1.859
versions-1.679
acters-1.534
onda-1.510
eca-1.486
Token-
Token ID12
Feature activation-7.64e-02

Pos logit contributions
theless+1.208
ammu+0.860
ーテ+0.811
 Lauder+0.792
 Gors+0.787
Neg logit contributions
ourke-1.532
versions-1.421
acters-1.307
onda-1.289
eca-1.277
Tokenlevel
Token ID5715
Feature activation+1.06e-01

Pos logit contributions
theless+0.922
ammu+0.680
ーテ+0.649
 Lauder+0.634
emo+0.631
Neg logit contributions
ourke-0.967
versions-0.894
acters-0.812
onda-0.799
eca-0.792
Token of
Token ID286
Feature activation+1.49e-02

Pos logit contributions
ourke+1.172
versions+1.082
acters+0.949
onda+0.937
eca+0.929
Neg logit contributions
theless-1.449
ammu-1.109
ーテ-1.083
emo-1.047
 Lauder-1.045
Token intellectual
Token ID9028
Feature activation-3.16e-01

Pos logit contributions
ourke+0.156
versions+0.142
acters+0.125
onda+0.123
eca+0.122
Neg logit contributions
theless-0.212
ammu-0.164
ーテ-0.159
 Lauder-0.156
emo-0.155
Token rig
Token ID7805
Feature activation+9.13e-01

Pos logit contributions
theless+4.086
ammu+3.143
ーテ+2.962
 Lauder+2.961
 Gors+2.935
Neg logit contributions
ourke-3.631
versions-3.260
acters-2.968
onda-2.910
eca-2.902
Tokenor
Token ID273
Feature activation-1.70e-01

Pos logit contributions
ourke+5.902
versions+5.026
onde+3.970
eca+3.966
orem+3.933
Neg logit contributions
theless-16.178
 Lauder-12.285
ーテ-12.233
 Gors-12.146
ammu-12.047
Token.
Token ID13
Feature activation+3.80e-02

Pos logit contributions
theless+2.294
ammu+1.763
ーテ+1.684
ayer+1.665
 Lauder+1.659
Neg logit contributions
ourke-1.898
versions-1.734
acters-1.557
onda-1.521
eca-1.506
Token But
Token ID887
Feature activation+2.79e-03

Pos logit contributions
ourke+0.448
versions+0.412
acters+0.370
onda+0.364
eca+0.364
Neg logit contributions
theless-0.493
ammu-0.370
ーテ-0.356
emo-0.348
ayer-0.346
Token they
Token ID484
Feature activation-1.99e-01

Pos logit contributions
ourke+0.032
versions+0.029
acters+0.026
onda+0.025
eca+0.025
Neg logit contributions
theless-0.037
ammu-0.028
ーテ-0.027
 Lauder-0.027
emo-0.027
Token surrounded
Token ID11191
Feature activation-2.35e-01

Pos logit contributions
theless+2.557
ammu+1.911
ーテ+1.799
ayer+1.799
 Lauder+1.798
Neg logit contributions
ourke-2.363
versions-2.171
acters-1.964
onda-1.924
eca-1.900
Token said
Token ID531
Feature activation+2.66e-01

Pos logit contributions
ourke+2.028
versions+1.850
acters+1.656
onda+1.642
eca+1.622
Neg logit contributions
theless-2.381
ammu-1.814
ーテ-1.756
 Lauder-1.695
emo-1.693
Token the
Token ID262
Feature activation+1.81e-01

Pos logit contributions
ourke+2.865
versions+2.596
acters+2.300
onda+2.268
eca+2.232
Neg logit contributions
theless-3.717
ammu-2.885
ーテ-2.773
emo-2.705
ellen-2.678
Token size
Token ID2546
Feature activation+1.76e-01

Pos logit contributions
ourke+1.961
versions+1.817
acters+1.582
eca+1.551
onda+1.549
Neg logit contributions
theless-2.517
ammu-1.946
ーテ-1.877
emo-1.815
 Lauder-1.810
Token of
Token ID286
Feature activation+7.87e-02

Pos logit contributions
ourke+1.884
versions+1.721
acters+1.526
onda+1.499
eca+1.478
Neg logit contributions
theless-2.475
ammu-1.911
ーテ-1.845
 Lauder-1.805
emo-1.802
Token the
Token ID262
Feature activation+1.41e-01

Pos logit contributions
ourke+0.863
versions+0.790
acters+0.701
onda+0.689
eca+0.681
Neg logit contributions
theless-1.084
ammu-0.836
ーテ-0.806
 Lauder-0.786
emo-0.785
Token bonus
Token ID7202
Feature activation+7.47e-01

Pos logit contributions
ourke+1.574
versions+1.455
acters+1.288
onda+1.267
eca+1.255
Neg logit contributions
theless-1.898
ammu-1.454
ーテ-1.408
emo-1.355
ellen-1.352
Token pool
Token ID5933
Feature activation+3.36e-01

Pos logit contributions
ourke+6.691
versions+6.568
acters+5.628
ggle+5.384
ington+5.306
Neg logit contributions
theless-10.489
ammu-8.447
ーテ-8.244
emo-7.705
ellen-7.704
Token each
Token ID1123
Feature activation+2.87e-01

Pos logit contributions
ourke+3.498
versions+3.237
acters+2.843
onda+2.797
eca+2.757
Neg logit contributions
theless-4.690
ammu-3.644
ーテ-3.527
emo-3.435
 Lauder-3.392
Token year
Token ID614
Feature activation+2.43e-01

Pos logit contributions
ourke+2.693
versions+2.449
acters+2.077
onda+2.065
eca+2.043
Neg logit contributions
theless-4.370
ammu-3.453
ーテ-3.348
emo-3.274
 Gors-3.239
Token has
Token ID468
Feature activation-9.15e-02

Pos logit contributions
ourke+2.593
versions+2.384
acters+2.108
onda+2.082
eca+2.039
Neg logit contributions
theless-3.365
ammu-2.606
ーテ-2.530
emo-2.464
 Lauder-2.446
Token historically
Token ID15074
Feature activation+6.17e-02

Pos logit contributions
theless+1.240
ammu+0.948
ーテ+0.911
 Lauder+0.900
 Gors+0.893
Neg logit contributions
ourke-1.023
versions-0.932
acters-0.831
onda-0.818
eca-0.806
Token.
Token ID13
Feature activation+9.57e-02

Pos logit contributions
ourke+3.408
versions+3.139
acters+2.805
onda+2.704
eca+2.690
Neg logit contributions
theless-4.905
ammu-3.826
ーテ-3.722
 Gors-3.678
 Lauder-3.627
Token
Token ID198
Feature activation-1.05e-01

Pos logit contributions
ourke+1.077
versions+0.998
acters+0.888
onda+0.866
eca+0.864
Neg logit contributions
theless-1.279
ammu-0.978
ーテ-0.945
emo-0.917
 Lauder-0.915
Token
Token ID198
Feature activation+3.13e-01

Pos logit contributions
theless+1.348
ammu+1.022
ーテ+0.972
 Lauder+0.959
ayer+0.955
Neg logit contributions
ourke-1.233
versions-1.130
acters-1.018
onda-1.003
eca-0.991
Tokenshould
Token ID21754
Feature activation+1.10e-01

Pos logit contributions
ourke+3.403
versions+3.153
acters+2.760
onda+2.680
eca+2.667
Neg logit contributions
theless-4.316
ammu-3.286
ーテ-3.254
 Lauder-3.166
 Gors-3.150
Token be
Token ID307
Feature activation-1.84e-01

Pos logit contributions
ourke+1.277
versions+1.172
acters+1.049
onda+1.033
eca+1.022
Neg logit contributions
theless-1.451
ammu-1.101
ーテ-1.059
 Lauder-1.037
ayer-1.032
Token clone
Token ID17271
Feature activation+8.29e-01

Pos logit contributions
theless+2.750
ammu+2.158
 Lauder+2.092
ーテ+2.088
ayer+2.069
Neg logit contributions
ourke-1.757
versions-1.590
acters-1.373
onda-1.365
eca-1.340
Tokenable
Token ID540
Feature activation-1.27e-01

Pos logit contributions
ourke+4.854
versions+4.730
acters+3.478
eca+3.396
usable+3.375
Neg logit contributions
theless-14.649
ーテ-12.196
 pione-11.955
ammu-11.808
 Gors-11.613
Token,
Token ID11
Feature activation+2.37e-01

Pos logit contributions
theless+1.734
ammu+1.337
ーテ+1.281
 Lauder+1.264
emo+1.258
Neg logit contributions
ourke-1.387
versions-1.265
acters-1.127
onda-1.109
eca-1.095
Token and
Token ID290
Feature activation+1.66e-01

Pos logit contributions
ourke+2.247
versions+2.053
acters+1.766
onda+1.736
eca+1.720
Neg logit contributions
theless-3.633
ーテ-2.819
ammu-2.805
 Lauder-2.683
emo-2.664
Token those
Token ID883
Feature activation+9.69e-02

Pos logit contributions
ourke+1.705
versions+1.571
acters+1.391
onda+1.341
eca+1.326
Neg logit contributions
theless-2.396
ammu-1.869
ーテ-1.803
emo-1.772
 Lauder-1.766
Token clones
Token ID32498
Feature activation-1.86e-01

Pos logit contributions
ourke+0.910
versions+0.833
acters+0.720
onda+0.693
eca+0.689
Neg logit contributions
theless-1.482
ammu-1.175
ーテ-1.144
emo-1.118
 Lauder-1.117
Token it
Token ID340
Feature activation+1.33e-01

Pos logit contributions
ourke+2.240
versions+2.054
acters+1.805
onda+1.798
eca+1.760
Neg logit contributions
theless-2.962
ammu-2.281
ーテ-2.202
 Lauder-2.144
 Gors-2.132
Token time
Token ID640
Feature activation+2.53e-01

Pos logit contributions
ourke+1.418
versions+1.297
acters+1.144
onda+1.129
eca+1.112
Neg logit contributions
theless-1.850
ammu-1.442
ーテ-1.401
 Lauder-1.348
ayer-1.346
Token for
Token ID329
Feature activation-6.46e-02

Pos logit contributions
ourke+2.721
versions+2.475
acters+2.212
onda+2.172
eca+2.148
Neg logit contributions
theless-3.515
ammu-2.710
ーテ-2.614
 Lauder-2.549
 Gors-2.532
Token us
Token ID514
Feature activation+6.64e-02

Pos logit contributions
theless+0.876
ammu+0.672
ーテ+0.643
 Lauder+0.637
emo+0.634
Neg logit contributions
ourke-0.722
versions-0.656
acters-0.589
onda-0.577
eca-0.570
Token to
Token ID284
Feature activation-1.92e-01

Pos logit contributions
ourke+0.758
versions+0.695
acters+0.621
onda+0.611
eca+0.604
Neg logit contributions
theless-0.883
ammu-0.674
ーテ-0.648
 Lauder-0.635
ayer-0.632
Token go
Token ID467
Feature activation+7.11e-01

Pos logit contributions
theless+2.533
ammu+1.942
ーテ+1.827
ayer+1.824
emo+1.823
Neg logit contributions
ourke-2.204
versions-2.003
acters-1.794
onda-1.774
eca-1.743
Token?
Token ID30
Feature activation-1.77e-01

Pos logit contributions
ourke+6.643
versions+6.213
ggle+5.689
onda+5.507
eca+5.442
Neg logit contributions
theless-9.774
ammu-7.524
ーテ-7.520
ellen-7.284
 Lauder-7.223
Token
Token ID198
Feature activation-3.56e-01

Pos logit contributions
theless+2.189
ammu+1.649
ーテ+1.575
 Lauder+1.564
 Gors+1.558
Neg logit contributions
ourke-2.165
versions-1.989
acters-1.809
onda-1.768
eca-1.745
Token
Token ID198
Feature activation-1.16e-02

Pos logit contributions
theless+3.931
ammu+3.018
ayer+2.796
emo+2.785
ellen+2.766
Neg logit contributions
ourke-4.594
versions-4.172
acters-3.867
onda-3.808
ggle-3.729
TokenEll
Token ID30639
Feature activation-6.82e-02

Pos logit contributions
theless+0.155
ammu+0.118
ーテ+0.114
 Lauder+0.111
emo+0.111
Neg logit contributions
ourke-0.132
versions-0.121
acters-0.108
onda-0.107
eca-0.105
Tokenen
Token ID268
Feature activation-2.17e-01

Pos logit contributions
theless+1.023
ammu+0.811
ーテ+0.778
ayer+0.772
 Lauder+0.771
Neg logit contributions
ourke-0.657
versions-0.594
acters-0.520
onda-0.505
eca-0.501
Token He
Token ID679
Feature activation+1.26e-02

Pos logit contributions
theless+0.409
ammu+0.309
ーテ+0.297
 Lauder+0.290
emo+0.289
Neg logit contributions
ourke-0.367
versions-0.337
acters-0.303
onda-0.298
eca-0.295
Token beats
Token ID17825
Feature activation-1.40e-01

Pos logit contributions
ourke+0.146
versions+0.134
acters+0.120
onda+0.118
eca+0.117
Neg logit contributions
theless-0.165
ammu-0.125
ーテ-0.120
 Lauder-0.118
emo-0.117
Token opponents
Token ID7691
Feature activation-3.87e-01

Pos logit contributions
theless+1.767
ammu+1.337
ーテ+1.262
 Gors+1.254
ayer+1.254
Neg logit contributions
ourke-1.670
versions-1.525
acters-1.379
onda-1.354
eca-1.339
Token physically
Token ID10170
Feature activation+1.45e-01

Pos logit contributions
theless+4.786
ammu+3.661
ayer+3.478
emo+3.414
 Gors+3.411
Neg logit contributions
ourke-4.527
versions-4.154
acters-3.756
onda-3.659
ggle-3.640
Token and
Token ID290
Feature activation-1.25e-01

Pos logit contributions
ourke+1.512
versions+1.380
acters+1.215
onda+1.210
eca+1.186
Neg logit contributions
theless-2.069
ammu-1.605
ーテ-1.568
 Lauder-1.521
emo-1.513
Token gets
Token ID3011
Feature activation+1.01e-01

Pos logit contributions
theless+1.640
ammu+1.235
ーテ+1.177
 Lauder+1.167
 Gors+1.163
Neg logit contributions
ourke-1.453
versions-1.331
acters-1.195
onda-1.184
eca-1.163
Token under
Token ID739
Feature activation-7.10e-01

Pos logit contributions
ourke+1.090
versions+1.000
acters+0.889
onda+0.870
eca+0.861
Neg logit contributions
theless-1.399
ammu-1.077
ーテ-1.051
 Lauder-1.021
ellen-1.014
Token their
Token ID511
Feature activation-5.45e-01

Pos logit contributions
theless+7.595
ammu+5.904
 Gors+5.491
 Lauder+5.435
ayer+5.337
Neg logit contributions
ourke-8.468
versions-7.587
ggle-7.189
acters-7.054
onda-6.980
Token skin
Token ID4168
Feature activation-1.41e-01

Pos logit contributions
theless+4.823
ammu+3.194
 Lauder+3.146
 Gors+2.952
ayer+2.849
Neg logit contributions
ourke-8.148
versions-7.420
onda-6.964
eca-6.946
ggle-6.867
Token between
Token ID1022
Feature activation-3.63e-02

Pos logit contributions
theless+1.875
ammu+1.429
ーテ+1.355
ayer+1.347
 Lauder+1.347
Neg logit contributions
ourke-1.613
versions-1.464
acters-1.320
onda-1.289
eca-1.276
Token plays
Token ID5341
Feature activation-2.11e-01

Pos logit contributions
theless+0.504
ammu+0.390
ーテ+0.373
 Lauder+0.369
emo+0.366
Neg logit contributions
ourke-0.394
versions-0.356
acters-0.318
onda-0.314
eca-0.309
Token [
Token ID685
Feature activation+1.30e-01

Pos logit contributions
theless+0.739
ammu+0.563
ーテ+0.540
 Lauder+0.532
ayer+0.528
Neg logit contributions
ourke-0.633
versions-0.581
acters-0.518
onda-0.510
eca-0.505
Tokensince
Token ID20777
Feature activation+1.28e-02

Pos logit contributions
ourke+1.368
versions+1.255
acters+1.098
onda+1.086
eca+1.071
Neg logit contributions
theless-1.835
ammu-1.424
ーテ-1.388
 Lauder-1.355
 Gors-1.348
Token the
Token ID262
Feature activation-2.57e-01

Pos logit contributions
ourke+0.139
versions+0.127
acters+0.113
onda+0.111
eca+0.110
Neg logit contributions
theless-0.176
ammu-0.136
ーテ-0.131
 Lauder-0.128
emo-0.128
Token trade
Token ID3292
Feature activation+3.68e-01

Pos logit contributions
theless+2.975
ammu+2.162
 Gors+2.057
emo+2.026
ーテ+2.019
Neg logit contributions
ourke-3.297
versions-3.059
acters-2.784
onda-2.734
ggle-2.727
Token],
Token ID4357
Feature activation-3.40e-02

Pos logit contributions
ourke+3.874
versions+3.571
acters+3.149
onda+3.129
eca+3.121
Neg logit contributions
theless-5.067
ammu-3.946
ーテ-3.798
 Lauder-3.705
ayer-3.686
Token going
Token ID1016
Feature activation+2.27e-01

Pos logit contributions
theless+0.446
ammu+0.338
ーテ+0.324
 Lauder+0.319
 Gors+0.317
Neg logit contributions
ourke-0.395
versions-0.362
acters-0.325
onda-0.319
eca-0.316
Token back
Token ID736
Feature activation+1.21e-01

Pos logit contributions
ourke+2.439
versions+2.243
onda+1.990
acters+1.978
ggle+1.978
Neg logit contributions
theless-3.141
ammu-2.405
ーテ-2.344
emo-2.285
 Lauder-2.274
Token there
Token ID612
Feature activation+1.14e-01

Pos logit contributions
ourke+1.315
versions+1.202
acters+1.065
onda+1.059
eca+1.039
Neg logit contributions
theless-1.686
ammu-1.294
ーテ-1.255
 Lauder-1.227
emo-1.222
Token,
Token ID11
Feature activation+1.14e-01

Pos logit contributions
ourke+1.211
versions+1.104
acters+0.974
onda+0.968
eca+0.952
Neg logit contributions
theless-1.596
ammu-1.234
ーテ-1.197
 Lauder-1.167
emo-1.165
Token but
Token ID475
Feature activation+1.24e-01

Pos logit contributions
ourke+1.226
versions+1.121
acters+0.992
onda+0.983
eca+0.971
Neg logit contributions
theless-1.587
ammu-1.222
ーテ-1.185
 Lauder-1.153
emo-1.150
Token I
Token ID314
Feature activation+1.69e-02

Pos logit contributions
ourke+1.342
versions+1.227
acters+1.091
onda+1.074
eca+1.066
Neg logit contributions
theless-1.723
ammu-1.332
ーテ-1.292
 Lauder-1.253
emo-1.250
Token against
Token ID1028
Feature activation+3.20e-01

Pos logit contributions
ourke+1.809
versions+1.669
acters+1.475
onda+1.458
ggle+1.431
Neg logit contributions
theless-2.279
ammu-1.752
ーテ-1.701
 Lauder-1.653
emo-1.647
Token Lenin
Token ID22899
Feature activation+2.48e-01

Pos logit contributions
ourke+3.320
versions+3.069
onda+2.675
acters+2.668
eca+2.648
Neg logit contributions
theless-4.438
ammu-3.448
ーテ-3.351
ellen-3.274
ayer-3.237
Token and
Token ID290
Feature activation+1.07e-01

Pos logit contributions
ourke+2.876
versions+2.648
acters+2.374
onda+2.338
eca+2.316
Neg logit contributions
theless-3.206
ammu-2.416
ーテ-2.335
emo-2.273
 Lauder-2.272
Token Trotsky
Token ID32706
Feature activation-9.88e-03

Pos logit contributions
ourke+1.208
versions+1.109
acters+0.988
onda+0.974
eca+0.966
Neg logit contributions
theless-1.434
ammu-1.098
ーテ-1.053
 Lauder-1.030
emo-1.026
Token,
Token ID11
Feature activation+2.70e-01

Pos logit contributions
theless+0.135
ammu+0.104
ーテ+0.100
 Lauder+0.098
emo+0.098
Neg logit contributions
ourke-0.109
versions-0.100
acters-0.089
onda-0.087
eca-0.086
Token
Token ID198
Feature activation+1.55e-01

Pos logit contributions
ourke+2.877
versions+2.643
acters+2.299
onda+2.296
eca+2.268
Neg logit contributions
theless-3.732
ammu-2.898
ーテ-2.804
emo-2.733
ellen-2.711
Token
Token ID198
Feature activation+3.94e-01

Pos logit contributions
ourke+1.622
versions+1.495
acters+1.303
eca+1.277
onda+1.273
Neg logit contributions
theless-2.255
ーテ-1.737
ammu-1.707
 Lauder-1.635
 Gors-1.633
Tokenfor
Token ID1640
Feature activation+2.43e-01

Pos logit contributions
ourke+4.520
versions+4.218
eca+3.648
onda+3.634
acters+3.634
Neg logit contributions
theless-5.081
ーテ-3.816
ammu-3.784
 Lauder-3.667
 Gors-3.650
Token worldwide
Token ID8688
Feature activation+2.71e-01

Pos logit contributions
ourke+2.589
versions+2.392
onda+2.087
acters+2.086
eca+2.067
Neg logit contributions
theless-3.337
ammu-2.593
ーテ-2.509
 Lauder-2.448
ellen-2.445
Token anti
Token ID3098
Feature activation+2.75e-01

Pos logit contributions
ourke+2.783
versions+2.585
acters+2.243
onda+2.207
ggle+2.207
Neg logit contributions
theless-3.809
ammu-2.958
ーテ-2.880
ellen-2.832
emo-2.807
Token-
Token ID12
Feature activation-3.96e-01

Pos logit contributions
ourke+2.590
versions+2.353
acters+2.043
onda+2.000
eca+1.977
Neg logit contributions
theless-4.178
ammu-3.275
ーテ-3.172
 Lauder-3.136
 Gors-3.116
Token,
Token ID11
Feature activation-6.12e-02

Pos logit contributions
theless+0.103
ammu+0.078
ーテ+0.075
 Lauder+0.073
emo+0.073
Neg logit contributions
ourke-0.089
versions-0.082
acters-0.073
onda-0.072
eca-0.071
Token
Token ID447
Feature activation-6.70e-02

Pos logit contributions
theless+0.770
ammu+0.578
ーテ+0.554
 Lauder+0.542
 Gors+0.539
Neg logit contributions
ourke-0.740
versions-0.682
acters-0.615
onda-0.604
eca-0.599
Token
Token ID251
Feature activation-5.28e-02

Pos logit contributions
theless+1.064
ammu+0.854
ーテ+0.828
 Lauder+0.814
ayer+0.810
Neg logit contributions
ourke-0.592
versions-0.528
acters-0.454
onda-0.443
eca-0.438
Token which
Token ID543
Feature activation-1.72e-01

Pos logit contributions
theless+0.699
ammu+0.533
ーテ+0.512
 Lauder+0.501
emo+0.499
Neg logit contributions
ourke-0.605
versions-0.554
acters-0.496
onda-0.488
eca-0.484
Token is
Token ID318
Feature activation-3.50e-01

Pos logit contributions
theless+2.252
ammu+1.712
ーテ+1.633
 Lauder+1.619
 Gors+1.612
Neg logit contributions
ourke-1.974
versions-1.810
acters-1.614
onda-1.591
eca-1.586
Token his
Token ID465
Feature activation+1.06e-01

Pos logit contributions
theless+4.442
ammu+3.357
 Gors+3.189
ーテ+3.189
 Lauder+3.160
Neg logit contributions
ourke-4.067
versions-3.710
acters-3.352
eca-3.299
onda-3.294
Token welcoming
Token ID22516
Feature activation-1.96e-03

Pos logit contributions
ourke+1.153
versions+1.063
acters+0.943
onda+0.925
eca+0.914
Neg logit contributions
theless-1.455
ammu-1.116
ーテ-1.077
 Lauder-1.057
ayer-1.047
Token song
Token ID3496
Feature activation+2.49e-01

Pos logit contributions
theless+0.027
ammu+0.021
ーテ+0.020
 Lauder+0.019
emo+0.019
Neg logit contributions
ourke-0.022
versions-0.020
acters-0.018
onda-0.017
eca-0.017
Token to
Token ID284
Feature activation+2.75e-02

Pos logit contributions
ourke+2.570
versions+2.374
acters+2.099
onda+2.035
eca+2.015
Neg logit contributions
theless-3.562
ammu-2.745
ーテ-2.656
 Lauder-2.620
 Gors-2.583
Token the
Token ID262
Feature activation+1.40e-01

Pos logit contributions
ourke+0.310
versions+0.284
acters+0.254
onda+0.250
eca+0.247
Neg logit contributions
theless-0.371
ammu-0.283
ーテ-0.273
 Lauder-0.267
emo-0.266
Token kids
Token ID3988
Feature activation+1.03e-01

Pos logit contributions
ourke+1.572
versions+1.444
acters+1.297
onda+1.270
eca+1.251
Neg logit contributions
theless-1.877
ammu-1.434
ーテ-1.388
 Lauder-1.347
ayer-1.347
Token,
Token ID11
Feature activation-1.63e-01

Pos logit contributions
ourke+0.463
versions+0.424
acters+0.378
onda+0.372
eca+0.368
Neg logit contributions
theless-0.560
ammu-0.428
ーテ-0.413
 Lauder-0.404
emo-0.402
Token the
Token ID262
Feature activation+3.35e-02

Pos logit contributions
theless+2.161
ammu+1.644
ーテ+1.574
 Lauder+1.541
emo+1.539
Neg logit contributions
ourke-1.883
versions-1.707
acters-1.539
onda-1.506
eca-1.502
Token Store
Token ID9363
Feature activation-3.94e-02

Pos logit contributions
ourke+0.378
versions+0.348
acters+0.310
onda+0.305
eca+0.302
Neg logit contributions
theless-0.449
ammu-0.342
ーテ-0.330
 Lauder-0.323
emo-0.321
Token!
Token ID0
Feature activation+4.40e-02

Pos logit contributions
theless+0.531
ammu+0.409
ーテ+0.392
 Lauder+0.385
emo+0.384
Neg logit contributions
ourke-0.441
versions-0.401
acters-0.359
onda-0.353
eca-0.348
Token
Token ID198
Feature activation-1.31e-02

Pos logit contributions
ourke+0.506
versions+0.465
acters+0.415
onda+0.409
eca+0.405
Neg logit contributions
theless-0.582
ammu-0.442
ーテ-0.425
 Lauder-0.416
emo-0.414
Token
Token ID198
Feature activation+1.76e-01

Pos logit contributions
theless+0.177
ammu+0.134
ーテ+0.131
 Lauder+0.127
 Gors+0.126
Neg logit contributions
ourke-0.148
versions-0.136
acters-0.121
onda-0.119
eca-0.118
TokenThere
Token ID1858
Feature activation+7.90e-02

Pos logit contributions
ourke+2.055
versions+1.900
acters+1.683
onda+1.662
eca+1.656
Neg logit contributions
theless-2.325
ammu-1.715
ーテ-1.705
 Lauder-1.650
 Gors-1.634
Token
Token ID447
Feature activation+1.95e-01

Pos logit contributions
ourke+0.992
versions+0.917
acters+0.829
onda+0.820
eca+0.809
Neg logit contributions
theless-0.959
ammu-0.707
ーテ-0.683
 Lauder-0.664
emo-0.662
Token
Token ID247
Feature activation+1.69e-02

Pos logit contributions
ourke+2.577
versions+2.392
ggle+2.178
onda+2.175
acters+2.172
Neg logit contributions
theless-2.261
ammu-1.585
ーテ-1.521
 Gors-1.483
 Lauder-1.482
Tokens
Token ID82
Feature activation-7.81e-02

Pos logit contributions
ourke+0.153
versions+0.137
acters+0.118
onda+0.115
eca+0.114
Neg logit contributions
theless-0.264
ammu-0.211
ーテ-0.204
 Lauder-0.201
emo-0.200
Token a
Token ID257
Feature activation-1.45e-03

Pos logit contributions
theless+1.051
ammu+0.805
ーテ+0.771
 Lauder+0.754
ayer+0.753
Neg logit contributions
ourke-0.883
versions-0.805
acters-0.722
onda-0.705
eca-0.698
Token does
Token ID857
Feature activation-4.39e-02

Pos logit contributions
theless+2.778
ammu+2.081
ーテ+1.966
ayer+1.962
 Lauder+1.953
Neg logit contributions
ourke-2.616
versions-2.395
acters-2.161
onda-2.150
eca-2.110
Token not
Token ID407
Feature activation+1.13e-01

Pos logit contributions
theless+0.559
ammu+0.421
ーテ+0.402
ayer+0.395
 Lauder+0.394
Neg logit contributions
ourke-0.525
versions-0.482
acters-0.434
onda-0.428
eca-0.424
Token solely
Token ID9944
Feature activation-3.76e-03

Pos logit contributions
ourke+1.252
versions+1.153
acters+1.028
onda+1.006
ggle+0.999
Neg logit contributions
theless-1.539
ammu-1.177
ーテ-1.150
 Lauder-1.123
 Gors-1.110
Token pay
Token ID1414
Feature activation-2.14e-01

Pos logit contributions
theless+0.044
ammu+0.032
ーテ+0.031
 Lauder+0.030
emo+0.030
Neg logit contributions
ourke-0.049
versions-0.045
acters-0.041
onda-0.040
eca-0.040
Token attention
Token ID3241
Feature activation-5.95e-01

Pos logit contributions
theless+2.674
ammu+2.039
 Lauder+1.937
ayer+1.927
 Gors+1.922
Neg logit contributions
ourke-2.548
versions-2.336
acters-2.127
onda-2.074
eca-2.068
Token to
Token ID284
Feature activation-2.63e-01

Pos logit contributions
theless+6.138
ammu+4.657
emo+4.465
ayer+4.454
ーテ+4.427
Neg logit contributions
ourke-7.514
versions-7.033
acters-6.444
onda-6.148
eca-6.128
Token polls
Token ID9231
Feature activation+1.74e-02

Pos logit contributions
theless+3.440
ammu+2.675
 Lauder+2.522
ayer+2.517
emo+2.503
Neg logit contributions
ourke-2.969
versions-2.694
acters-2.472
onda-2.376
eca-2.373
Token and
Token ID290
Feature activation-3.37e-01

Pos logit contributions
ourke+0.188
versions+0.171
acters+0.152
onda+0.150
eca+0.148
Neg logit contributions
theless-0.242
ammu-0.187
ーテ-0.180
 Lauder-0.177
emo-0.176
Token fundraising
Token ID18066
Feature activation+3.59e-01

Pos logit contributions
theless+4.331
ammu+3.271
ーテ+3.134
 Gors+3.104
emo+3.100
Neg logit contributions
ourke-3.850
versions-3.537
acters-3.186
onda-3.166
eca-3.115
Token,
Token ID11
Feature activation-5.68e-02

Pos logit contributions
ourke+3.495
versions+3.285
acters+2.790
onda+2.734
eca+2.732
Neg logit contributions
theless-5.208
ammu-4.073
ーテ-3.963
ellen-3.859
 Lauder-3.841
Token but
Token ID475
Feature activation+3.02e-02

Pos logit contributions
theless+0.759
ammu+0.579
ーテ+0.557
 Lauder+0.547
emo+0.543
Neg logit contributions
ourke-0.645
versions-0.591
acters-0.529
onda-0.520
eca-0.515
Token back
Token ID736
Feature activation+7.82e-02

Pos logit contributions
ourke+1.027
versions+0.935
acters+0.828
onda+0.816
eca+0.813
Neg logit contributions
theless-1.307
ammu-1.005
ーテ-0.967
 Lauder-0.945
emo-0.942
Token in
Token ID287
Feature activation+4.71e-02

Pos logit contributions
ourke+0.838
versions+0.765
acters+0.677
onda+0.666
eca+0.659
Neg logit contributions
theless-1.097
ammu-0.849
ーテ-0.818
 Lauder-0.802
emo-0.798
Token June
Token ID2795
Feature activation+2.75e-02

Pos logit contributions
ourke+0.513
versions+0.473
acters+0.415
onda+0.411
eca+0.406
Neg logit contributions
theless-0.650
ammu-0.501
ーテ-0.486
emo-0.470
ayer-0.470
Token.
Token ID13
Feature activation+2.80e-01

Pos logit contributions
ourke+0.298
versions+0.272
acters+0.241
onda+0.237
eca+0.235
Neg logit contributions
theless-0.383
ammu-0.295
ーテ-0.285
 Lauder-0.279
emo-0.278
Token Talk
Token ID12167
Feature activation-4.43e-02

Pos logit contributions
ourke+3.130
versions+2.875
acters+2.542
eca+2.523
onda+2.506
Neg logit contributions
theless-3.746
ammu-2.855
ーテ-2.727
 Lauder-2.665
emo-2.661
TokenTo
Token ID2514
Feature activation-2.85e-02

Pos logit contributions
theless+0.529
ammu+0.389
ーテ+0.371
 Lauder+0.362
emo+0.361
Neg logit contributions
ourke-0.566
versions-0.524
acters-0.475
onda-0.468
eca-0.463
Token employs
Token ID24803
Feature activation-2.31e-03

Pos logit contributions
theless+0.370
ammu+0.279
ーテ+0.268
 Lauder+0.263
emo+0.261
Neg logit contributions
ourke-0.336
versions-0.309
acters-0.277
onda-0.273
eca-0.270
Token a
Token ID257
Feature activation-2.68e-01

Pos logit contributions
theless+0.032
ammu+0.025
ーテ+0.024
 Lauder+0.024
emo+0.023
Neg logit contributions
ourke-0.025
versions-0.023
acters-0.020
onda-0.020
eca-0.020
Token large
Token ID1588
Feature activation-5.75e-02

Pos logit contributions
theless+3.409
ammu+2.538
ーテ+2.447
 Lauder+2.445
 Gors+2.418
Neg logit contributions
ourke-3.132
versions-2.873
acters-2.631
onda-2.590
ggle-2.539
Token number
Token ID1271
Feature activation-1.16e-01

Pos logit contributions
theless+0.705
ammu+0.522
ーテ+0.499
 Lauder+0.492
ayer+0.487
Neg logit contributions
ourke-0.716
versions-0.662
acters-0.598
onda-0.590
eca-0.582
Token of
Token ID286
Feature activation+1.36e-01

Pos logit contributions
theless+1.595
ammu+1.231
ーテ+1.183
 Lauder+1.164
ayer+1.157
Neg logit contributions
ourke-1.273
versions-1.163
acters-1.035
onda-1.017
eca-1.004
Tokenuse
Token ID1904
Feature activation-1.54e-01

Pos logit contributions
ourke+4.942
versions+4.510
onda+4.255
ggle+3.885
acters+3.842
Neg logit contributions
theless-8.151
ーテ-6.345
ammu-6.197
 Gors-6.034
 Lauder-6.002
Token themselves
Token ID2405
Feature activation-2.39e-03

Pos logit contributions
theless+1.473
ammu+0.993
ーテ+0.919
 Lauder+0.909
ayer+0.897
Neg logit contributions
ourke-2.324
versions-2.174
acters-2.024
onda-1.979
ggle-1.969
Token,
Token ID11
Feature activation-6.86e-02

Pos logit contributions
theless+0.033
ammu+0.025
ーテ+0.024
 Lauder+0.024
ayer+0.024
Neg logit contributions
ourke-0.026
versions-0.024
acters-0.021
onda-0.021
eca-0.021
Token giving
Token ID3501
Feature activation-3.23e-01

Pos logit contributions
theless+0.899
ammu+0.681
ーテ+0.653
 Lauder+0.642
ayer+0.637
Neg logit contributions
ourke-0.798
versions-0.732
acters-0.658
onda-0.645
eca-0.640
Token rise
Token ID4485
Feature activation-2.55e-01

Pos logit contributions
theless+4.157
ammu+3.198
 Lauder+3.036
 Gors+3.023
ーテ+3.021
Neg logit contributions
ourke-3.686
versions-3.355
acters-3.109
eca-2.994
ggle-2.978
Token to
Token ID284
Feature activation-1.25e-01

Pos logit contributions
theless+3.229
ammu+2.461
ーテ+2.336
emo+2.313
 Lauder+2.309
Neg logit contributions
ourke-3.003
versions-2.773
acters-2.487
onda-2.432
ggle-2.421
Token an
Token ID281
Feature activation-8.60e-02

Pos logit contributions
theless+1.681
ammu+1.289
ーテ+1.229
 Lauder+1.224
 Gors+1.214
Neg logit contributions
ourke-1.399
versions-1.274
acters-1.141
onda-1.117
eca-1.107
Token appeal
Token ID5198
Feature activation+7.89e-02

Pos logit contributions
theless+1.193
ammu+0.929
ーテ+0.889
 Lauder+0.877
emo+0.875
Neg logit contributions
ourke-0.927
versions-0.849
acters-0.752
onda-0.735
ggle-0.729
Token to
Token ID284
Feature activation-2.09e-01

Pos logit contributions
ourke+0.866
versions+0.790
acters+0.700
onda+0.692
eca+0.683
Neg logit contributions
theless-1.089
ammu-0.837
ーテ-0.808
 Lauder-0.789
emo-0.784
Token the
Token ID262
Feature activation-3.11e-01

Pos logit contributions
theless+2.676
ammu+2.042
 Gors+1.913
ーテ+1.909
 Lauder+1.909
Neg logit contributions
ourke-2.416
versions-2.231
acters-2.047
ggle-1.971
onda-1.963
Token U
Token ID471
Feature activation-1.04e-01

Pos logit contributions
theless+3.871
ammu+2.983
 Lauder+2.795
emo+2.781
 Gors+2.768
Neg logit contributions
ourke-3.630
versions-3.375
acters-3.145
ggle-3.018
eca-2.997
Token be
Token ID307
Feature activation-7.58e-02

Pos logit contributions
ourke+2.357
versions+2.155
acters+1.916
onda+1.902
eca+1.872
Neg logit contributions
theless-2.957
ammu-2.247
ーテ-2.168
 Lauder-2.141
 Gors-2.122
Token).
Token ID737
Feature activation+4.42e-02

Pos logit contributions
theless+1.016
ammu+0.777
ーテ+0.749
 Lauder+0.733
ayer+0.733
Neg logit contributions
ourke-0.853
versions-0.784
acters-0.701
onda-0.687
eca-0.679
Token
Token ID198
Feature activation-1.72e-01

Pos logit contributions
ourke+0.518
versions+0.476
acters+0.428
onda+0.421
eca+0.416
Neg logit contributions
theless-0.575
ammu-0.435
ーテ-0.418
 Lauder-0.409
emo-0.407
Token
Token ID198
Feature activation+1.95e-01

Pos logit contributions
theless+2.143
ammu+1.626
 Lauder+1.518
ーテ+1.517
ayer+1.517
Neg logit contributions
ourke-2.083
versions-1.907
acters-1.728
onda-1.705
eca-1.678
TokenDisclaimer
Token ID19618
Feature activation-9.32e-02

Pos logit contributions
ourke+2.258
versions+2.098
acters+1.849
onda+1.829
eca+1.828
Neg logit contributions
theless-2.587
ammu-1.913
ーテ-1.906
 Lauder-1.841
 Gors-1.823
Token:
Token ID25
Feature activation-1.90e-01

Pos logit contributions
theless+1.201
ammu+0.914
ーテ+0.873
ayer+0.866
 Gors+0.859
Neg logit contributions
ourke-1.094
versions-1.006
acters-0.898
onda-0.879
ggle-0.877
Token These
Token ID2312
Feature activation-1.06e-01

Pos logit contributions
theless+2.420
ammu+1.816
ーテ+1.749
 Gors+1.726
 Lauder+1.716
Neg logit contributions
ourke-2.256
versions-2.072
acters-1.864
ggle-1.823
onda-1.822
Token tips
Token ID9040
Feature activation+2.91e-02

Pos logit contributions
theless+1.415
ammu+1.080
ーテ+1.037
ayer+1.017
 Lauder+1.016
Neg logit contributions
ourke-1.215
versions-1.104
acters-0.991
onda-0.973
ggle-0.966
Token are
Token ID389
Feature activation-1.68e-01

Pos logit contributions
ourke+0.339
versions+0.312
acters+0.280
onda+0.276
eca+0.273
Neg logit contributions
theless-0.379
ammu-0.287
ーテ-0.276
 Lauder-0.270
emo-0.268
Token based
Token ID1912
Feature activation+1.56e-01

Pos logit contributions
theless+2.126
ammu+1.595
ーテ+1.533
 Gors+1.504
ayer+1.501
Neg logit contributions
ourke-2.027
versions-1.834
acters-1.679
onda-1.644
ggle-1.635
Token on
Token ID319
Feature activation+3.16e-01

Pos logit contributions
ourke+1.751
versions+1.602
acters+1.432
onda+1.411
eca+1.391
Neg logit contributions
theless-2.109
ammu-1.609
ーテ-1.550
 Lauder-1.519
emo-1.511
Token curl
Token ID29249
Feature activation+1.74e-01

Pos logit contributions
ourke+0.247
versions+0.226
acters+0.201
onda+0.198
eca+0.196
Neg logit contributions
theless-0.307
ammu-0.235
ーテ-0.227
 Lauder-0.222
emo-0.221
Token,
Token ID11
Feature activation+2.68e-02

Pos logit contributions
ourke+1.811
versions+1.653
acters+1.451
onda+1.431
eca+1.413
Neg logit contributions
theless-2.481
ammu-1.907
ーテ-1.858
 Lauder-1.825
emo-1.815
Token a
Token ID257
Feature activation+9.89e-03

Pos logit contributions
ourke+0.293
versions+0.268
acters+0.238
onda+0.234
eca+0.232
Neg logit contributions
theless-0.369
ammu-0.284
ーテ-0.274
 Lauder-0.268
emo-0.267
Token comeback
Token ID21933
Feature activation-6.43e-02

Pos logit contributions
ourke+0.113
versions+0.104
acters+0.093
onda+0.091
eca+0.091
Neg logit contributions
theless-0.131
ammu-0.100
ーテ-0.096
 Lauder-0.094
emo-0.094
Token,
Token ID11
Feature activation+1.37e-01

Pos logit contributions
theless+0.886
ammu+0.687
ーテ+0.659
ayer+0.646
 Lauder+0.645
Neg logit contributions
ourke-0.700
versions-0.637
acters-0.570
onda-0.560
eca-0.551
Token a
Token ID257
Feature activation+4.38e-02

Pos logit contributions
ourke+1.408
versions+1.281
acters+1.123
onda+1.105
eca+1.095
Neg logit contributions
theless-1.983
ammu-1.540
ーテ-1.492
emo-1.460
 Lauder-1.459
Token nine
Token ID5193
Feature activation+2.06e-01

Pos logit contributions
ourke+0.491
versions+0.451
acters+0.400
onda+0.393
eca+0.390
Neg logit contributions
theless-0.593
ammu-0.453
ーテ-0.438
 Lauder-0.428
emo-0.426
Token,
Token ID11
Feature activation+4.25e-03

Pos logit contributions
ourke+1.913
versions+1.731
acters+1.487
onda+1.479
eca+1.439
Neg logit contributions
theless-3.181
ammu-2.485
ーテ-2.440
 Lauder-2.384
 Gors-2.377
Token or
Token ID393
Feature activation+1.17e-01

Pos logit contributions
ourke+0.045
versions+0.041
acters+0.036
onda+0.035
eca+0.035
Neg logit contributions
theless-0.060
ammu-0.047
ーテ-0.045
 Lauder-0.044
ayer-0.044
Token a
Token ID257
Feature activation+1.62e-01

Pos logit contributions
ourke+1.240
versions+1.133
acters+0.995
onda+0.981
eca+0.971
Neg logit contributions
theless-1.659
ammu-1.285
ーテ-1.244
 Lauder-1.216
emo-1.215
Token sl
Token ID1017
Feature activation+9.99e-02

Pos logit contributions
ourke+1.764
versions+1.639
acters+1.427
onda+1.405
eca+1.401
Neg logit contributions
theless-2.231
ammu-1.706
ーテ-1.656
 Lauder-1.617
emo-1.612
Token [
Token ID685
Feature activation+1.14e-01

Pos logit contributions
theless+1.359
ammu+1.022
ーテ+0.977
 Lauder+0.960
emo+0.954
Neg logit contributions
ourke-1.286
versions-1.184
acters-1.065
onda-1.049
eca-1.038
Token10
Token ID940
Feature activation+1.52e-01

Pos logit contributions
ourke+1.121
versions+1.035
acters+0.885
onda+0.878
eca+0.872
Neg logit contributions
theless-1.723
ーテ-1.332
ammu-1.323
 Lauder-1.278
 Gors-1.271
Token:
Token ID25
Feature activation+3.50e-01

Pos logit contributions
ourke+1.501
versions+1.387
acters+1.206
eca+1.184
onda+1.176
Neg logit contributions
theless-2.288
ーテ-1.766
ammu-1.737
 Lauder-1.669
etheless-1.653
Token48
Token ID2780
Feature activation-4.33e-02

Pos logit contributions
ourke+3.852
versions+3.541
eca+3.120
acters+3.108
onda+3.077
Neg logit contributions
theless-4.787
ammu-3.628
ーテ-3.582
 Gors-3.458
emo-3.451
Token]
Token ID60
Feature activation-3.35e-02

Pos logit contributions
theless+0.609
ammu+0.470
ーテ+0.457
 Lauder+0.445
emo+0.442
Neg logit contributions
ourke-0.464
versions-0.423
acters-0.375
onda-0.368
eca-0.364
Token <
Token ID1279
Feature activation-6.17e-01

Pos logit contributions
theless+0.431
ammu+0.323
ーテ+0.312
 Lauder+0.303
emo+0.302
Neg logit contributions
ourke-0.398
versions-0.367
acters-0.330
onda-0.325
eca-0.322
Tokenmin
Token ID1084
Feature activation+4.80e-01

Pos logit contributions
theless+8.142
ammu+6.427
emo+6.322
ellen+6.107
ayer+6.102
Neg logit contributions
ourke-6.323
versions-5.658
acters-5.176
onda-4.869
eca-4.771
Tokenion
Token ID295
Feature activation-7.47e-02

Pos logit contributions
ourke+5.381
versions+4.875
onda+4.428
ington+4.419
acters+4.378
Neg logit contributions
theless-6.376
ーテ-5.061
ammu-4.689
 Gors-4.657
 Lauder-4.637
Token13
Token ID1485
Feature activation+3.42e-01

Pos logit contributions
theless+1.056
ammu+0.819
ーテ+0.790
 Lauder+0.775
emo+0.772
Neg logit contributions
ourke-0.792
versions-0.721
acters-0.638
onda-0.627
eca-0.619
Token37
Token ID2718
Feature activation+1.88e-02

Pos logit contributions
ourke+2.832
versions+2.611
acters+2.243
onda+2.141
ington+2.134
Neg logit contributions
theless-5.590
ーテ-4.518
ammu-4.317
 Gors-4.277
 Lauder-4.260
Token>
Token ID29
Feature activation+4.45e-01

Pos logit contributions
ourke+0.179
versions+0.162
acters+0.141
onda+0.137
eca+0.136
Neg logit contributions
theless-0.289
ammu-0.227
ーテ-0.223
 Lauder-0.218
 Gors-0.216
Token,
Token ID11
Feature activation-1.43e-01

Pos logit contributions
theless+3.477
ammu+2.572
 Lauder+2.476
 Gors+2.457
ーテ+2.454
Neg logit contributions
ourke-3.448
versions-3.176
acters-2.896
onda-2.833
ggle-2.809
Token Louis
Token ID5593
Feature activation+8.30e-02

Pos logit contributions
theless+1.827
ammu+1.384
ーテ+1.327
 Lauder+1.305
 Gors+1.299
Neg logit contributions
ourke-1.687
versions-1.555
acters-1.399
onda-1.374
ggle-1.356
Token H
Token ID367
Feature activation+9.66e-02

Pos logit contributions
ourke+1.363
versions+1.283
acters+1.191
onda+1.182
eca+1.172
Neg logit contributions
theless-0.690
ammu-0.426
ーテ-0.395
 Lauder-0.375
emo-0.373
Token.
Token ID13
Feature activation-3.17e-01

Pos logit contributions
ourke+1.062
versions+0.958
onda+0.854
acters+0.851
eca+0.833
Neg logit contributions
theless-1.350
ammu-1.023
ーテ-1.006
 Lauder-0.962
 Gors-0.958
Token Feldman
Token ID45498
Feature activation-3.04e-01

Pos logit contributions
theless+2.548
ammu+1.601
 Lauder+1.558
 Gors+1.537
ーテ+1.492
Neg logit contributions
ourke-5.116
versions-4.860
acters-4.535
ggle-4.433
onda-4.428
Token,
Token ID11
Feature activation-5.81e-01

Pos logit contributions
theless+3.817
ammu+2.923
 Lauder+2.785
ーテ+2.772
 Gors+2.767
Neg logit contributions
ourke-3.513
versions-3.214
acters-2.935
ggle-2.884
onda-2.880
Token James
Token ID3700
Feature activation-4.04e-01

Pos logit contributions
theless+6.712
ammu+5.093
 Gors+4.924
 Lauder+4.878
ーテ+4.857
Neg logit contributions
ourke-6.954
versions-6.296
acters-5.790
onda-5.747
ggle-5.729
Token L
Token ID406
Feature activation+2.31e-01

Pos logit contributions
theless+4.997
ammu+3.850
 Lauder+3.741
 Gors+3.723
ellen+3.614
Neg logit contributions
ourke-4.744
versions-4.364
acters-3.952
ggle-3.841
onda-3.799
Token.
Token ID13
Feature activation-1.80e-01

Pos logit contributions
ourke+2.450
versions+2.202
acters+1.974
onda+1.958
eca+1.947
Neg logit contributions
theless-3.313
ammu-2.490
ーテ-2.462
 Lauder-2.367
 Gors-2.353
Token K
Token ID509
Feature activation+1.07e-01

Pos logit contributions
theless+2.191
ammu+1.639
 Lauder+1.574
ーテ+1.565
 Gors+1.562
Neg logit contributions
ourke-2.225
versions-2.062
acters-1.866
onda-1.826
ggle-1.812
Tokenug
Token ID1018
Feature activation+1.40e-01

Pos logit contributions
ourke+1.214
versions+1.110
acters+0.990
onda+0.987
eca+0.975
Neg logit contributions
theless-1.445
ammu-1.081
ーテ-1.072
 Lauder-1.026
 Gors-1.016
Token.
Token ID13
Feature activation-1.53e-02

Pos logit contributions
theless+3.184
ammu+2.415
ーテ+2.272
 Lauder+2.269
ayer+2.251
Neg logit contributions
ourke-2.938
versions-2.705
acters-2.434
onda-2.379
ggle-2.366
Token Stephan
Token ID20800
Feature activation-3.15e-01

Pos logit contributions
theless+0.212
ammu+0.163
ーテ+0.157
 Lauder+0.154
emo+0.153
Neg logit contributions
ourke-0.168
versions-0.153
acters-0.136
onda-0.134
eca-0.132
Tokenopol
Token ID39704
Feature activation-9.37e-03

Pos logit contributions
theless+3.071
ammu+2.165
ayer+1.974
ellen+1.960
emo+1.952
Neg logit contributions
ourke-4.571
versions-4.291
acters-3.931
ggle-3.848
eca-3.843
Tokenous
Token ID516
Feature activation-7.56e-02

Pos logit contributions
theless+0.121
ammu+0.091
ーテ+0.087
 Lauder+0.085
emo+0.085
Neg logit contributions
ourke-0.111
versions-0.102
acters-0.092
onda-0.090
eca-0.089
Token will
Token ID481
Feature activation-4.14e-01

Pos logit contributions
theless+0.979
ammu+0.741
ーテ+0.708
 Lauder+0.696
ayer+0.693
Neg logit contributions
ourke-0.889
versions-0.817
acters-0.735
onda-0.721
eca-0.714
Token be
Token ID307
Feature activation-6.11e-01

Pos logit contributions
theless+4.918
ammu+3.729
ayer+3.481
 Lauder+3.432
ーテ+3.406
Neg logit contributions
ourke-5.104
versions-4.689
acters-4.221
onda-4.102
eca-4.090
Token in
Token ID287
Feature activation-5.62e-01

Pos logit contributions
theless+7.088
ammu+5.200
 Lauder+5.133
ayer+4.999
 Gors+4.970
Neg logit contributions
ourke-7.190
versions-6.721
acters-6.166
eca-6.010
ggle-5.937
Token Boston
Token ID6182
Feature activation+2.98e-02

Pos logit contributions
theless+6.411
ammu+5.021
 Lauder+4.820
 Gors+4.678
ellen+4.570
Neg logit contributions
ourke-6.826
versions-6.017
acters-5.714
onda-5.574
ggle-5.466
Token on
Token ID319
Feature activation+2.84e-02

Pos logit contributions
ourke+0.330
versions+0.302
acters+0.269
onda+0.265
eca+0.262
Neg logit contributions
theless-0.406
ammu-0.311
ーテ-0.299
 Lauder-0.294
emo-0.292
Token Tuesday
Token ID3431
Feature activation+2.05e-01

Pos logit contributions
ourke+0.321
versions+0.295
acters+0.263
onda+0.259
eca+0.256
Neg logit contributions
theless-0.381
ammu-0.290
ーテ-0.281
 Lauder-0.274
emo-0.273
Token morning
Token ID3329
Feature activation+8.37e-02

Pos logit contributions
ourke+1.921
versions+1.739
onda+1.490
acters+1.488
eca+1.457
Neg logit contributions
theless-3.129
ammu-2.464
ーテ-2.406
 Lauder-2.348
ayer-2.341
Token is
Token ID318
Feature activation-2.34e-01

Pos logit contributions
ourke+1.938
versions+1.757
acters+1.551
onda+1.548
ggle+1.512
Neg logit contributions
theless-2.557
ammu-1.967
ーテ-1.927
 Lauder-1.865
ayer-1.853
Token even
Token ID772
Feature activation-3.03e-01

Pos logit contributions
theless+3.019
ammu+2.300
 Gors+2.185
emo+2.183
ーテ+2.179
Neg logit contributions
ourke-2.718
versions-2.499
acters-2.270
onda-2.193
eca-2.192
Token scar
Token ID10153
Feature activation-5.11e-03

Pos logit contributions
theless+4.058
ammu+3.112
emo+2.966
 Gors+2.959
 Lauder+2.943
Neg logit contributions
ourke-3.387
versions-3.078
acters-2.774
eca-2.735
onda-2.695
Tokenier
Token ID959
Feature activation-3.00e-01

Pos logit contributions
theless+0.051
ammu+0.035
ーテ+0.033
 Lauder+0.032
ayer+0.032
Neg logit contributions
ourke-0.075
versions-0.070
acters-0.065
onda-0.064
eca-0.063
Token than
Token ID621
Feature activation-4.94e-01

Pos logit contributions
theless+3.774
ammu+2.845
ーテ+2.699
ayer+2.692
 Lauder+2.680
Neg logit contributions
ourke-3.540
versions-3.238
acters-2.910
onda-2.893
eca-2.846
Token her
Token ID607
Feature activation-5.65e-01

Pos logit contributions
theless+6.214
ammu+4.698
 Gors+4.666
 Lauder+4.577
emo+4.573
Neg logit contributions
ourke-5.523
versions-5.038
acters-4.592
eca-4.460
onda-4.430
Token appearance
Token ID5585
Feature activation-2.35e-01

Pos logit contributions
theless+6.992
ammu+5.261
 Gors+5.132
emo+5.104
 Lauder+5.055
Neg logit contributions
ourke-6.462
versions-5.630
acters-5.206
eca-5.135
onda-5.119
Token now
Token ID783
Feature activation-3.18e-02

Pos logit contributions
theless+3.069
ammu+2.324
emo+2.200
ーテ+2.193
 Gors+2.190
Neg logit contributions
ourke-2.731
versions-2.467
acters-2.234
onda-2.191
eca-2.183
Token.
Token ID13
Feature activation-2.94e-01

Pos logit contributions
theless+0.434
ammu+0.333
ーテ+0.319
 Lauder+0.315
emo+0.313
Neg logit contributions
ourke-0.352
versions-0.321
acters-0.287
onda-0.281
eca-0.279
Token
Token ID198
Feature activation-2.67e-02

Pos logit contributions
theless+3.510
ammu+2.625
 Gors+2.534
 Lauder+2.520
ーテ+2.518
Neg logit contributions
ourke-3.626
versions-3.345
acters-3.039
onda-2.977
ggle-2.948
Token
Token ID198
Feature activation-1.87e-01

Pos logit contributions
theless+0.358
ammu+0.271
ーテ+0.264
 Lauder+0.256
 Gors+0.254
Neg logit contributions
ourke-0.305
versions-0.280
acters-0.250
onda-0.246
eca-0.244
TokenBlack
Token ID9915
Feature activation-7.79e-01

Pos logit contributions
theless+5.486
ammu+4.297
Berry+4.136
emo+4.109
ellen+4.057
Neg logit contributions
ourke-5.938
versions-5.478
acters-5.108
onda-5.001
ggle-4.977
Token President
Token ID1992
Feature activation-2.29e-01

Pos logit contributions
theless+9.056
ammu+7.230
Berry+7.188
emo+7.026
wolves+6.794
Neg logit contributions
ourke-8.656
versions-7.746
ggle-6.958
acters-6.878
eca-6.870
Token-
Token ID12
Feature activation-6.80e-01

Pos logit contributions
theless+3.004
ammu+2.347
 Lauder+2.226
 Gors+2.209
ellen+2.206
Neg logit contributions
ourke-2.549
versions-2.327
acters-2.086
onda-2.035
ggle-2.030
TokenWhite
Token ID12256
Feature activation-6.51e-01

Pos logit contributions
theless+7.174
ammu+5.719
Berry+5.706
emo+5.554
ellen+5.493
Neg logit contributions
ourke-7.957
versions-7.328
acters-6.924
ggle-6.701
onda-6.678
Token Vs
Token ID31280
Feature activation+2.30e-01

Pos logit contributions
theless+7.664
ammu+6.120
emo+5.852
 Gors+5.808
 Lauder+5.800
Neg logit contributions
ourke-7.299
versions-6.708
acters-6.046
eca-5.853
ggle-5.787
Token Black
Token ID2619
Feature activation-6.18e-01

Pos logit contributions
ourke+2.417
versions+2.193
acters+1.928
onda+1.918
eca+1.893
Neg logit contributions
theless-3.301
ammu-2.516
ーテ-2.452
ayer-2.365
 Lauder-2.359
Token-
Token ID12
Feature activation-6.79e-01

Pos logit contributions
theless+7.469
ammu+5.887
Berry+5.558
emo+5.558
 Lauder+5.539
Neg logit contributions
ourke-6.859
versions-6.230
ggle-5.588
acters-5.513
eca-5.485
TokenJer
Token ID36134
Feature activation-2.62e-01

Pos logit contributions
theless+7.156
Berry+5.722
ammu+5.688
emo+5.541
ellen+5.469
Neg logit contributions
ourke-7.950
versions-7.321
acters-6.916
onda-6.700
ggle-6.683
Tokenkin
Token ID5116
Feature activation-1.80e-01

Pos logit contributions
theless+3.970
ammu+3.213
ayer+3.041
ellen+3.033
emo+3.031
Neg logit contributions
ourke-2.383
versions-2.157
acters-1.855
ggle-1.778
eca-1.776
Token Off
Token ID3242
Feature activation+6.36e-03

Pos logit contributions
theless+2.323
ammu+1.786
ーテ+1.692
 Lauder+1.676
emo+1.674
Neg logit contributions
ourke-2.078
versions-1.904
acters-1.714
onda-1.675
ggle-1.667
Token On
Token ID1550
Feature activation-2.21e-01

Pos logit contributions
ourke+0.072
versions+0.066
acters+0.058
onda+0.058
eca+0.057
Neg logit contributions
theless-0.086
ammu-0.065
ーテ-0.063
 Lauder-0.062
emo-0.061