TOP ACTIVATIONS
MAX = 3.912

opposing ideas , is fairly imp en etr able . On
the brain , are too imp rec ise to measure the
inf lex ible deadlines is imp ract icable .
These warm and stubborn sonic imp urities , having been encoded
( 32 ) Ref use to solve the first
maj est é l ' Imp é rat rice , A
re at this imp asse right now where they
the Spanish government insists on imp eding the basic functioning of
-- Ref eren ced :
brute grew hard panels of imp ossibly thick bone all over
References : Ref . 1 - The Inner
be their first ( Sub ban averaged 5 : 46
apart . Add Imm ersion G imm icks wherever
to the lack of an imp assion ed , positive ,
over Baylor . Turn ing to the Rice Trading
and the word Dep ression , though
- Un em otional , Sub serv ient , Violent
could create a paper trail imp ug ning Price
talks at the imp asse , but
that its success dep ends on us all

MIN ACTIVATIONS
MIN = -3.237

an alliance with B ibli othe ca to directly sell its
Pi , Nvidia Shield , Chrom ecast or Fire St ick
and active bystand er training .
, in conjunction with Encyclopedia Britann ica Films and an O
( SELECT SPL IT ( content , ' '
In a statement , Google defended its Android strategy and
Sc rub bing centers and Content Distribution Networks ( CD N
acts self lessly for the common man - NOT lie 2
city face a f ait accompl i under which they are
the hack , Crowd strike stood by its analysis writing ,
be located . <|endoftext|> The thorn y debates around encryption ,
subsequently completing a first Sunderland tre ble by shooting his 19
97 s )} 10 . B xe 7 { ( 66
Born in Steven age , Hert s 1991 : First ride
the first Ghostbusters , the ne igh bor hood in New
я . н о� и е
k art ing track in Hert ford shire . " They
from a Paris ian disc othe que owned by a slim
of the Anaheim Ducks . Justin Faul k , one of
s somewhat based on the comic book , Ar

INTERVAL 2.125 - 3.912
CONTAINS 0.058%

Donald Trump is simply to turn off the cable news shows
s leadership : Lib ya needs to exploit its
in all likelihood , from making the playoffs in Kat ow
You can 't make this up :
to the required dim ensions , with a few

INTERVAL 0.338 - 2.125
CONTAINS 24.918%

ACT ERS IN D ANG ER . How will
Netflix ) supported the FCC power grab of net neutrality .
the Mid as Touch that turns everything into s *** .
and remain ) good friends . I swear to you that
unexpected market shifts never seems to damp en their determination to

INTERVAL -1.450 - 0.338
CONTAINS 74.295%

doing something they enjoy , and are now far happier .
the H 1 N 1 strain which caused the 1918 flu
Ste lla - san collapsed ?」 "
behavior , the school 's response , and his dissatisfaction with
with us . There was a sequence set in a mall

INTERVAL -3.237 - -1.450
CONTAINS 0.729%

: i T unes ( HD ) | iTunes
medicine often did more harm than it did good . Mol
to follow in the same footsteps . He taught a free
play on the themes of art , nature and diversity .
Facebook Share this comment on Twitter NY T Pick
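The four INTERVAL blocks above look like equal-width bins spanning MIN to MAX, with the CONTAINS figures giving the share of tokens in each bin. That reading is an inference from the numbers, not something the dashboard states. The sketch below checks it. The constants are copied from the listing, while the names `equal_width_edges` and `max_edge_error` are hypothetical helpers introduced only for this illustration.

```python
# Values copied from the listing above (MIN, MAX, and the four INTERVAL blocks).
ACT_MIN, ACT_MAX = -3.237, 3.912
N_BINS = 4  # assumption: the dashboard uses four equal-width bins

# (low edge, high edge, percent of tokens) as reported for each interval.
reported = [
    (2.125, 3.912, 0.058),
    (0.338, 2.125, 24.918),
    (-1.450, 0.338, 74.295),
    (-3.237, -1.450, 0.729),
]


def equal_width_edges(lo, hi, n):
    """Return the n + 1 edges of n equal-width bins covering [lo, hi]."""
    step = (hi - lo) / n
    return [lo + i * step for i in range(n + 1)]


def max_edge_error(edges, intervals):
    """Largest gap between computed edges and the distinct reported edges."""
    reported_edges = sorted({v for lo, hi, _ in intervals for v in (lo, hi)})
    return max(abs(a - b) for a, b in zip(edges, reported_edges))


edges = equal_width_edges(ACT_MIN, ACT_MAX, N_BINS)
total_percent = sum(pct for _, _, pct in reported)

print("computed edges:", [round(e, 3) for e in edges])
print("max edge error:", max_edge_error(edges, reported))
print("total percent:", round(total_percent, 3))
```

If the equal-width reading holds, the computed edges agree with the reported ones to within rounding at three decimals, and the four percentages add up to 100%. On that reading, nearly all tokens (about 99.2%) fall in the two middle bins, and the strongly activating bin that contains the `imp`, `Ref`, `Sub`, and `Imm` examples covers only 0.058% of tokens.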
Token opposing
Token ID12330
Feature activation+7.64e-01

Pos logit contributions
aye+1.970
ersed+1.739
 tongues+1.652
oing+1.637
 hormones+1.636
Neg logit contributions
ighth-1.957
fs-1.953
brates-1.920
umber-1.891
ibia-1.887
Token ideas
Token ID4213
Feature activation+1.18e-01

Pos logit contributions
aye+3.204
ersed+2.784
 tongues+2.680
 hormones+2.648
oing+2.627
Neg logit contributions
ighth-3.681
fs-3.655
brates-3.620
ibia-3.545
umber-3.545
Token,
Token ID11
Feature activation+2.92e-01

Pos logit contributions
aye+0.512
ersed+0.449
oing+0.423
 tongues+0.417
 hormones+0.415
Neg logit contributions
ighth-0.556
fs-0.554
brates-0.546
umber-0.537
ibia-0.534
Token is
Token ID318
Feature activation+3.73e-01

Pos logit contributions
aye+1.296
ersed+1.141
oing+1.075
 tongues+1.064
 hormones+1.058
Neg logit contributions
ighth-1.350
fs-1.345
brates-1.325
umber-1.303
ibia-1.297
Token fairly
Token ID6547
Feature activation-6.12e-02

Pos logit contributions
aye+1.651
ersed+1.456
oing+1.370
 tongues+1.360
 hormones+1.357
Neg logit contributions
ighth-1.721
fs-1.717
brates-1.694
umber-1.662
ibia-1.659
Token imp
Token ID848
Feature activation+3.91e+00

Pos logit contributions
ighth+0.271
fs+0.269
brates+0.264
umber+0.260
ibia+0.257
Neg logit contributions
aye-0.285
ersed-0.251
oing-0.240
astered-0.233
 tongues-0.233
Tokenen
Token ID268
Feature activation+2.06e-01

Pos logit contributions
aye+13.996
ersed+12.818
ossibly+12.743
oing+11.473
astered+11.315
Neg logit contributions
brates-15.630
FORE-14.940
ighth-14.532
emis-14.442
ibia-14.155
Tokenetr
Token ID21879
Feature activation-5.02e-01

Pos logit contributions
aye+1.055
ersed+0.948
 tongues+0.902
oing+0.899
 hormones+0.897
Neg logit contributions
fs-0.806
ighth-0.806
umber-0.784
brates-0.783
ibia-0.765
Tokenable
Token ID540
Feature activation+7.82e-01

Pos logit contributions
fs+1.605
ighth+1.601
umber+1.542
brates+1.531
ibia+1.486
Neg logit contributions
aye-2.931
ersed-2.676
 hormones-2.578
 tongues-2.569
yip-2.551
Token.
Token ID13
Feature activation+3.64e-01

Pos logit contributions
aye+3.548
ersed+3.114
 tongues+2.974
oing+2.948
 hormones+2.947
Neg logit contributions
ighth-3.505
fs-3.450
brates-3.425
ibia-3.343
umber-3.336
Token On
Token ID1550
Feature activation-2.68e-01

Pos logit contributions
aye+1.727
ersed+1.537
oing+1.448
 tongues+1.444
 hormones+1.432
Neg logit contributions
ighth-1.560
fs-1.547
brates-1.524
umber-1.500
ibia-1.492
Token the
Token ID262
Feature activation-2.85e-01

Pos logit contributions
ighth+0.265
fs+0.263
brates+0.261
umber+0.255
ibia+0.254
Neg logit contributions
aye-0.258
ersed-0.229
oing-0.214
astered-0.210
 tongues-0.209
Token brain
Token ID3632
Feature activation-3.06e-01

Pos logit contributions
ighth+1.229
brates+1.215
fs+1.212
ibia+1.182
umber+1.177
Neg logit contributions
aye-1.345
ersed-1.209
oing-1.126
astered-1.103
 tongues-1.087
Token,
Token ID11
Feature activation-1.85e-02

Pos logit contributions
ighth+1.379
fs+1.377
brates+1.363
umber+1.335
ibia+1.327
Neg logit contributions
aye-1.396
ersed-1.230
oing-1.160
 tongues-1.140
astered-1.128
Token are
Token ID389
Feature activation+6.05e-01

Pos logit contributions
fs+0.095
ighth+0.095
brates+0.093
umber+0.092
ibia+0.092
Neg logit contributions
aye-0.072
ersed-0.062
oing-0.058
 tongues-0.057
astered-0.057
Token too
Token ID1165
Feature activation+1.20e-01

Pos logit contributions
aye+2.624
ersed+2.318
oing+2.205
 hormones+2.195
 tongues+2.186
Neg logit contributions
ighth-2.801
fs-2.787
brates-2.762
umber-2.713
ibia-2.708
Token imp
Token ID848
Feature activation+3.88e+00

Pos logit contributions
aye+0.590
ersed+0.526
oing+0.499
 tongues+0.494
 hormones+0.491
Neg logit contributions
ighth-0.494
fs-0.492
brates-0.483
umber-0.474
ibia-0.470
Tokenrec
Token ID8344
Feature activation+1.07e+00

Pos logit contributions
aye+11.481
ossibly+10.268
ersed+10.181
oing+9.351
astered+9.220
Neg logit contributions
FORE-18.069
brates-17.703
ighth-17.063
ibia-16.550
emis-16.453
Tokenise
Token ID786
Feature activation+8.56e-01

Pos logit contributions
aye+5.370
ersed+4.836
astered+4.608
oing+4.599
 tongues+4.480
Neg logit contributions
ighth-4.089
brates-4.083
fs-4.070
FORE-3.930
ibia-3.926
Token to
Token ID284
Feature activation+1.25e+00

Pos logit contributions
aye+3.482
ersed+2.998
 tongues+2.863
 hormones+2.845
oing+2.841
Neg logit contributions
ighth-4.240
fs-4.150
brates-4.118
ibia-4.046
umber-4.034
Token measure
Token ID3953
Feature activation+3.56e-01

Pos logit contributions
aye+5.375
ersed+4.511
 tongues+4.483
 hormones+4.471
oing+4.425
Neg logit contributions
ighth-5.700
fs-5.598
brates-5.563
umber-5.469
ibia-5.432
Token the
Token ID262
Feature activation+5.01e-02

Pos logit contributions
aye+1.534
ersed+1.343
oing+1.263
 tongues+1.258
 hormones+1.253
Neg logit contributions
ighth-1.685
fs-1.678
brates-1.650
umber-1.625
ibia-1.619
Token inf
Token ID1167
Feature activation+1.97e+00

Pos logit contributions
aye+2.796
ersed+2.488
oing+2.351
 tongues+2.326
 hormones+2.320
Neg logit contributions
fs-2.611
ighth-2.605
brates-2.584
umber-2.522
ibia-2.518
Tokenlex
Token ID2588
Feature activation-2.42e-01

Pos logit contributions
aye+7.741
ersed+6.959
astered+6.367
oing+6.193
ellar+6.037
Neg logit contributions
brates-9.420
FORE-9.278
fs-9.103
ighth-9.078
ibia-8.960
Tokenible
Token ID856
Feature activation+7.84e-01

Pos logit contributions
ighth+0.808
fs+0.797
brates+0.780
umber+0.762
ibia+0.756
Neg logit contributions
aye-1.394
ersed-1.254
oing-1.205
 tongues-1.200
 hormones-1.194
Token deadlines
Token ID36773
Feature activation+7.31e-01

Pos logit contributions
aye+3.070
ersed+2.676
 tongues+2.507
oing+2.495
 hormones+2.478
Neg logit contributions
ighth-3.954
fs-3.929
brates-3.927
ibia-3.837
umber-3.817
Token is
Token ID318
Feature activation+1.41e-02

Pos logit contributions
aye+2.991
ersed+2.626
 hormones+2.448
 tongues+2.445
oing+2.436
Neg logit contributions
ighth-3.566
fs-3.510
brates-3.489
umber-3.438
FORE-3.411
Token imp
Token ID848
Feature activation+3.69e+00

Pos logit contributions
aye+0.063
ersed+0.056
oing+0.052
 tongues+0.052
 hormones+0.052
Neg logit contributions
ighth-0.065
fs-0.065
brates-0.063
umber-0.063
ibia-0.062
Tokenract
Token ID974
Feature activation+1.42e-01

Pos logit contributions
aye+13.200
oing+11.390
ossibly+11.347
ersed+11.311
oti+10.219
Neg logit contributions
brates-16.217
FORE-14.541
ighth-14.133
emis-13.981
ibia-13.645
Tokenicable
Token ID18424
Feature activation+7.16e-01

Pos logit contributions
aye+0.785
ersed+0.709
 tongues+0.674
oing+0.674
 hormones+0.671
Neg logit contributions
ighth-0.498
fs-0.496
brates-0.484
umber-0.477
ibia-0.472
Token.
Token ID13
Feature activation-1.03e-01

Pos logit contributions
aye+3.201
ersed+2.808
oing+2.643
 hormones+2.620
 tongues+2.620
Neg logit contributions
ighth-3.253
fs-3.214
brates-3.198
umber-3.102
ibia-3.092
Token
Token ID447
Feature activation-2.45e-01

Pos logit contributions
fs+0.445
ighth+0.444
brates+0.437
umber+0.427
ibia+0.425
Neg logit contributions
aye-0.492
ersed-0.434
oing-0.413
 tongues-0.409
 hormones-0.407
Token
Token ID251
Feature activation-1.86e-01

Pos logit contributions
ighth+0.972
fs+0.970
brates+0.951
umber+0.930
ibia+0.930
Neg logit contributions
aye-1.243
ersed-1.113
oing-1.057
 tongues-1.052
astered-1.042
Token These
Token ID2312
Feature activation+6.96e-01

Pos logit contributions
ighth+0.041
fs+0.041
brates+0.040
umber+0.039
ibia+0.039
Neg logit contributions
aye-0.051
ersed-0.045
oing-0.043
 tongues-0.043
 hormones-0.042
Token warm
Token ID5814
Feature activation+1.63e+00

Pos logit contributions
aye+3.043
ersed+2.678
oing+2.530
 tongues+2.520
 hormones+2.520
Neg logit contributions
ighth-3.201
fs-3.194
brates-3.177
umber-3.100
ibia-3.087
Token and
Token ID290
Feature activation+8.85e-01

Pos logit contributions
aye+7.069
ersed+6.292
 tongues+5.985
astered+5.976
oing+5.964
Neg logit contributions
brates-7.108
ighth-7.097
FORE-6.976
fs-6.972
ibia-6.854
Token stubborn
Token ID24444
Feature activation+6.43e-01

Pos logit contributions
aye+3.859
ersed+3.414
oing+3.260
 tongues+3.259
 hormones+3.232
Neg logit contributions
ighth-4.022
fs-3.993
brates-3.974
ibia-3.894
umber-3.867
Token sonic
Token ID36220
Feature activation-2.07e-01

Pos logit contributions
aye+2.796
ersed+2.465
 tongues+2.364
oing+2.344
 hormones+2.331
Neg logit contributions
ighth-2.981
brates-2.936
fs-2.935
umber-2.869
ibia-2.860
Token imp
Token ID848
Feature activation+3.48e+00

Pos logit contributions
ighth+0.973
fs+0.971
brates+0.953
umber+0.939
ibia+0.934
Neg logit contributions
aye-0.899
ersed-0.788
oing-0.740
 tongues-0.725
astered-0.725
Tokenurities
Token ID10886
Feature activation+4.78e-02

Pos logit contributions
aye+13.752
ersed+12.781
oing+12.545
ersion+12.121
oti+11.679
Neg logit contributions
brates-13.365
FORE-13.167
ighth-13.055
emis-12.189
fs-11.792
Token,
Token ID11
Feature activation+6.13e-01

Pos logit contributions
aye+0.202
ersed+0.176
oing+0.165
 tongues+0.163
 hormones+0.162
Neg logit contributions
ighth-0.232
fs-0.231
brates-0.227
umber-0.224
ibia-0.223
Token having
Token ID1719
Feature activation+2.44e-01

Pos logit contributions
aye+2.649
ersed+2.327
 tongues+2.187
oing+2.186
 hormones+2.178
Neg logit contributions
ighth-2.871
fs-2.865
brates-2.843
ibia-2.778
umber-2.770
Token been
Token ID587
Feature activation-8.75e-01

Pos logit contributions
aye+1.045
ersed+0.917
oing+0.861
 tongues+0.856
 hormones+0.850
Neg logit contributions
ighth-1.159
fs-1.155
brates-1.138
umber-1.119
ibia-1.114
Token encoded
Token ID30240
Feature activation-1.40e-01

Pos logit contributions
ighth+3.796
fs+3.748
umber+3.629
brates+3.612
ibia+3.590
Neg logit contributions
aye-4.037
ersed-3.465
oing-3.330
yip-3.299
astered-3.243
Token (
Token ID357
Feature activation+5.48e-01

Pos logit contributions
ighth+2.726
fs+2.723
brates+2.652
umber+2.638
ibia+2.612
Neg logit contributions
aye-2.632
ersed-2.328
oing-2.156
 hormones-2.117
 tongues-2.115
Token32
Token ID2624
Feature activation+9.26e-01

Pos logit contributions
aye+2.734
ersed+2.428
oing+2.311
astered+2.271
 tongues+2.271
Neg logit contributions
ighth-2.196
fs-2.169
brates-2.157
umber-2.108
ibia-2.094
Token)
Token ID8
Feature activation-4.22e-01

Pos logit contributions
aye+3.982
ersed+3.516
 tongues+3.260
astered+3.255
oing+3.251
Neg logit contributions
ighth-4.198
brates-4.088
fs-4.073
emis-4.036
umber-4.022
Token
Token ID198
Feature activation+7.57e-02

Pos logit contributions
ighth+1.520
fs+1.515
brates+1.491
umber+1.464
ibia+1.445
Neg logit contributions
aye-2.286
ersed-2.058
oing-1.964
 hormones-1.940
 tongues-1.938
Token
Token ID198
Feature activation+1.31e-01

Pos logit contributions
aye+0.393
ersed+0.353
oing+0.335
 tongues+0.333
 hormones+0.332
Neg logit contributions
ighth-0.292
fs-0.290
brates-0.285
umber-0.279
ibia-0.278
TokenRef
Token ID8134
Feature activation+3.46e+00

Pos logit contributions
aye+0.592
ersed+0.522
oing+0.492
 tongues+0.487
 hormones+0.484
Neg logit contributions
ighth-0.591
fs-0.588
brates-0.580
umber-0.569
ibia-0.567
Tokenuse
Token ID1904
Feature activation+6.35e-01

Pos logit contributions
aye+16.096
ersed+14.149
urst+13.645
lect+13.550
astered+13.280
Neg logit contributions
ighth-10.777
brates-10.636
fs-10.550
FORE-10.487
MENTS-9.976
Token to
Token ID284
Feature activation+1.24e-01

Pos logit contributions
aye+2.775
ersed+2.405
 tongues+2.318
oing+2.299
 hormones+2.293
Neg logit contributions
ighth-2.973
fs-2.929
brates-2.896
umber-2.846
ibia-2.836
Token solve
Token ID8494
Feature activation+3.70e-01

Pos logit contributions
aye+0.607
ersed+0.541
oing+0.513
 tongues+0.508
 hormones+0.506
Neg logit contributions
ighth-0.516
fs-0.514
brates-0.505
umber-0.496
ibia-0.493
Token the
Token ID262
Feature activation+3.41e-02

Pos logit contributions
aye+1.586
ersed+1.385
oing+1.309
 tongues+1.301
 hormones+1.292
Neg logit contributions
ighth-1.762
fs-1.754
brates-1.728
umber-1.699
ibia-1.695
Token first
Token ID717
Feature activation-1.22e-01

Pos logit contributions
aye+0.157
ersed+0.139
oing+0.131
 tongues+0.130
 hormones+0.129
Neg logit contributions
ighth-0.151
fs-0.151
brates-0.148
umber-0.146
ibia-0.145
Token maj
Token ID16486
Feature activation+8.77e-01

Pos logit contributions
aye+1.438
ersed+1.280
oing+1.214
 tongues+1.205
 hormones+1.199
Neg logit contributions
ighth-1.232
fs-1.222
brates-1.206
umber-1.184
ibia-1.179
Tokenest
Token ID395
Feature activation-6.60e-01

Pos logit contributions
aye+3.663
ersed+3.059
oing+2.894
astered+2.859
 tongues+2.776
Neg logit contributions
brates-4.258
ighth-4.258
fs-4.240
FORE-4.158
umber-4.106
Tokené
Token ID2634
Feature activation-7.75e-01

Pos logit contributions
fs+3.339
ighth+3.307
ibia+3.256
umber+3.242
brates+3.239
Neg logit contributions
aye-2.451
ersed-2.221
 hormones-2.064
 tongues-2.049
oing-2.038
Token l
Token ID300
Feature activation-1.03e+00

Pos logit contributions
fs+3.218
ighth+3.134
brates+3.111
ibia+3.042
umber+3.041
Neg logit contributions
aye-3.670
ersed-3.363
oing-3.154
 tongues-3.128
 hormones-3.113
Token'
Token ID6
Feature activation-2.14e-01

Pos logit contributions
ighth+2.970
fs+2.899
umber+2.799
brates+2.702
ibia+2.655
Neg logit contributions
aye-6.210
ersed-5.780
 tongues-5.559
oing-5.533
 hormones-5.513
TokenImp
Token ID26950
Feature activation+3.42e+00

Pos logit contributions
ighth+0.872
fs+0.865
umber+0.842
brates+0.830
ibia+0.824
Neg logit contributions
aye-1.048
ersed-0.949
 tongues-0.907
 hormones-0.899
oing-0.898
Tokené
Token ID2634
Feature activation+5.15e-01

Pos logit contributions
aye+16.761
ersed+14.805
ersion+14.257
oing+13.960
oti+13.800
Neg logit contributions
brates-10.444
ighth-10.397
FORE-9.743
fs-9.137
emis-9.074
Tokenrat
Token ID10366
Feature activation-9.32e-01

Pos logit contributions
aye+2.696
ersed+2.420
oing+2.288
 tongues+2.266
astered+2.257
Neg logit contributions
ighth-1.958
brates-1.921
fs-1.919
umber-1.886
ibia-1.851
Tokenrice
Token ID20970
Feature activation-1.93e-01

Pos logit contributions
fs+3.560
ighth+3.481
ibia+3.480
umber+3.419
brates+3.410
Neg logit contributions
aye-4.641
ersed-4.189
yip-4.048
oing-4.044
 tongues-4.036
Token,
Token ID11
Feature activation-1.70e-01

Pos logit contributions
fs+0.801
ighth+0.795
brates+0.781
umber+0.768
ibia+0.763
Neg logit contributions
aye-0.941
ersed-0.848
oing-0.803
 tongues-0.792
 hormones-0.787
TokenA
Token ID32
Feature activation-3.50e-01

Pos logit contributions
fs+0.716
ighth+0.713
brates+0.698
umber+0.688
ibia+0.682
Neg logit contributions
aye-0.820
ersed-0.736
oing-0.696
astered-0.685
 tongues-0.684
Token
Token ID447
Feature activation+2.60e-01

Pos logit contributions
aye+1.751
ersed+1.543
oing+1.459
 tongues+1.446
 hormones+1.440
Neg logit contributions
ighth-1.838
fs-1.828
brates-1.812
umber-1.778
ibia-1.771
Token
Token ID247
Feature activation+3.03e-01

Pos logit contributions
aye+1.056
ersed+0.917
oing+0.860
 tongues+0.849
 hormones+0.846
Neg logit contributions
ighth-1.289
fs-1.283
brates-1.270
umber-1.250
ibia-1.241
Tokenre
Token ID260
Feature activation+1.20e-01

Pos logit contributions
aye+1.442
ersed+1.281
oing+1.212
 tongues+1.200
 hormones+1.194
Neg logit contributions
ighth-1.296
fs-1.288
brates-1.272
umber-1.248
ibia-1.242
Token at
Token ID379
Feature activation-2.91e-01

Pos logit contributions
aye+0.543
ersed+0.479
oing+0.452
 tongues+0.448
 hormones+0.445
Neg logit contributions
ighth-0.543
fs-0.541
brates-0.531
umber-0.523
ibia-0.520
Token this
Token ID428
Feature activation+4.17e-01

Pos logit contributions
ighth+1.312
fs+1.302
brates+1.269
umber+1.262
ibia+1.247
Neg logit contributions
aye-1.312
ersed-1.175
oing-1.097
 tongues-1.082
astered-1.078
Token imp
Token ID848
Feature activation+3.41e+00

Pos logit contributions
aye+2.096
ersed+1.868
oing+1.780
 tongues+1.770
 hormones+1.770
Neg logit contributions
ighth-1.674
fs-1.661
brates-1.642
ibia-1.603
umber-1.603
Tokenasse
Token ID21612
Feature activation-2.20e-01

Pos logit contributions
aye+10.622
ersed+9.722
ossibly+9.272
oing+9.107
ersion+8.785
Neg logit contributions
brates-16.527
ighth-15.766
FORE-15.746
emis-14.906
ibia-14.618
Token right
Token ID826
Feature activation-1.62e-01

Pos logit contributions
fs+0.967
ighth+0.961
brates+0.939
umber+0.936
ibia+0.924
Neg logit contributions
aye-1.023
ersed-0.902
oing-0.854
astered-0.840
 tongues-0.838
Token now
Token ID783
Feature activation+1.55e-01

Pos logit contributions
ighth+0.958
fs+0.956
umber+0.942
brates+0.940
ibia+0.929
Neg logit contributions
aye-0.508
ersed-0.417
oing-0.381
 tongues-0.374
astered-0.369
Token where
Token ID810
Feature activation+3.21e-02

Pos logit contributions
aye+0.720
ersed+0.638
oing+0.602
 tongues+0.597
 hormones+0.592
Neg logit contributions
ighth-0.680
fs-0.678
brates-0.666
umber-0.657
ibia-0.652
Token they
Token ID484
Feature activation-1.68e-01

Pos logit contributions
aye+0.121
ersed+0.105
oing+0.097
 tongues+0.095
astered+0.095
Neg logit contributions
ighth-0.169
fs-0.169
brates-0.166
umber-0.164
ibia-0.163
Token the
Token ID262
Feature activation+3.90e-01

Pos logit contributions
aye+5.450
ersed+4.573
oing+4.361
 tongues+4.316
 hormones+4.316
Neg logit contributions
ighth-6.647
brates-6.559
fs-6.497
ibia-6.420
emis-6.393
Token Spanish
Token ID7897
Feature activation+6.39e-01

Pos logit contributions
aye+1.580
ersed+1.367
oing+1.287
 tongues+1.274
 hormones+1.268
Neg logit contributions
ighth-1.953
fs-1.943
brates-1.922
umber-1.889
ibia-1.882
Token government
Token ID1230
Feature activation+5.50e-02

Pos logit contributions
aye+2.141
ersed+1.805
oing+1.666
 tongues+1.662
 hormones+1.658
Neg logit contributions
ighth-3.613
fs-3.600
brates-3.575
ibia-3.514
umber-3.505
Token insists
Token ID17424
Feature activation+2.01e-01

Pos logit contributions
aye+0.259
ersed+0.230
oing+0.218
 tongues+0.216
 hormones+0.215
Neg logit contributions
ighth-0.239
fs-0.237
brates-0.234
umber-0.230
ibia-0.228
Token on
Token ID319
Feature activation+2.83e-02

Pos logit contributions
aye+0.861
ersed+0.753
oing+0.707
 tongues+0.701
 hormones+0.697
Neg logit contributions
ighth-0.963
fs-0.958
brates-0.944
umber-0.929
ibia-0.924
Token imp
Token ID848
Feature activation+3.30e+00

Pos logit contributions
aye+0.140
ersed+0.125
oing+0.118
 tongues+0.117
 hormones+0.117
Neg logit contributions
ighth-0.117
fs-0.116
brates-0.114
umber-0.112
ibia-0.111
Tokeneding
Token ID8228
Feature activation+5.93e-01

Pos logit contributions
aye+12.224
oing+11.142
ersed+10.820
ersion+10.282
ossibly+9.915
Neg logit contributions
brates-14.409
FORE-13.463
fs-13.149
emis-13.119
ighth-12.778
Token the
Token ID262
Feature activation+7.41e-01

Pos logit contributions
aye+2.521
ersed+2.186
oing+2.063
 tongues+2.053
 hormones+2.036
Neg logit contributions
ighth-2.839
fs-2.826
brates-2.800
ibia-2.743
umber-2.737
Token basic
Token ID4096
Feature activation+8.04e-01

Pos logit contributions
aye+3.143
ersed+2.697
 tongues+2.589
oing+2.588
 hormones+2.580
Neg logit contributions
ighth-3.555
fs-3.520
brates-3.500
umber-3.419
ibia-3.405
Token functioning
Token ID15025
Feature activation+6.30e-01

Pos logit contributions
aye+3.751
ersed+3.269
oing+3.148
 hormones+3.144
 tongues+3.140
Neg logit contributions
ighth-3.501
fs-3.466
brates-3.420
umber-3.372
ibia-3.337
Token of
Token ID286
Feature activation+3.31e-01

Pos logit contributions
aye+2.213
ersed+1.883
oing+1.755
 tongues+1.733
 hormones+1.729
Neg logit contributions
ighth-3.443
fs-3.428
brates-3.396
ibia-3.332
umber-3.329
Token
Token ID198
Feature activation+1.23e-01

Pos logit contributions
fs+1.621
ighth+1.608
brates+1.577
umber+1.553
ibia+1.540
Neg logit contributions
aye-1.973
ersed-1.753
oing-1.697
 tongues-1.650
astered-1.646
Token
Token ID198
Feature activation-4.63e-01

Pos logit contributions
aye+0.632
ersed+0.568
oing+0.538
 tongues+0.534
 hormones+0.532
Neg logit contributions
ighth-0.481
fs-0.475
brates-0.469
umber-0.459
ibia-0.456
Token--
Token ID438
Feature activation+1.07e-01

Pos logit contributions
fs+1.930
ighth+1.908
brates+1.852
FORE+1.834
umber+1.829
Neg logit contributions
aye-2.259
ersed-2.020
 tongues-1.952
oing-1.938
 hormones-1.923
Token
Token ID198
Feature activation+1.07e-02

Pos logit contributions
aye+0.529
ersed+0.472
oing+0.448
 tongues+0.444
 hormones+0.442
Neg logit contributions
ighth-0.439
fs-0.437
brates-0.430
umber-0.422
ibia-0.420
Token
Token ID198
Feature activation-2.26e-01

Pos logit contributions
aye+0.056
ersed+0.051
oing+0.048
 tongues+0.048
 hormones+0.048
Neg logit contributions
ighth-0.041
fs-0.041
brates-0.040
umber-0.039
ibia-0.039
TokenRef
Token ID8134
Feature activation+3.28e+00

Pos logit contributions
fs+0.929
ighth+0.923
brates+0.899
umber+0.885
ibia+0.880
Neg logit contributions
aye-1.116
ersed-1.002
 tongues-0.958
oing-0.955
 hormones-0.949
Tokeneren
Token ID14226
Feature activation-1.21e-01

Pos logit contributions
aye+14.617
ersed+13.063
ersion+12.046
oing+11.998
astered+11.994
Neg logit contributions
brates-11.126
FORE-11.008
ighth-10.933
fs-10.742
ibia-10.711
Tokenced
Token ID771
Feature activation+2.72e-01

Pos logit contributions
fs+0.212
ighth+0.212
brates+0.204
umber+0.194
ibia+0.191
Neg logit contributions
aye-0.882
ersed-0.818
oing-0.789
 tongues-0.787
 hormones-0.785
Token:
Token ID25
Feature activation-3.13e-01

Pos logit contributions
aye+1.253
ersed+1.108
oing+1.047
 tongues+1.037
 hormones+1.031
Neg logit contributions
ighth-1.208
fs-1.204
brates-1.184
umber-1.165
ibia-1.158
Token
Token ID198
Feature activation-3.88e-02

Pos logit contributions
fs+1.331
ighth+1.328
brates+1.302
umber+1.287
ibia+1.269
Neg logit contributions
aye-1.486
ersed-1.325
oing-1.260
 tongues-1.240
astered-1.235
Token
Token ID198
Feature activation-2.29e-01

Pos logit contributions
ighth+0.148
fs+0.147
brates+0.144
umber+0.141
ibia+0.140
Neg logit contributions
aye-0.204
ersed-0.183
oing-0.175
 tongues-0.173
 hormones-0.172
Token brute
Token ID33908
Feature activation-1.58e+00

Pos logit contributions
ighth+1.196
fs+1.189
brates+1.164
umber+1.150
ibia+1.145
Neg logit contributions
aye-1.330
ersed-1.195
oing-1.125
astered-1.100
 hormones-1.095
Token grew
Token ID6348
Feature activation-5.25e-01

Pos logit contributions
fs+6.303
ighth+6.066
umber+6.033
brates+6.023
 else+6.005
Neg logit contributions
aye-7.422
ersed-6.739
oing-6.335
anchester-6.302
ersion-6.177
Token hard
Token ID1327
Feature activation+5.53e-01

Pos logit contributions
fs+2.209
ighth+2.184
brates+2.150
umber+2.129
ibia+2.108
Neg logit contributions
aye-2.540
ersed-2.248
oing-2.090
astered-2.071
yip-2.063
Token panels
Token ID13043
Feature activation+7.26e-03

Pos logit contributions
aye+2.255
ersed+1.972
oing+1.863
 tongues+1.849
 hormones+1.840
Neg logit contributions
ighth-2.722
fs-2.691
brates-2.679
ibia-2.610
umber-2.609
Token of
Token ID286
Feature activation+8.28e-01

Pos logit contributions
aye+0.030
ersed+0.027
oing+0.025
 tongues+0.024
 hormones+0.024
Neg logit contributions
ighth-0.035
fs-0.035
brates-0.035
umber-0.034
ibia-0.034
Token imp
Token ID848
Feature activation+3.28e+00

Pos logit contributions
aye+3.669
ersed+3.187
 tongues+3.121
 hormones+3.087
oing+3.063
Neg logit contributions
fs-3.763
ighth-3.763
brates-3.730
ibia-3.615
umber-3.614
Tokenossibly
Token ID20846
Feature activation+1.08e-01

Pos logit contributions
aye+12.517
ossibly+11.154
ersed+10.941
oing+10.848
astered+10.626
Neg logit contributions
FORE-13.453
brates-13.452
ighth-13.304
fs-12.137
ibia-11.963
Token thick
Token ID6546
Feature activation+8.72e-01

Pos logit contributions
aye+0.520
ersed+0.463
oing+0.438
 tongues+0.434
 hormones+0.432
Neg logit contributions
ighth-0.458
fs-0.456
brates-0.448
umber-0.440
ibia-0.438
Token bone
Token ID9970
Feature activation-5.52e-01

Pos logit contributions
aye+3.756
ersed+3.243
 tongues+3.211
 hormones+3.142
oing+3.141
Neg logit contributions
ighth-4.060
fs-4.005
brates-4.002
FORE-3.887
ibia-3.874
Token all
Token ID477
Feature activation+3.15e-02

Pos logit contributions
fs+2.427
ighth+2.399
brates+2.365
umber+2.320
ibia+2.316
Neg logit contributions
aye-2.570
ersed-2.271
oing-2.149
 hormones-2.081
astered-2.063
Token over
Token ID625
Feature activation-6.59e-01

Pos logit contributions
aye+0.136
ersed+0.120
oing+0.112
 tongues+0.111
 hormones+0.110
Neg logit contributions
ighth-0.149
fs-0.149
brates-0.146
umber-0.144
ibia-0.144
Token
Token ID198
Feature activation-3.94e-01

Pos logit contributions
fs+0.781
ighth+0.777
brates+0.760
umber+0.746
ibia+0.741
Neg logit contributions
aye-1.161
ersed-1.046
oing-1.002
 tongues-0.993
 hormones-0.988
TokenReferences
Token ID19927
Feature activation-5.25e-02

Pos logit contributions
fs+1.601
ighth+1.595
brates+1.539
umber+1.520
FORE+1.515
Neg logit contributions
aye-1.948
ersed-1.760
 tongues-1.687
oing-1.670
 hormones-1.666
Token:
Token ID25
Feature activation-1.02e-01

Pos logit contributions
ighth+0.190
fs+0.189
brates+0.185
umber+0.182
ibia+0.180
Neg logit contributions
aye-0.285
ersed-0.257
oing-0.246
 tongues-0.244
 hormones-0.242
Token
Token ID198
Feature activation-5.60e-02

Pos logit contributions
ighth+0.402
fs+0.400
brates+0.393
umber+0.385
ibia+0.383
Neg logit contributions
aye-0.520
ersed-0.466
oing-0.444
 tongues-0.440
 hormones-0.437
Token
Token ID198
Feature activation-1.89e-01

Pos logit contributions
ighth+0.212
fs+0.211
brates+0.207
umber+0.203
ibia+0.201
Neg logit contributions
aye-0.296
ersed-0.266
oing-0.253
 tongues-0.251
 hormones-0.250
TokenRef
Token ID8134
Feature activation+3.26e+00

Pos logit contributions
fs+0.791
ighth+0.790
brates+0.772
umber+0.761
ibia+0.757
Neg logit contributions
aye-0.916
ersed-0.820
oing-0.777
 tongues-0.774
 hormones-0.768
Token.
Token ID13
Feature activation-2.75e-02

Pos logit contributions
aye+14.810
ersed+12.854
astered+12.288
ersion+12.086
oing+11.852
Neg logit contributions
brates-11.083
ighth-10.554
fs-10.525
FORE-10.503
ibia-10.140
Token 1
Token ID352
Feature activation-1.30e-01

Pos logit contributions
ighth+0.126
fs+0.126
brates+0.124
umber+0.122
ibia+0.121
Neg logit contributions
aye-0.122
ersed-0.108
oing-0.102
 tongues-0.101
 hormones-0.100
Token -
Token ID532
Feature activation+1.71e-01

Pos logit contributions
fs+0.478
ighth+0.477
brates+0.465
umber+0.458
ibia+0.453
Neg logit contributions
aye-0.696
ersed-0.629
oing-0.600
 tongues-0.596
 hormones-0.590
Token The
Token ID383
Feature activation-1.31e-01

Pos logit contributions
aye+0.776
ersed+0.685
oing+0.646
 tongues+0.640
 hormones+0.638
Neg logit contributions
ighth-0.774
fs-0.770
brates-0.760
umber-0.746
ibia-0.743
Token Inner
Token ID24877
Feature activation+7.48e-02

Pos logit contributions
ighth+0.600
fs+0.597
brates+0.587
umber+0.579
ibia+0.576
Neg logit contributions
aye-0.589
ersed-0.521
oing-0.489
 tongues-0.484
 hormones-0.479
Token be
Token ID307
Feature activation-5.26e-01

Pos logit contributions
ighth+0.588
fs+0.587
brates+0.576
umber+0.570
ibia+0.567
Neg logit contributions
aye-0.521
ersed-0.455
oing-0.425
 tongues-0.420
 hormones-0.419
Token their
Token ID511
Feature activation-1.75e-01

Pos logit contributions
ighth+2.243
fs+2.236
ibia+2.165
brates+2.161
umber+2.158
Neg logit contributions
aye-2.499
ersed-2.180
oing-2.079
 hormones-2.040
astered-2.040
Token first
Token ID717
Feature activation+3.60e-02

Pos logit contributions
ighth+0.630
fs+0.626
brates+0.611
umber+0.601
ibia+0.600
Neg logit contributions
aye-0.952
ersed-0.859
oing-0.818
 tongues-0.808
 hormones-0.806
Token
Token ID1399
Feature activation-1.13e+00

Pos logit contributions
aye+0.174
ersed+0.154
oing+0.146
 tongues+0.144
 hormones+0.144
Neg logit contributions
ighth-0.152
fs-0.152
brates-0.149
umber-0.147
ibia-0.146
Token (
Token ID357
Feature activation+5.72e-01

Pos logit contributions
ighth+4.744
fs+4.716
umber+4.501
brates+4.472
ibia+4.443
Neg logit contributions
aye-5.135
ersed-4.685
oing-4.450
 tongues-4.419
 hormones-4.401
TokenSub
Token ID7004
Feature activation+3.15e+00

Pos logit contributions
aye+2.889
ersed+2.552
oing+2.434
 tongues+2.408
 hormones+2.398
Neg logit contributions
ighth-2.278
brates-2.253
fs-2.244
ibia-2.174
umber-2.171
Tokenban
Token ID3820
Feature activation-1.95e-01

Pos logit contributions
aye+14.603
ersed+13.122
oing+12.344
astered+12.230
 hormones+12.209
Neg logit contributions
FORE-9.981
ibia-9.754
brates-9.746
ighth-9.659
fs-9.249
Token averaged
Token ID16449
Feature activation-3.05e-01

Pos logit contributions
ighth+0.859
fs+0.859
brates+0.833
umber+0.830
ibia+0.824
Neg logit contributions
aye-0.904
ersed-0.805
 tongues-0.753
oing-0.752
astered-0.745
Token 5
Token ID642
Feature activation+1.10e-01

Pos logit contributions
ighth+1.427
fs+1.421
brates+1.392
umber+1.382
ibia+1.373
Neg logit contributions
aye-1.328
ersed-1.174
oing-1.098
 tongues-1.082
 hormones-1.075
Token:
Token ID25
Feature activation+1.66e-01

Pos logit contributions
aye+0.560
ersed+0.499
oing+0.476
 tongues+0.472
 hormones+0.469
Neg logit contributions
ighth-0.433
fs-0.431
brates-0.428
umber-0.418
ibia-0.416
Token46
Token ID3510
Feature activation+6.15e-01

Pos logit contributions
aye+0.868
ersed+0.778
oing+0.739
 tongues+0.734
 hormones+0.730
Neg logit contributions
ighth-0.636
fs-0.629
brates-0.622
umber-0.608
ibia-0.605
Token apart
Token ID5475
Feature activation-2.40e-01

Pos logit contributions
ighth+0.279
fs+0.277
brates+0.272
umber+0.269
ibia+0.267
Neg logit contributions
aye-0.297
ersed-0.262
oing-0.249
 tongues-0.245
 hormones-0.244
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
fs+1.048
ighth+1.045
brates+1.023
umber+1.016
ibia+0.997
Neg logit contributions
aye-1.129
ersed-0.990
oing-0.943
astered-0.925
 tongues-0.919
Token
Token ID198
Feature activation+4.41e-02

Pos logit contributions
fs+1.041
ighth+1.037
brates+1.018
umber+0.997
ibia+0.988
Neg logit contributions
aye-1.348
ersed-1.208
oing-1.156
 tongues-1.136
astered-1.132
Token
Token ID198
Feature activation-5.39e-01

Pos logit contributions
aye+0.229
ersed+0.206
oing+0.196
 tongues+0.194
 hormones+0.193
Neg logit contributions
ighth-0.170
fs-0.168
brates-0.166
umber-0.162
ibia-0.161
TokenAdd
Token ID4550
Feature activation+6.14e-01

Pos logit contributions
fs+2.080
ighth+2.057
FORE+1.996
brates+1.994
umber+1.974
Neg logit contributions
aye-2.782
ersed-2.522
 tongues-2.429
oing-2.417
 hormones-2.400
Token Imm
Token ID9543
Feature activation+3.15e+00

Pos logit contributions
aye+2.691
ersed+2.361
oing+2.243
 tongues+2.236
 hormones+2.224
Neg logit contributions
ighth-2.833
fs-2.806
brates-2.768
ibia-2.743
umber-2.730
Tokenersion
Token ID6900
Feature activation+8.21e-01

Pos logit contributions
ersed+11.577
ersion+11.283
aye+10.641
astered+9.739
ersive+9.221
Neg logit contributions
brates-13.257
FORE-13.053
fs-12.749
cape-12.740
Reply-12.466
Token G
Token ID402
Feature activation-9.11e-01

Pos logit contributions
aye+3.555
ersed+3.175
 hormones+3.015
oing+2.987
 tongues+2.984
Neg logit contributions
ighth-3.751
fs-3.663
brates-3.615
ibia-3.595
umber-3.555
Tokenimm
Token ID8608
Feature activation+2.21e+00

Pos logit contributions
fs+3.158
ighth+3.065
brates+3.011
umber+3.006
ibia+2.881
Neg logit contributions
aye-5.097
ersed-4.537
 tongues-4.412
oing-4.333
 hormones-4.328
Tokenicks
Token ID3378
Feature activation+9.84e-01

Pos logit contributions
aye+11.221
ersed+10.125
ersion+9.602
astered+9.547
oing+9.507
Neg logit contributions
FORE-7.513
ighth-7.506
brates-7.297
fs-6.972
ibia-6.930
Token wherever
Token ID14530
Feature activation-3.16e-01

Pos logit contributions
aye+4.267
ersed+3.797
 hormones+3.585
oing+3.567
 tongues+3.546
Neg logit contributions
ighth-4.481
fs-4.388
brates-4.372
ibia-4.316
umber-4.222
Token to
Token ID284
Feature activation-4.04e-01

Pos logit contributions
fs+1.712
ighth+1.709
brates+1.677
umber+1.656
ibia+1.639
Neg logit contributions
aye-1.762
ersed-1.566
oing-1.491
astered-1.464
 hormones-1.444
Token the
Token ID262
Feature activation-1.62e-01

Pos logit contributions
fs+1.811
ighth+1.810
brates+1.772
umber+1.768
ibia+1.740
Neg logit contributions
aye-1.807
ersed-1.611
oing-1.524
astered-1.485
 hormones-1.479
Token lack
Token ID3092
Feature activation+2.08e-01

Pos logit contributions
ighth+0.704
fs+0.703
brates+0.687
umber+0.682
ibia+0.675
Neg logit contributions
aye-0.761
ersed-0.676
oing-0.638
astered-0.628
 tongues-0.627
Token of
Token ID286
Feature activation+3.05e-01

Pos logit contributions
aye+0.784
ersed+0.673
oing+0.626
 tongues+0.619
 hormones+0.615
Neg logit contributions
ighth-1.099
fs-1.096
brates-1.080
umber-1.066
ibia-1.061
Token an
Token ID281
Feature activation-1.44e-01

Pos logit contributions
aye+1.389
ersed+1.224
oing+1.158
 tongues+1.152
 hormones+1.144
Neg logit contributions
ighth-1.374
fs-1.368
brates-1.351
umber-1.327
ibia-1.319
Token imp
Token ID848
Feature activation+3.13e+00

Pos logit contributions
ighth+0.649
fs+0.645
brates+0.627
umber+0.626
ibia+0.622
Neg logit contributions
aye-0.653
ersed-0.580
oing-0.545
 tongues-0.543
 hormones-0.536
Tokenassion
Token ID11857
Feature activation-9.72e-01

Pos logit contributions
aye+12.339
ersed+10.832
oing+10.341
ossibly+10.290
ersion+10.075
Neg logit contributions
brates-13.619
FORE-12.601
ighth-12.463
emis-11.917
fs-11.894
Tokened
Token ID276
Feature activation-1.36e-01

Pos logit contributions
fs+4.471
ighth+4.386
brates+4.289
umber+4.238
ces+4.147
Neg logit contributions
aye-4.198
ersed-3.727
 tongues-3.509
 antibodies-3.458
 hormones-3.452
Token,
Token ID11
Feature activation+7.44e-01

Pos logit contributions
fs+0.628
ighth+0.628
brates+0.611
umber+0.607
ibia+0.603
Neg logit contributions
aye-0.592
ersed-0.529
oing-0.494
astered-0.487
 hormones-0.485
Token positive
Token ID3967
Feature activation+2.69e-01

Pos logit contributions
aye+3.284
ersed+2.860
 tongues+2.712
oing+2.705
 hormones+2.686
Neg logit contributions
ighth-3.431
fs-3.424
brates-3.391
umber-3.328
ibia-3.310
Token,
Token ID11
Feature activation+3.65e-01

Pos logit contributions
aye+1.217
ersed+1.073
oing+1.013
 tongues+1.003
 hormones+0.998
Neg logit contributions
ighth-1.220
fs-1.215
brates-1.197
umber-1.176
ibia-1.170
Token over
Token ID625
Feature activation-6.64e-01

Pos logit contributions
aye+0.096
ersed+0.083
oing+0.077
 tongues+0.076
 hormones+0.076
Neg logit contributions
ighth-0.123
fs-0.123
brates-0.121
umber-0.120
ibia-0.119
Token Baylor
Token ID28245
Feature activation+4.98e-02

Pos logit contributions
ighth+2.750
fs+2.721
brates+2.632
umber+2.619
ibia+2.600
Neg logit contributions
aye-3.176
ersed-2.894
oing-2.716
astered-2.671
 tongues-2.650
Token.
Token ID13
Feature activation-1.92e-02

Pos logit contributions
aye+0.234
ersed+0.209
oing+0.197
 tongues+0.195
 hormones+0.194
Neg logit contributions
ighth-0.217
fs-0.216
brates-0.212
umber-0.208
ibia-0.207
Token
Token ID198
Feature activation+3.54e-01

Pos logit contributions
ighth+0.083
fs+0.082
brates+0.081
umber+0.079
ibia+0.079
Neg logit contributions
aye-0.092
ersed-0.081
oing-0.077
 tongues-0.076
 hormones-0.076
Token
Token ID198
Feature activation-1.56e-02

Pos logit contributions
aye+1.739
ersed+1.559
oing+1.465
 tongues+1.457
 hormones+1.451
Neg logit contributions
ighth-1.460
fs-1.421
brates-1.419
umber-1.388
ibia-1.377
TokenTurn
Token ID17278
Feature activation+3.11e+00

Pos logit contributions
ighth+0.066
fs+0.065
brates+0.064
umber+0.063
ibia+0.063
Neg logit contributions
aye-0.076
ersed-0.068
oing-0.064
 tongues-0.064
 hormones-0.063
Tokening
Token ID278
Feature activation+1.77e+00

Pos logit contributions
aye+11.584
oing+10.218
ersed+9.965
stone+9.480
astered+9.472
Neg logit contributions
brates-13.210
FORE-12.814
ighth-12.729
ibia-12.683
fs-12.576
Token to
Token ID284
Feature activation-1.45e-01

Pos logit contributions
aye+7.011
ersed+5.987
 tongues+5.805
oing+5.717
 hormones+5.674
Neg logit contributions
fs-8.246
ighth-8.231
ibia-8.187
FORE-8.186
brates-8.165
Token the
Token ID262
Feature activation+1.95e-01

Pos logit contributions
ighth+0.651
fs+0.648
brates+0.636
umber+0.626
ibia+0.623
Neg logit contributions
aye-0.658
ersed-0.584
oing-0.550
 tongues-0.541
 hormones-0.538
Token Rice
Token ID13823
Feature activation+1.69e-02

Pos logit contributions
aye+0.884
ersed+0.779
oing+0.738
 tongues+0.732
 hormones+0.729
Neg logit contributions
ighth-0.879
fs-0.875
brates-0.863
umber-0.847
ibia-0.843
Token Trading
Token ID25469
Feature activation+1.91e-01

Pos logit contributions
aye+0.077
ersed+0.068
oing+0.064
 tongues+0.064
 hormones+0.063
Neg logit contributions
ighth-0.076
fs-0.076
brates-0.074
umber-0.073
ibia-0.073
Token and
Token ID290
Feature activation+8.17e-01

Pos logit contributions
aye+2.921
ersed+2.550
oing+2.413
 tongues+2.394
 hormones+2.392
Neg logit contributions
ighth-2.919
fs-2.892
brates-2.866
ibia-2.797
umber-2.794
Token the
Token ID262
Feature activation-1.33e-01

Pos logit contributions
aye+3.500
ersed+3.024
 hormones+2.888
 tongues+2.871
oing+2.852
Neg logit contributions
ighth-3.842
fs-3.829
brates-3.793
ibia-3.711
umber-3.699
Token word
Token ID1573
Feature activation+6.42e-01

Pos logit contributions
ighth+0.597
fs+0.593
brates+0.582
umber+0.576
ibia+0.572
Neg logit contributions
aye-0.607
ersed-0.541
oing-0.508
 tongues-0.499
astered-0.499
Token
Token ID564
Feature activation+7.19e-01

Pos logit contributions
aye+2.958
ersed+2.600
 tongues+2.465
 hormones+2.452
oing+2.452
Neg logit contributions
ighth-2.852
fs-2.819
brates-2.791
ibia-2.724
umber-2.718
Token
Token ID246
Feature activation+3.85e-01

Pos logit contributions
aye+3.960
ersed+3.528
oing+3.334
 hormones+3.331
 tongues+3.330
Neg logit contributions
fs-2.527
ighth-2.496
brates-2.476
umber-2.433
ibia-2.386
TokenDep
Token ID12156
Feature activation+3.11e+00

Pos logit contributions
aye+1.896
ersed+1.683
oing+1.598
 tongues+1.580
 hormones+1.574
Neg logit contributions
ighth-1.595
fs-1.582
brates-1.563
umber-1.528
ibia-1.521
Tokenression
Token ID2234
Feature activation+8.03e-01

Pos logit contributions
aye+14.011
ersed+12.636
ersion+11.584
astered+11.569
oing+11.499
Neg logit contributions
brates-10.804
FORE-10.727
ighth-10.455
fs-10.229
ibia-9.790
Token
Token ID447
Feature activation+3.37e-01

Pos logit contributions
aye+3.869
ersed+3.436
 hormones+3.248
 tongues+3.236
oing+3.233
Neg logit contributions
ighth-3.365
fs-3.317
brates-3.303
umber-3.190
ibia-3.190
Token
Token ID247
Feature activation+7.70e-01

Pos logit contributions
aye+1.719
ersed+1.543
oing+1.461
 tongues+1.449
 hormones+1.445
Neg logit contributions
ighth-1.335
fs-1.325
brates-1.303
umber-1.280
ibia-1.272
Token,
Token ID11
Feature activation+7.94e-01

Pos logit contributions
aye+3.515
ersed+3.095
oing+2.908
 tongues+2.889
 hormones+2.888
Neg logit contributions
ighth-3.433
fs-3.388
brates-3.366
ibia-3.281
umber-3.279
Token though
Token ID996
Feature activation+9.02e-02

Pos logit contributions
aye+3.323
ersed+2.879
 tongues+2.714
oing+2.702
 hormones+2.702
Neg logit contributions
ighth-3.841
fs-3.813
brates-3.790
ibia-3.699
umber-3.684
Token-
Token ID12
Feature activation+6.12e-01

Pos logit contributions
aye+0.139
ersed+0.124
oing+0.117
 tongues+0.116
 hormones+0.116
Neg logit contributions
ighth-0.126
fs-0.126
brates-0.124
umber-0.121
ibia-0.121
Token Un
Token ID791
Feature activation+8.91e-01

Pos logit contributions
aye+2.705
ersed+2.392
 tongues+2.243
oing+2.231
 hormones+2.221
Neg logit contributions
ighth-2.809
fs-2.764
brates-2.725
umber-2.706
ibia-2.704
Tokenem
Token ID368
Feature activation+8.15e-01

Pos logit contributions
aye+4.581
ersed+4.133
oing+3.952
astered+3.877
 tongues+3.854
Neg logit contributions
ighth-3.343
brates-3.300
ibia-3.238
fs-3.203
umber-3.187
Tokenotional
Token ID25453
Feature activation+6.67e-01

Pos logit contributions
aye+2.558
ersed+2.224
oing+1.972
astered+1.954
 tongues+1.864
Neg logit contributions
ighth-4.728
fs-4.665
FORE-4.656
brates-4.630
umber-4.607
Token,
Token ID11
Feature activation+1.62e-01

Pos logit contributions
aye+2.846
ersed+2.489
oing+2.362
 hormones+2.350
 tongues+2.345
Neg logit contributions
ighth-3.145
fs-3.124
brates-3.059
umber-3.041
ibia-3.029
Token Sub
Token ID3834
Feature activation+3.10e+00

Pos logit contributions
aye+0.730
ersed+0.643
oing+0.607
 tongues+0.600
 hormones+0.597
Neg logit contributions
ighth-0.739
fs-0.737
brates-0.725
umber-0.712
ibia-0.709
Tokenserv
Token ID3168
Feature activation+1.47e+00

Pos logit contributions
aye+12.641
ersed+12.162
astered+11.130
versive+11.130
oing+11.071
Neg logit contributions
FORE-11.548
ibia-10.923
ighth-10.868
emis-10.115
brates-9.920
Tokenient
Token ID1153
Feature activation+5.50e-01

Pos logit contributions
aye+6.098
ersed+5.681
oing+5.514
astered+5.139
ersion+4.964
Neg logit contributions
FORE-6.427
ighth-6.361
brates-6.297
ibia-6.232
umber-6.196
Token,
Token ID11
Feature activation+1.30e-01

Pos logit contributions
aye+2.420
ersed+2.126
 tongues+2.021
oing+2.015
 hormones+2.003
Neg logit contributions
ighth-2.532
fs-2.500
brates-2.463
umber-2.438
ibia-2.428
Token Violent
Token ID35615
Feature activation+1.39e-01

Pos logit contributions
aye+0.584
ersed+0.514
oing+0.485
 tongues+0.480
 hormones+0.477
Neg logit contributions
ighth-0.593
fs-0.591
brates-0.581
umber-0.571
ibia-0.568
Token
Token ID198
Feature activation+3.14e-01

Pos logit contributions
aye+0.644
ersed+0.570
oing+0.540
 tongues+0.535
 hormones+0.532
Neg logit contributions
ighth-0.610
fs-0.607
brates-0.597
umber-0.588
ibia-0.585
Token could
Token ID714
Feature activation-3.87e-01

Pos logit contributions
ighth+3.638
fs+3.624
ibia+3.472
umber+3.449
brates+3.393
Neg logit contributions
aye-4.823
ersed-4.274
yip-4.046
oing-4.041
anchester-4.004
Token create
Token ID2251
Feature activation+8.18e-01

Pos logit contributions
fs+1.580
ighth+1.570
umber+1.518
brates+1.515
ibia+1.500
Neg logit contributions
aye-1.934
ersed-1.739
oing-1.622
astered-1.604
anchester-1.589
Token a
Token ID257
Feature activation+2.31e-01

Pos logit contributions
aye+3.422
ersed+2.975
 tongues+2.869
 hormones+2.858
oing+2.830
Neg logit contributions
ighth-3.904
fs-3.852
brates-3.848
ibia-3.771
umber-3.764
Token paper
Token ID3348
Feature activation-2.56e-01

Pos logit contributions
aye+1.113
ersed+0.988
oing+0.940
 tongues+0.934
 hormones+0.930
Neg logit contributions
ighth-0.975
fs-0.968
brates-0.958
umber-0.937
ibia-0.934
Token trail
Token ID8025
Feature activation-2.38e-01

Pos logit contributions
fs+0.948
ighth+0.939
brates+0.909
umber+0.902
ibia+0.892
Neg logit contributions
aye-1.373
ersed-1.238
oing-1.183
 tongues-1.169
 hormones-1.164
Token imp
Token ID848
Feature activation+3.07e+00

Pos logit contributions
fs+1.061
ighth+1.050
brates+1.025
umber+1.020
ibia+1.012
Neg logit contributions
aye-1.097
ersed-0.974
oing-0.916
 tongues-0.906
 hormones-0.899
Tokenug
Token ID1018
Feature activation+1.01e+00

Pos logit contributions
aye+11.518
ersed+10.396
oing+10.067
astered+9.555
ersion+9.482
Neg logit contributions
brates-13.542
emis-12.288
ighth-12.279
FORE-12.077
cape-11.807
Tokenning
Token ID768
Feature activation-6.27e-02

Pos logit contributions
aye+4.841
ersed+4.349
oing+4.163
 tongues+4.046
astered+3.987
Neg logit contributions
ighth-4.162
fs-4.110
brates-4.091
ibia-4.049
FORE-4.013
Token Price
Token ID7886
Feature activation+8.81e-01

Pos logit contributions
ighth+0.284
fs+0.283
brates+0.278
umber+0.274
ibia+0.272
Neg logit contributions
aye-0.283
ersed-0.250
oing-0.236
 tongues-0.233
 hormones-0.232
Token
Token ID447
Feature activation-8.63e-03

Pos logit contributions
aye+3.788
ersed+3.293
 hormones+3.166
oing+3.126
 tongues+3.124
Neg logit contributions
brates-4.032
ighth-4.019
fs-4.003
ibia-3.906
umber-3.880
Token
Token ID247
Feature activation+8.33e-02

Pos logit contributions
ighth+0.028
fs+0.028
brates+0.028
umber+0.027
ibia+0.027
Neg logit contributions
aye-0.050
ersed-0.045
oing-0.043
 tongues-0.043
 hormones-0.043
Token talks
Token ID6130
Feature activation+1.65e-01

Pos logit contributions
aye+1.506
ersed+1.315
oing+1.248
 tongues+1.236
 hormones+1.231
Neg logit contributions
ighth-1.588
fs-1.572
brates-1.560
ibia-1.528
umber-1.527
Token at
Token ID379
Feature activation-1.98e-01

Pos logit contributions
aye+0.722
ersed+0.637
oing+0.597
 tongues+0.591
 hormones+0.585
Neg logit contributions
fs-0.770
ighth-0.770
brates-0.750
umber-0.741
ibia-0.738
Token the
Token ID262
Feature activation+1.68e-01

Pos logit contributions
fs+0.932
ighth+0.929
brates+0.906
umber+0.905
ibia+0.896
Neg logit contributions
aye-0.858
ersed-0.761
oing-0.711
 tongues-0.698
astered-0.696
Token
Token ID564
Feature activation+2.91e-01

Pos logit contributions
aye+0.749
ersed+0.659
oing+0.622
 tongues+0.616
 hormones+0.613
Neg logit contributions
ighth-0.775
fs-0.772
brates-0.761
umber-0.747
ibia-0.744
Token
Token ID250
Feature activation+1.97e-01

Pos logit contributions
aye+1.276
ersed+1.116
oing+1.052
 tongues+1.039
 hormones+1.036
Neg logit contributions
ighth-1.354
fs-1.351
brates-1.335
umber-1.309
ibia-1.300
Tokenimp
Token ID11011
Feature activation+3.07e+00

Pos logit contributions
aye+0.989
ersed+0.884
oing+0.840
 tongues+0.832
 hormones+0.828
Neg logit contributions
ighth-0.790
fs-0.786
brates-0.774
umber-0.758
ibia-0.754
Tokenasse
Token ID21612
Feature activation-4.35e-01

Pos logit contributions
aye+11.659
ersed+10.058
oing+9.568
ersion+9.513
oti+9.315
Neg logit contributions
brates-13.863
ighth-13.157
emis-12.866
FORE-12.743
fs-12.273
Token,
Token ID11
Feature activation-5.96e-02

Pos logit contributions
fs+1.974
ighth+1.944
brates+1.889
umber+1.878
ibia+1.864
Neg logit contributions
aye-1.960
ersed-1.745
oing-1.635
 tongues-1.606
astered-1.590
Token
Token ID447
Feature activation+1.62e-03

Pos logit contributions
ighth+0.283
fs+0.282
brates+0.276
umber+0.273
ibia+0.272
Neg logit contributions
aye-0.256
ersed-0.225
oing-0.211
 tongues-0.209
astered-0.207
Token
Token ID251
Feature activation-2.83e-01

Pos logit contributions
aye+0.008
ersed+0.007
oing+0.007
 tongues+0.007
 hormones+0.007
Neg logit contributions
ighth-0.006
fs-0.006
brates-0.006
umber-0.006
ibia-0.006
Token but
Token ID475
Feature activation+2.28e-01

Pos logit contributions
ighth+1.320
fs+1.318
brates+1.286
umber+1.272
ibia+1.266
Neg logit contributions
aye-1.234
ersed-1.095
oing-1.019
 tongues-1.016
 hormones-1.002
Token that
Token ID326
Feature activation+5.65e-01

Pos logit contributions
fs+0.939
ighth+0.936
brates+0.906
umber+0.896
ibia+0.891
Neg logit contributions
aye-1.434
ersed-1.295
oing-1.236
astered-1.221
 tongues-1.216
Token its
Token ID663
Feature activation+5.32e-01

Pos logit contributions
aye+2.394
ersed+2.078
oing+1.961
 tongues+1.959
 hormones+1.955
Neg logit contributions
ighth-2.721
fs-2.683
brates-2.682
umber-2.617
ibia-2.604
Token success
Token ID1943
Feature activation-7.71e-02

Pos logit contributions
aye+2.479
ersed+2.193
oing+2.091
 tongues+2.088
 hormones+2.079
Neg logit contributions
ighth-2.321
fs-2.302
brates-2.283
umber-2.240
ibia-2.214
Token
Token ID564
Feature activation+8.04e-01

Pos logit contributions
fs+0.360
ighth+0.358
brates+0.351
ibia+0.347
umber+0.347
Neg logit contributions
aye-0.339
ersed-0.297
oing-0.280
astered-0.275
 tongues-0.274
Token
Token ID250
Feature activation+8.55e-01

Pos logit contributions
aye+3.707
ersed+3.255
oing+3.063
 tongues+3.054
 hormones+3.042
Neg logit contributions
ighth-3.481
fs-3.464
brates-3.443
umber-3.397
ibia-3.324
Tokendep
Token ID10378
Feature activation+3.05e+00

Pos logit contributions
aye+4.404
ersed+3.891
oing+3.705
 hormones+3.630
astered+3.629
Neg logit contributions
ighth-3.306
brates-3.260
fs-3.228
umber-3.177
ibia-3.139
Tokenends
Token ID2412
Feature activation+1.23e+00

Pos logit contributions
aye+10.021
ersed+9.979
oing+8.338
astered+8.089
ersion+7.998
Neg logit contributions
FORE-13.714
brates-13.520
Reply-13.472
ighth-13.428
cape-13.297
Token on
Token ID319
Feature activation+9.67e-02

Pos logit contributions
aye+4.875
ersed+4.329
 hormones+3.974
astered+3.892
 tongues+3.884
Neg logit contributions
brates-6.025
ighth-5.967
fs-5.884
FORE-5.748
ibia-5.707
Token us
Token ID514
Feature activation+4.23e-01

Pos logit contributions
aye+0.464
ersed+0.412
oing+0.391
 tongues+0.388
 hormones+0.386
Neg logit contributions
ighth-0.411
fs-0.409
brates-0.403
umber-0.396
ibia-0.393
Token all
Token ID477
Feature activation+2.10e-01

Pos logit contributions
aye+2.050
ersed+1.819
oing+1.736
 hormones+1.708
 tongues+1.708
Neg logit contributions
ighth-1.781
fs-1.765
brates-1.756
umber-1.706
ibia-1.695
Token
Token ID447
Feature activation-6.74e-02

Pos logit contributions
aye+1.019
ersed+0.906
oing+0.861
 tongues+0.852
 hormones+0.849
Neg logit contributions
ighth-0.886
fs-0.880
brates-0.869
umber-0.849
ibia-0.846
Token an
Token ID281
Feature activation-4.70e-01

Pos logit contributions
fs+4.606
ighth+4.555
brates+4.438
ibia+4.422
umber+4.357
Neg logit contributions
aye-4.778
ersed-4.246
oing-4.087
astered-3.970
 hormones-3.940
Token alliance
Token ID12801
Feature activation+4.96e-02

Pos logit contributions
ighth+2.040
fs+2.027
umber+1.969
ibia+1.944
brates+1.942
Neg logit contributions
aye-2.190
ersed-1.952
oing-1.844
 tongues-1.834
 hormones-1.818
Token with
Token ID351
Feature activation-1.17e-01

Pos logit contributions
aye+0.233
ersed+0.206
oing+0.195
 tongues+0.193
 hormones+0.192
Neg logit contributions
ighth-0.217
fs-0.216
brates-0.212
umber-0.209
ibia-0.208
Token B
Token ID347
Feature activation-1.23e+00

Pos logit contributions
ighth+0.544
fs+0.543
brates+0.533
umber+0.526
ibia+0.523
Neg logit contributions
aye-0.517
ersed-0.455
oing-0.430
 tongues-0.424
 hormones-0.421
Tokenibli
Token ID29142
Feature activation-5.35e-01

Pos logit contributions
ighth+5.198
fs+5.176
umber+5.031
emis+4.974
ibia+4.929
Neg logit contributions
aye-5.599
ersed-4.924
 tongues-4.920
oing-4.838
 hormones-4.833
Tokenothe
Token ID20388
Feature activation-3.24e+00

Pos logit contributions
ighth+2.307
fs+2.267
umber+2.252
brates+2.215
ibia+2.201
Neg logit contributions
aye-2.494
ersed-2.234
 tongues-2.140
 hormones-2.115
oing-2.102
Tokenca
Token ID6888
Feature activation-5.78e-01

Pos logit contributions
que+8.567
fs+8.564
ces+8.476
ighth+8.245
ters+8.129
Neg logit contributions
ersed-16.379
aye-16.312
 tongues-15.295
 hormones-15.236
interstitial-14.759
Token to
Token ID284
Feature activation-5.99e-02

Pos logit contributions
fs+2.567
ighth+2.540
brates+2.516
ibia+2.495
umber+2.481
Neg logit contributions
aye-2.631
ersed-2.348
oing-2.245
astered-2.171
 tongues-2.157
Token directly
Token ID3264
Feature activation-3.96e-01

Pos logit contributions
ighth+0.249
fs+0.249
brates+0.244
umber+0.240
ibia+0.239
Neg logit contributions
aye-0.293
ersed-0.262
oing-0.248
 tongues-0.244
 hormones-0.243
Token sell
Token ID3677
Feature activation-2.61e-01

Pos logit contributions
ighth+1.583
fs+1.576
brates+1.529
umber+1.512
ibia+1.508
Neg logit contributions
aye-2.001
ersed-1.804
oing-1.709
 tongues-1.678
 hormones-1.676
Token its
Token ID663
Feature activation-4.88e-01

Pos logit contributions
ighth+1.156
fs+1.148
brates+1.124
umber+1.107
ibia+1.098
Neg logit contributions
aye-1.205
ersed-1.071
oing-1.016
astered-0.991
 tongues-0.989
Token Pi
Token ID13993
Feature activation-1.85e-01

Pos logit contributions
ighth+1.553
fs+1.546
brates+1.476
umber+1.466
ibia+1.445
Neg logit contributions
aye-3.016
ersed-2.742
oing-2.649
 hormones-2.601
 tongues-2.589
Token,
Token ID11
Feature activation-1.02e-01

Pos logit contributions
ighth+0.842
fs+0.841
brates+0.818
umber+0.814
ibia+0.809
Neg logit contributions
aye-0.836
ersed-0.737
oing-0.694
 tongues-0.686
 hormones-0.684
Token Nvidia
Token ID27699
Feature activation+3.80e-01

Pos logit contributions
fs+0.469
ighth+0.469
brates+0.459
umber+0.454
ibia+0.451
Neg logit contributions
aye-0.451
ersed-0.396
oing-0.373
 tongues-0.368
 hormones-0.367
Token Shield
Token ID10022
Feature activation+1.08e+00

Pos logit contributions
aye+1.789
ersed+1.595
 tongues+1.502
oing+1.499
 hormones+1.492
Neg logit contributions
ighth-1.625
fs-1.614
brates-1.600
FORE-1.556
umber-1.556
Token,
Token ID11
Feature activation+3.40e-02

Pos logit contributions
aye+4.467
ersed+3.958
oing+3.715
 hormones+3.666
 tongues+3.644
Neg logit contributions
brates-5.047
ighth-5.043
fs-4.989
FORE-4.911
ibia-4.887
Token Chrom
Token ID18255
Feature activation-2.94e+00

Pos logit contributions
aye+0.154
ersed+0.136
oing+0.128
 tongues+0.127
 hormones+0.126
Neg logit contributions
ighth-0.154
fs-0.153
brates-0.151
umber-0.148
ibia-0.148
Tokenecast
Token ID43299
Feature activation+4.60e-01

Pos logit contributions
fs+8.877
ibia+8.288
ighth+8.157
umber+7.765
cape+7.456
Neg logit contributions
aye-14.094
ersed-14.043
anchester-13.229
 tongues-13.215
interstitial-12.959
Token or
Token ID393
Feature activation-5.10e-01

Pos logit contributions
aye+1.976
ersed+1.732
oing+1.636
 tongues+1.622
 hormones+1.610
Neg logit contributions
ighth-2.169
fs-2.156
brates-2.140
umber-2.091
ibia-2.088
Token Fire
Token ID3764
Feature activation-3.31e-01

Pos logit contributions
fs+2.351
ighth+2.336
brates+2.287
umber+2.267
ibia+2.244
Neg logit contributions
aye-2.234
ersed-1.972
oing-1.853
 tongues-1.823
 hormones-1.809
TokenSt
Token ID1273
Feature activation-7.99e-01

Pos logit contributions
fs+1.568
ighth+1.562
umber+1.525
brates+1.524
ibia+1.489
Neg logit contributions
aye-1.410
ersed-1.250
oing-1.184
 tongues-1.154
 hormones-1.154
Tokenick
Token ID624
Feature activation+9.89e-01

Pos logit contributions
ighth+2.604
fs+2.560
umber+2.515
brates+2.436
emis+2.398
Neg logit contributions
aye-4.521
ersed-4.142
 tongues-4.031
oing-4.010
 hormones-3.959
Token
Token ID251
Feature activation+4.93e-01

Pos logit contributions
aye+0.883
ersed+0.795
oing+0.759
 tongues+0.752
 hormones+0.749
Neg logit contributions
ighth-0.611
fs-0.608
brates-0.596
umber-0.584
ibia-0.581
Token and
Token ID290
Feature activation+4.15e-01

Pos logit contributions
aye+2.298
ersed+2.046
oing+1.931
 hormones+1.907
 tongues+1.907
Neg logit contributions
ighth-2.150
fs-2.130
brates-2.112
umber-2.070
ibia-2.056
Token
Token ID564
Feature activation-6.90e-02

Pos logit contributions
aye+1.775
ersed+1.557
oing+1.464
 tongues+1.454
 hormones+1.447
Neg logit contributions
ighth-1.973
fs-1.967
brates-1.942
umber-1.908
ibia-1.898
Token
Token ID250
Feature activation+1.93e-01

Pos logit contributions
ighth+0.333
fs+0.332
brates+0.326
umber+0.321
ibia+0.321
Neg logit contributions
aye-0.292
ersed-0.256
oing-0.240
 tongues-0.238
 hormones-0.237
Tokenactive
Token ID5275
Feature activation-7.26e-02

Pos logit contributions
aye+0.984
ersed+0.882
oing+0.839
 tongues+0.833
 hormones+0.829
Neg logit contributions
ighth-0.760
fs-0.758
brates-0.742
umber-0.729
ibia-0.724
Token bystand
Token ID33326
Feature activation-2.83e+00

Pos logit contributions
ighth+0.331
fs+0.330
brates+0.322
umber+0.319
ibia+0.316
Neg logit contributions
aye-0.325
ersed-0.285
oing-0.270
 tongues-0.266
astered-0.265
Tokener
Token ID263
Feature activation-9.97e-01

Pos logit contributions
fs+6.297
ighth+6.221
umber+6.081
ces+5.689
cape+5.422
Neg logit contributions
aye-16.015
 hormones-15.360
 tongues-15.157
 ultras-15.093
ersed-14.946
Token training
Token ID3047
Feature activation-7.16e-02

Pos logit contributions
ighth+3.982
fs+3.961
umber+3.810
brates+3.782
ces+3.727
Neg logit contributions
aye-4.826
ersed-4.328
yip-4.141
oing-4.051
astered-4.031
Token
Token ID447
Feature activation-2.50e-01

Pos logit contributions
fs+0.299
ighth+0.298
brates+0.292
umber+0.288
ibia+0.286
Neg logit contributions
aye-0.350
ersed-0.310
oing-0.294
 tongues-0.293
 hormones-0.289
Token
Token ID251
Feature activation+3.93e-01

Pos logit contributions
ighth+0.845
fs+0.842
brates+0.820
ibia+0.800
umber+0.796
Neg logit contributions
aye-1.419
ersed-1.280
oing-1.228
 tongues-1.218
astered-1.211
Token.
Token ID13
Feature activation+4.30e-01

Pos logit contributions
aye+1.872
ersed+1.670
oing+1.579
 tongues+1.562
 hormones+1.560
Neg logit contributions
ighth-1.678
fs-1.663
brates-1.643
umber-1.613
ibia-1.604
Token,
Token ID11
Feature activation+2.14e-01

Pos logit contributions
aye+0.955
ersed+0.851
oing+0.808
 tongues+0.802
 hormones+0.798
Neg logit contributions
ighth-0.802
fs-0.797
brates-0.784
umber-0.769
ibia-0.765
Token in
Token ID287
Feature activation+5.76e-01

Pos logit contributions
aye+0.969
ersed+0.855
oing+0.807
 tongues+0.798
 hormones+0.794
Neg logit contributions
ighth-0.973
fs-0.969
brates-0.954
umber-0.938
ibia-0.933
Token conjunction
Token ID17856
Feature activation-2.25e-01

Pos logit contributions
aye+2.562
ersed+2.281
 tongues+2.173
oing+2.162
 hormones+2.150
Neg logit contributions
ighth-2.602
fs-2.571
brates-2.553
ibia-2.500
FORE-2.492
Token with
Token ID351
Feature activation+1.04e-01

Pos logit contributions
fs+0.941
ighth+0.936
brates+0.922
umber+0.909
ibia+0.899
Neg logit contributions
aye-1.100
ersed-0.974
oing-0.929
 tongues-0.917
astered-0.908
Token Encyclopedia
Token ID36732
Feature activation+2.80e-01

Pos logit contributions
aye+0.469
ersed+0.413
oing+0.389
 tongues+0.385
 hormones+0.382
Neg logit contributions
ighth-0.473
fs-0.471
brates-0.464
umber-0.457
ibia-0.453
Token Britann
Token ID46693
Feature activation-2.79e+00

Pos logit contributions
aye+1.289
ersed+1.141
oing+1.078
 tongues+1.068
 hormones+1.064
Neg logit contributions
ighth-1.238
fs-1.233
brates-1.214
umber-1.192
ibia-1.188
Tokenica
Token ID3970
Feature activation-5.70e-01

Pos logit contributions
icity+4.841
fs+4.615
ibia+4.205
ighth+3.996
umber+3.947
Neg logit contributions
aye-17.855
anchester-17.734
ersed-17.285
yip-17.178
 hormones-16.996
Token Films
Token ID30198
Feature activation-6.69e-01

Pos logit contributions
fs+2.360
ighth+2.344
brates+2.299
ibia+2.265
umber+2.263
Neg logit contributions
aye-2.790
ersed-2.489
oing-2.372
 tongues-2.320
 hormones-2.311
Token and
Token ID290
Feature activation+3.72e-01

Pos logit contributions
fs+2.681
ighth+2.661
brates+2.615
umber+2.569
ibia+2.568
Neg logit contributions
aye-3.336
ersed-3.006
oing-2.827
 tongues-2.796
 hormones-2.779
Token an
Token ID281
Feature activation+8.42e-02

Pos logit contributions
aye+1.664
ersed+1.468
oing+1.385
 tongues+1.373
 hormones+1.365
Neg logit contributions
ighth-1.699
fs-1.692
brates-1.666
umber-1.638
ibia-1.633
Token O
Token ID440
Feature activation+1.01e-01

Pos logit contributions
aye+0.363
ersed+0.318
oing+0.299
 tongues+0.296
 hormones+0.294
Neg logit contributions
ighth-0.399
fs-0.398
brates-0.391
umber-0.386
ibia-0.384
Token (
Token ID357
Feature activation-7.59e-01

Pos logit contributions
ighth+1.032
fs+1.032
brates+1.006
umber+0.994
ibia+0.982
Neg logit contributions
aye-1.345
ersed-1.205
oing-1.147
 tongues-1.127
astered-1.126
Token SELECT
Token ID33493
Feature activation+6.17e-01

Pos logit contributions
fs+3.003
ighth+2.991
brates+2.895
umber+2.870
ibia+2.828
Neg logit contributions
aye-3.838
ersed-3.435
oing-3.287
astered-3.213
 tongues-3.208
Token SPL
Token ID46341
Feature activation+1.22e+00

Pos logit contributions
aye+2.965
ersed+2.654
 tongues+2.525
 hormones+2.507
oing+2.495
Neg logit contributions
ighth-2.569
brates-2.533
fs-2.524
ibia-2.445
FORE-2.423
TokenIT
Token ID2043
Feature activation-1.65e-02

Pos logit contributions
aye+4.987
ersed+4.518
oing+4.111
 tongues+4.094
astered+4.066
Neg logit contributions
ighth-5.669
brates-5.644
emis-5.551
ibia-5.506
fs-5.502
Token (
Token ID357
Feature activation-4.73e-01

Pos logit contributions
ighth+0.063
fs+0.063
brates+0.061
umber+0.060
ibia+0.060
Neg logit contributions
aye-0.087
ersed-0.078
oing-0.074
 tongues-0.073
 hormones-0.073
Token content
Token ID2695
Feature activation-2.75e+00

Pos logit contributions
fs+1.917
ighth+1.906
umber+1.845
brates+1.845
ibia+1.813
Neg logit contributions
aye-2.363
ersed-2.094
oing-2.014
 hormones-1.965
 tongues-1.965
Token,
Token ID837
Feature activation-1.42e+00

Pos logit contributions
fs+8.921
umber+8.361
ighth+8.295
ters+7.994
brates+7.976
Neg logit contributions
aye-13.900
ersed-11.813
oing-11.643
 antibodies-11.573
astered-11.564
Token '
Token ID705
Feature activation-4.85e-01

Pos logit contributions
fs+5.718
ighth+5.625
umber+5.517
brates+5.355
ibia+5.285
Neg logit contributions
aye-6.773
ersed-5.925
oing-5.790
 hormones-5.542
astered-5.515
Token
Token ID198
Feature activation+1.51e-01

Pos logit contributions
fs+1.737
ighth+1.717
umber+1.661
brates+1.658
ibia+1.627
Neg logit contributions
aye-2.646
ersed-2.395
oing-2.299
 tongues-2.277
 hormones-2.270
Token
Token ID198
Feature activation-2.71e-01

Pos logit contributions
aye+0.763
ersed+0.685
oing+0.646
 tongues+0.641
 hormones+0.640
Neg logit contributions
ighth-0.601
fs-0.589
brates-0.585
umber-0.571
ibia-0.568
Token'
Token ID6
Feature activation-2.15e-01

Pos logit contributions
fs+0.853
ighth+0.846
brates+0.816
umber+0.812
ibia+0.798
Neg logit contributions
aye-1.604
ersed-1.459
oing-1.403
 tongues-1.397
 hormones-1.388
Token
Token ID198
Feature activation-1.57e-02

Pos logit contributions
aye+0.697
ersed+0.626
oing+0.593
 tongues+0.589
 hormones+0.587
Neg logit contributions
ighth-0.531
fs-0.524
brates-0.518
umber-0.507
ibia-0.504
TokenIn
Token ID818
Feature activation-2.09e-01

Pos logit contributions
ighth+0.061
fs+0.061
brates+0.060
umber+0.059
ibia+0.058
Neg logit contributions
aye-0.081
ersed-0.072
oing-0.069
 tongues-0.069
 hormones-0.068
Token a
Token ID257
Feature activation+3.87e-01

Pos logit contributions
fs+0.989
ighth+0.983
brates+0.960
umber+0.955
ibia+0.940
Neg logit contributions
aye-0.910
ersed-0.798
oing-0.746
 tongues-0.736
astered-0.733
Token statement
Token ID2643
Feature activation-1.21e-01

Pos logit contributions
aye+1.521
ersed+1.318
oing+1.230
 tongues+1.221
 hormones+1.208
Neg logit contributions
ighth-1.976
fs-1.963
brates-1.946
umber-1.915
ibia-1.905
Token,
Token ID11
Feature activation-7.10e-01

Pos logit contributions
fs+0.545
ighth+0.543
brates+0.531
umber+0.524
ibia+0.523
Neg logit contributions
aye-0.550
ersed-0.489
oing-0.463
 tongues-0.456
 hormones-0.453
Token Google
Token ID3012
Feature activation-2.67e+00

Pos logit contributions
fs+3.176
ighth+3.158
brates+3.069
umber+3.045
ibia+2.995
Neg logit contributions
aye-3.146
ersed-2.808
oing-2.711
astered-2.620
 tongues-2.582
Token defended
Token ID13768
Feature activation-1.48e+00

Pos logit contributions
fs+9.494
ighth+9.422
ibia+9.294
 else+9.144
umber+9.024
Neg logit contributions
aye-11.904
ersed-10.717
oing-10.680
interstitial-10.244
anchester-10.184
Token its
Token ID663
Feature activation-5.12e-01

Pos logit contributions
ighth+5.398
fs+5.395
umber+5.141
brates+5.089
ibia+5.016
Neg logit contributions
aye-7.601
ersed-6.681
oing-6.489
astered-6.443
 tongues-6.378
Token Android
Token ID5565
Feature activation-6.55e-01

Pos logit contributions
ighth+2.229
fs+2.228
brates+2.150
umber+2.135
ibia+2.134
Neg logit contributions
aye-2.371
ersed-2.099
oing-1.953
astered-1.950
yip-1.939
Token strategy
Token ID4811
Feature activation-4.60e-01

Pos logit contributions
fs+2.798
ighth+2.782
umber+2.639
brates+2.636
ibia+2.615
Neg logit contributions
aye-3.110
ersed-2.788
oing-2.597
astered-2.576
 tongues-2.575
Token and
Token ID290
Feature activation-5.70e-01

Pos logit contributions
fs+2.007
ighth+2.006
brates+1.965
umber+1.941
ibia+1.934
Neg logit contributions
aye-2.137
ersed-1.903
oing-1.797
 tongues-1.766
astered-1.756
Token Sc
Token ID1446
Feature activation-4.48e-01

Pos logit contributions
aye+0.134
ersed+0.118
oing+0.112
 tongues+0.111
 hormones+0.110
Neg logit contributions
ighth-0.128
fs-0.127
brates-0.125
umber-0.123
ibia-0.123
Tokenrub
Token ID25089
Feature activation+8.95e-01

Pos logit contributions
ighth+1.694
fs+1.691
umber+1.629
brates+1.627
ibia+1.611
Neg logit contributions
aye-2.350
ersed-2.121
 tongues-2.035
oing-2.022
 hormones-2.021
Tokenbing
Token ID4623
Feature activation+3.45e-01

Pos logit contributions
aye+3.965
ersed+3.481
oing+3.310
 hormones+3.229
 tongues+3.224
Neg logit contributions
ighth-4.051
brates-3.968
fs-3.896
ibia-3.876
umber-3.848
Token centers
Token ID10399
Feature activation+1.28e-01

Pos logit contributions
aye+1.549
ersed+1.358
oing+1.285
 tongues+1.275
 hormones+1.268
Neg logit contributions
ighth-1.576
fs-1.553
brates-1.540
umber-1.512
ibia-1.509
Token and
Token ID290
Feature activation+3.67e-01

Pos logit contributions
aye+0.564
ersed+0.496
oing+0.467
 tongues+0.462
 hormones+0.460
Neg logit contributions
ighth-0.596
fs-0.593
brates-0.585
umber-0.575
ibia-0.573
Token Content
Token ID14041
Feature activation-2.66e+00

Pos logit contributions
aye+1.672
ersed+1.475
oing+1.396
 tongues+1.382
 hormones+1.378
Neg logit contributions
ighth-1.649
fs-1.636
brates-1.617
umber-1.589
ibia-1.585
Token Distribution
Token ID27484
Feature activation-4.84e-01

Pos logit contributions
fs+10.220
ighth+10.188
 else+9.535
FORE+9.517
umber+9.486
Neg logit contributions
aye-11.436
ersed-10.834
oing-9.937
astered-9.448
yip-9.446
Token Networks
Token ID27862
Feature activation+3.10e-01

Pos logit contributions
fs+1.936
ighth+1.897
brates+1.872
umber+1.836
ibia+1.824
Neg logit contributions
aye-2.453
ersed-2.214
oing-2.092
astered-2.048
 hormones-2.042
Token (
Token ID357
Feature activation+3.36e-01

Pos logit contributions
aye+1.355
ersed+1.196
oing+1.123
 tongues+1.114
 hormones+1.111
Neg logit contributions
ighth-1.447
fs-1.431
brates-1.417
umber-1.391
ibia-1.385
TokenCD
Token ID8610
Feature activation-1.19e+00

Pos logit contributions
aye+1.161
ersed+0.980
oing+0.901
 tongues+0.888
astered+0.883
Neg logit contributions
ighth-1.879
fs-1.864
brates-1.848
umber-1.823
ibia-1.817
TokenN
Token ID45
Feature activation-4.35e-01

Pos logit contributions
fs+4.406
ighth+4.169
brates+4.049
emis+3.989
FORE+3.963
Neg logit contributions
aye-6.242
ersed-5.697
astered-5.371
yip-5.353
oing-5.297
Token acts
Token ID6529
Feature activation-7.10e-01

Pos logit contributions
ighth+1.753
fs+1.749
umber+1.691
brates+1.679
ibia+1.648
Neg logit contributions
aye-2.037
ersed-1.833
oing-1.734
 hormones-1.709
 tongues-1.707
Token self
Token ID2116
Feature activation+1.43e+00

Pos logit contributions
ighth+3.087
fs+3.070
brates+3.009
umber+2.974
ibia+2.955
Neg logit contributions
aye-3.199
ersed-2.846
oing-2.701
 hormones-2.673
 tongues-2.658
Tokenlessly
Token ID8613
Feature activation-1.45e+00

Pos logit contributions
aye+6.151
ersed+5.444
 tongues+5.073
oing+5.024
 hormones+4.946
Neg logit contributions
ighth-6.325
brates-6.311
fs-6.227
FORE-6.226
umber-6.167
Token for
Token ID329
Feature activation-1.10e+00

Pos logit contributions
fs+5.824
ighth+5.693
umber+5.655
brates+5.619
FORE+5.500
Neg logit contributions
aye-6.595
ersed-5.963
 hormones-5.631
 tongues-5.626
oing-5.585
Token the
Token ID262
Feature activation-1.39e+00

Pos logit contributions
fs+4.837
ighth+4.762
brates+4.639
umber+4.619
ibia+4.557
Neg logit contributions
aye-4.984
ersed-4.453
oing-4.140
 tongues-4.102
 hormones-4.099
Token common
Token ID2219
Feature activation-2.65e+00

Pos logit contributions
fs+5.907
ighth+5.855
umber+5.711
brates+5.611
ibia+5.578
Neg logit contributions
aye-6.296
ersed-5.635
oing-5.280
astered-5.153
 tongues-5.123
Token man
Token ID582
Feature activation-1.32e+00

Pos logit contributions
fs+9.388
ces+8.622
ighth+8.574
brates+8.357
ibia+8.197
Neg logit contributions
aye-12.559
ersed-11.251
anchester-10.657
oing-10.559
 hormones-10.512
Token-
Token ID12
Feature activation-4.98e-01

Pos logit contributions
fs+5.237
ighth+5.194
umber+5.056
brates+5.056
ibia+4.961
Neg logit contributions
aye-6.293
ersed-5.673
oing-5.359
 tongues-5.312
 hormones-5.305
TokenNOT
Token ID11929
Feature activation-5.30e-01

Pos logit contributions
fs+2.071
ighth+2.070
brates+2.014
umber+1.983
FORE+1.965
Neg logit contributions
aye-2.391
ersed-2.147
 tongues-2.047
oing-2.043
 hormones-2.030
Token lie
Token ID6486
Feature activation-4.76e-01

Pos logit contributions
fs+2.389
ighth+2.382
brates+2.322
FORE+2.300
umber+2.298
Neg logit contributions
aye-2.365
ersed-2.115
oing-1.971
 tongues-1.951
astered-1.943
Token 2
Token ID362
Feature activation-1.82e-01

Pos logit contributions
fs+2.031
ighth+2.013
brates+1.987
umber+1.942
ibia+1.927
Neg logit contributions
aye-2.254
ersed-2.002
oing-1.905
 hormones-1.875
 tongues-1.872
Token city
Token ID1748
Feature activation-5.51e-01

Pos logit contributions
aye+0.316
ersed+0.279
oing+0.263
 tongues+0.261
 hormones+0.259
Neg logit contributions
ighth-0.315
fs-0.314
brates-0.309
umber-0.304
ibia-0.302
Token face
Token ID1986
Feature activation+6.77e-01

Pos logit contributions
fs+2.710
ighth+2.687
brates+2.625
ibia+2.624
umber+2.613
Neg logit contributions
aye-2.251
ersed-1.955
oing-1.825
astered-1.794
 hormones-1.775
Token a
Token ID257
Feature activation+5.82e-01

Pos logit contributions
aye+2.907
ersed+2.552
oing+2.398
 tongues+2.395
 hormones+2.383
Neg logit contributions
ighth-3.201
fs-3.187
brates-3.149
umber-3.101
ibia-3.075
Token f
Token ID277
Feature activation-4.06e-01

Pos logit contributions
aye+2.557
ersed+2.263
oing+2.144
 tongues+2.131
 hormones+2.111
Neg logit contributions
ighth-2.675
fs-2.667
brates-2.642
umber-2.589
ibia-2.574
Tokenait
Token ID4548
Feature activation-6.96e-01

Pos logit contributions
fs+1.358
ighth+1.357
umber+1.302
brates+1.273
ibia+1.273
Neg logit contributions
aye-2.300
ersed-2.104
 tongues-2.021
 hormones-2.005
oing-2.005
Token accompl
Token ID6424
Feature activation-2.63e+00

Pos logit contributions
ighth+2.394
fs+2.392
brates+2.347
umber+2.289
ibia+2.262
Neg logit contributions
aye-3.807
ersed-3.468
yip-3.300
oing-3.299
 tongues-3.285
Tokeni
Token ID72
Feature activation-4.05e-01

Pos logit contributions
fs+6.260
 else+5.595
aire+5.553
que+5.496
ces+5.486
Neg logit contributions
anchester-14.896
aye-14.732
 tongues-14.615
ersed-14.606
 hormones-14.466
Token under
Token ID739
Feature activation+1.32e+00

Pos logit contributions
fs+1.734
ighth+1.726
umber+1.698
brates+1.690
ibia+1.678
Neg logit contributions
aye-1.902
ersed-1.682
oing-1.607
 tongues-1.590
 hormones-1.570
Token which
Token ID543
Feature activation+8.47e-02

Pos logit contributions
aye+5.339
ersed+4.607
oing+4.374
 tongues+4.302
 hormones+4.284
Neg logit contributions
ighth-6.371
brates-6.347
fs-6.329
ibia-6.211
FORE-6.124
Token they
Token ID484
Feature activation-7.27e-02

Pos logit contributions
aye+0.346
ersed+0.302
oing+0.283
astered+0.278
 tongues+0.278
Neg logit contributions
ighth-0.419
fs-0.418
brates-0.411
umber-0.407
ibia-0.405
Token are
Token ID389
Feature activation+1.11e-01

Pos logit contributions
ighth+0.371
fs+0.368
brates+0.361
umber+0.359
ibia+0.358
Neg logit contributions
aye-0.290
ersed-0.250
oing-0.233
astered-0.230
 hormones-0.228
Token the
Token ID262
Feature activation-5.42e-01

Pos logit contributions
fs+2.515
ighth+2.455
umber+2.402
brates+2.385
ibia+2.341
Neg logit contributions
aye-2.814
ersed-2.520
astered-2.317
oing-2.313
 tongues-2.292
Token hack
Token ID8156
Feature activation-5.15e-01

Pos logit contributions
fs+2.312
ighth+2.240
brates+2.193
umber+2.188
FORE+2.151
Neg logit contributions
aye-2.586
ersed-2.330
astered-2.137
oing-2.126
 tongues-2.114
Token,
Token ID11
Feature activation-3.69e-01

Pos logit contributions
fs+2.328
ighth+2.272
umber+2.242
brates+2.232
ibia+2.183
Neg logit contributions
aye-2.332
ersed-2.086
 tongues-1.926
oing-1.917
astered-1.905
Token Crowd
Token ID29452
Feature activation-6.05e-01

Pos logit contributions
fs+1.667
ighth+1.658
umber+1.623
brates+1.621
FORE+1.595
Neg logit contributions
aye-1.653
ersed-1.468
oing-1.363
astered-1.350
 tongues-1.344
Tokenstrike
Token ID33069
Feature activation-1.83e+00

Pos logit contributions
fs+1.901
ighth+1.897
brates+1.823
umber+1.803
ibia+1.763
Neg logit contributions
aye-3.550
ersed-3.249
oing-3.115
 tongues-3.113
 hormones-3.091
Token stood
Token ID6204
Feature activation-2.63e+00

Pos logit contributions
fs+6.406
ighth+6.372
 else+6.291
umber+6.164
ibia+6.128
Neg logit contributions
aye-8.983
ersed-8.199
oing-7.801
anchester-7.681
yip-7.641
Token by
Token ID416
Feature activation-1.45e+00

Pos logit contributions
 else+7.988
fs+7.948
ters+7.704
ighth+7.583
umber+7.465
Neg logit contributions
aye-12.689
ersed-11.583
yip-11.378
ersion-11.313
ellar-11.288
Token its
Token ID663
Feature activation-8.21e-01

Pos logit contributions
fs+5.378
umber+5.250
ighth+5.190
brates+5.071
 else+5.023
Neg logit contributions
aye-7.309
ersed-6.607
yip-6.158
astered-6.067
ersion-6.052
Token analysis
Token ID3781
Feature activation-8.63e-01

Pos logit contributions
fs+4.082
ighth+4.040
umber+3.940
brates+3.937
FORE+3.890
Neg logit contributions
aye-3.225
ersed-2.869
astered-2.551
yip-2.540
oing-2.529
Token writing
Token ID3597
Feature activation-5.06e-01

Pos logit contributions
fs+3.617
ighth+3.554
umber+3.519
brates+3.492
ibia+3.439
Neg logit contributions
aye-4.053
ersed-3.707
oing-3.431
yip-3.404
 tongues-3.398
Token,
Token ID11
Feature activation-7.64e-01

Pos logit contributions
fs+1.800
ighth+1.767
brates+1.727
umber+1.721
FORE+1.684
Neg logit contributions
aye-2.749
ersed-2.507
oing-2.372
 hormones-2.337
 tongues-2.331
Token be
Token ID307
Feature activation+3.09e-01

Pos logit contributions
aye+3.552
ersed+2.895
oing+2.843
 tongues+2.787
 hormones+2.757
Neg logit contributions
fs-5.397
ighth-5.376
brates-5.363
ibia-5.204
umber-5.196
Token located
Token ID5140
Feature activation-7.51e-01

Pos logit contributions
aye+1.746
ersed+1.576
oing+1.517
 tongues+1.505
 hormones+1.500
Neg logit contributions
fs-1.038
ighth-1.034
brates-1.031
ibia-0.988
umber-0.986
Token.
Token ID13
Feature activation-3.58e-02

Pos logit contributions
ighth+3.057
fs+3.031
umber+2.984
brates+2.943
ibia+2.923
Neg logit contributions
aye-3.572
ersed-3.318
astered-3.045
 tongues-3.045
oing-3.037
Token<|endoftext|>
Token ID50256
Feature activation+1.74e-02

Pos logit contributions
ighth+0.143
fs+0.143
brates+0.140
umber+0.137
ibia+0.137
Neg logit contributions
aye-0.182
ersed-0.162
oing-0.154
 tongues-0.153
 hormones-0.152
TokenThe
Token ID464
Feature activation+4.36e-03

Pos logit contributions
aye+0.084
ersed+0.075
oing+0.071
 tongues+0.070
 hormones+0.070
Neg logit contributions
ighth-0.074
fs-0.073
brates-0.072
umber-0.071
ibia-0.070
Token thorn
Token ID38999
Feature activation-2.62e+00

Pos logit contributions
aye+0.020
ersed+0.017
oing+0.016
 tongues+0.016
 hormones+0.016
Neg logit contributions
ighth-0.020
fs-0.020
brates-0.019
umber-0.019
ibia-0.019
Tokeny
Token ID88
Feature activation-8.85e-02

Pos logit contributions
fs+9.060
ighth+8.606
umber+8.281
brates+8.241
icity+8.180
Neg logit contributions
aye-11.781
ersed-11.147
anchester-10.374
astered-10.318
ersion-10.260
Token debates
Token ID15389
Feature activation-1.91e-01

Pos logit contributions
fs+0.421
ighth+0.421
brates+0.409
umber+0.406
ibia+0.405
Neg logit contributions
aye-0.381
ersed-0.334
oing-0.313
astered-0.308
 tongues-0.305
Token around
Token ID1088
Feature activation-6.15e-01

Pos logit contributions
fs+0.839
ighth+0.836
brates+0.815
umber+0.808
ibia+0.798
Neg logit contributions
aye-0.895
ersed-0.791
oing-0.746
 tongues-0.731
astered-0.729
Token encryption
Token ID15835
Feature activation-1.32e+00

Pos logit contributions
fs+2.793
ighth+2.790
brates+2.691
umber+2.684
ibia+2.654
Neg logit contributions
aye-2.731
ersed-2.399
oing-2.305
astered-2.223
 tongues-2.198
Token,
Token ID11
Feature activation-2.88e-01

Pos logit contributions
fs+5.984
ighth+5.899
brates+5.741
ibia+5.734
cape+5.673
Neg logit contributions
aye-5.633
ersed-4.805
yip-4.583
oing-4.581
interstitial-4.498
Token subsequently
Token ID12412
Feature activation-1.07e+00

Pos logit contributions
ighth+1.398
fs+1.390
umber+1.354
brates+1.351
ibia+1.339
Neg logit contributions
aye-1.567
ersed-1.420
oing-1.327
astered-1.321
 tongues-1.302
Token completing
Token ID14339
Feature activation-7.58e-01

Pos logit contributions
ighth+4.258
fs+4.223
umber+4.134
ibia+4.075
brates+4.000
Neg logit contributions
aye-5.123
ersed-4.668
astered-4.372
yip-4.315
oing-4.254
Token a
Token ID257
Feature activation-1.59e-01

Pos logit contributions
ighth+3.191
fs+3.140
brates+3.094
umber+3.041
ibia+3.002
Neg logit contributions
aye-3.509
ersed-3.229
oing-3.017
astered-2.971
 tongues-2.950
Token first
Token ID717
Feature activation-5.93e-01

Pos logit contributions
ighth+0.673
fs+0.669
brates+0.656
umber+0.646
ibia+0.642
Neg logit contributions
aye-0.769
ersed-0.685
oing-0.649
 tongues-0.642
 hormones-0.639
Token Sunderland
Token ID35706
Feature activation+2.59e-01

Pos logit contributions
fs+2.497
ighth+2.488
umber+2.429
brates+2.403
ibia+2.370
Neg logit contributions
aye-2.818
ersed-2.560
astered-2.388
oing-2.373
 tongues-2.361
Token tre
Token ID2054
Feature activation-2.62e+00

Pos logit contributions
aye+1.341
ersed+1.196
oing+1.139
 tongues+1.133
 hormones+1.131
Neg logit contributions
ighth-0.997
fs-0.991
brates-0.990
umber-0.962
ibia-0.952
Tokenble
Token ID903
Feature activation-4.23e-01

Pos logit contributions
fs+10.123
ighth+9.965
ces+9.821
brates+9.675
ibia+9.342
Neg logit contributions
ersed-10.974
aye-10.950
 hormones-10.712
avascript-10.591
 antibodies-10.522
Token by
Token ID416
Feature activation-8.98e-01

Pos logit contributions
ighth+1.863
fs+1.860
brates+1.813
umber+1.782
ibia+1.780
Neg logit contributions
aye-1.950
ersed-1.748
 tongues-1.641
oing-1.641
 hormones-1.623
Token shooting
Token ID4395
Feature activation-7.97e-01

Pos logit contributions
ighth+3.547
fs+3.513
umber+3.460
ibia+3.368
brates+3.340
Neg logit contributions
aye-4.375
ersed-3.976
astered-3.728
oing-3.687
 hormones-3.672
Token his
Token ID465
Feature activation+1.11e-01

Pos logit contributions
ighth+3.610
fs+3.553
brates+3.480
umber+3.420
ibia+3.373
Neg logit contributions
aye-3.503
ersed-3.114
oing-2.970
astered-2.863
 hormones-2.856
Token 19
Token ID678
Feature activation+6.36e-02

Pos logit contributions
aye+0.537
ersed+0.478
oing+0.454
 tongues+0.450
 hormones+0.448
Neg logit contributions
ighth-0.466
fs-0.465
brates-0.458
umber-0.449
ibia-0.447
Token97
Token ID5607
Feature activation+2.25e-01

Pos logit contributions
aye+5.810
ersed+5.120
oing+4.876
 tongues+4.801
astered+4.747
Neg logit contributions
ighth-4.904
brates-4.802
fs-4.771
umber-4.687
emis-4.678
Tokens
Token ID82
Feature activation-2.20e-01

Pos logit contributions
aye+0.874
ersed+0.756
oing+0.707
 tongues+0.699
 hormones+0.694
Neg logit contributions
ighth-1.156
fs-1.143
brates-1.135
umber-1.117
ibia-1.114
Token)}
Token ID38165
Feature activation-1.08e-02

Pos logit contributions
fs+0.493
ighth+0.486
brates+0.472
umber+0.457
ibia+0.448
Neg logit contributions
aye-1.508
ersed-1.388
oing-1.341
 tongues-1.336
 hormones-1.329
Token 10
Token ID838
Feature activation-9.55e-02

Pos logit contributions
ighth+0.049
fs+0.049
brates+0.048
umber+0.047
ibia+0.047
Neg logit contributions
aye-0.049
ersed-0.043
oing-0.041
 tongues-0.040
 hormones-0.040
Token.
Token ID13
Feature activation-1.90e+00

Pos logit contributions
ighth+0.385
fs+0.385
brates+0.377
umber+0.370
ibia+0.368
Neg logit contributions
aye-0.480
ersed-0.428
oing-0.408
 tongues-0.404
 hormones-0.402
Token B
Token ID347
Feature activation-2.62e+00

Pos logit contributions
fs+6.691
ces+6.361
 else+6.300
ighth+6.299
FORE+6.167
Neg logit contributions
aye-9.070
ersed-8.761
astered-8.328
yip-8.189
anchester-8.081
Tokenxe
Token ID27705
Feature activation-1.10e+00

Pos logit contributions
fs+7.406
ces+6.587
ighth+6.194
umber+6.096
ibia+6.038
Neg logit contributions
aye-14.012
ersed-13.495
 tongues-13.175
 hormones-13.135
 antibodies-12.864
Token7
Token ID22
Feature activation-1.56e+00

Pos logit contributions
fs+3.531
ighth+3.527
umber+3.389
brates+3.301
ibia+3.232
Neg logit contributions
aye-6.268
ersed-5.790
yip-5.482
 tongues-5.472
 hormones-5.468
Token {
Token ID1391
Feature activation+5.23e-01

Pos logit contributions
fs+4.154
ighth+3.924
umber+3.854
brates+3.853
ces+3.813
Neg logit contributions
aye-9.261
ersed-8.636
astered-8.267
 tongues-8.180
yip-8.140
Token(
Token ID7
Feature activation+1.08e+00

Pos logit contributions
aye+2.675
ersed+2.401
oing+2.255
astered+2.228
 hormones+2.225
Neg logit contributions
ighth-2.040
brates-1.974
fs-1.955
emis-1.930
umber-1.913
Token66
Token ID2791
Feature activation+6.77e-01

Pos logit contributions
aye+5.214
ersed+4.594
oing+4.374
 tongues+4.301
astered+4.271
Neg logit contributions
ighth-4.300
brates-4.227
fs-4.197
umber-4.127
emis-4.100
Token Born
Token ID18889
Feature activation+4.95e-01

Pos logit contributions
ighth+0.844
fs+0.841
brates+0.824
umber+0.812
ibia+0.808
Neg logit contributions
aye-0.852
ersed-0.749
oing-0.711
 tongues-0.697
 hormones-0.694
Token in
Token ID287
Feature activation+8.24e-02

Pos logit contributions
aye+2.131
ersed+1.844
oing+1.750
 tongues+1.736
 hormones+1.733
Neg logit contributions
ighth-2.342
fs-2.337
brates-2.308
ibia-2.252
umber-2.245
Token Steven
Token ID8239
Feature activation-2.58e-01

Pos logit contributions
aye+0.378
ersed+0.334
oing+0.315
 tongues+0.313
 hormones+0.311
Neg logit contributions
ighth-0.368
fs-0.367
brates-0.361
umber-0.355
ibia-0.353
Tokenage
Token ID496
Feature activation+4.87e-01

Pos logit contributions
ighth+0.881
fs+0.880
umber+0.852
brates+0.850
ibia+0.833
Neg logit contributions
aye-1.448
ersed-1.320
oing-1.261
 tongues-1.254
 hormones-1.251
Token,
Token ID11
Feature activation+3.60e-01

Pos logit contributions
aye+2.074
ersed+1.801
oing+1.709
 hormones+1.685
 tongues+1.684
Neg logit contributions
ighth-2.317
fs-2.301
brates-2.285
umber-2.237
ibia-2.220
Token Hert
Token ID40030
Feature activation-2.60e+00

Pos logit contributions
aye+1.681
ersed+1.488
oing+1.410
 tongues+1.400
 hormones+1.393
Neg logit contributions
ighth-1.573
fs-1.566
brates-1.543
umber-1.511
ibia-1.506
Tokens
Token ID82
Feature activation+2.07e-01

Pos logit contributions
fs+7.283
ibia+6.740
umber+6.732
ighth+6.346
ters+6.221
Neg logit contributions
aye-13.709
 ultras-12.914
 antibodies-12.907
 tongues-12.894
ersed-12.864
Token 1991
Token ID10249
Feature activation+1.31e-01

Pos logit contributions
aye+0.991
ersed+0.881
oing+0.835
 tongues+0.826
 hormones+0.822
Neg logit contributions
ighth-0.882
fs-0.877
brates-0.866
umber-0.847
ibia-0.844
Token:
Token ID25
Feature activation-1.52e-01

Pos logit contributions
aye+0.764
ersed+0.695
oing+0.665
 tongues+0.660
 hormones+0.657
Neg logit contributions
ighth-0.421
fs-0.419
brates-0.409
umber-0.400
ibia-0.397
Token First
Token ID3274
Feature activation-1.00e-01

Pos logit contributions
ighth+0.693
fs+0.691
brates+0.680
umber+0.667
ibia+0.664
Neg logit contributions
aye-0.681
ersed-0.596
oing-0.567
 tongues-0.557
 hormones-0.552
Token ride
Token ID6594
Feature activation-5.30e-01

Pos logit contributions
ighth+0.436
fs+0.436
brates+0.423
umber+0.421
ibia+0.415
Neg logit contributions
aye-0.469
ersed-0.416
oing-0.392
 tongues-0.388
 hormones-0.384
Token the
Token ID262
Feature activation+2.47e-01

Pos logit contributions
aye+2.054
ersed+1.784
oing+1.668
 tongues+1.658
 hormones+1.654
Neg logit contributions
ighth-2.408
fs-2.389
brates-2.377
ibia-2.338
umber-2.318
Token first
Token ID717
Feature activation+4.25e-01

Pos logit contributions
aye+1.117
ersed+0.982
oing+0.929
 tongues+0.920
 hormones+0.916
Neg logit contributions
fs-1.110
ighth-1.110
brates-1.098
ibia-1.079
umber-1.076
Token Ghostbusters
Token ID50037
Feature activation+7.28e-01

Pos logit contributions
aye+1.930
ersed+1.692
oing+1.608
 tongues+1.582
 hormones+1.577
Neg logit contributions
ighth-1.912
fs-1.895
brates-1.874
ibia-1.847
umber-1.844
Token,
Token ID11
Feature activation+1.84e-01

Pos logit contributions
aye+2.872
ersed+2.497
oing+2.299
 hormones+2.275
astered+2.274
Neg logit contributions
ighth-3.654
fs-3.590
brates-3.589
umber-3.523
ibia-3.501
Token the
Token ID262
Feature activation-3.96e-01

Pos logit contributions
aye+0.789
ersed+0.692
oing+0.648
 tongues+0.643
 hormones+0.641
Neg logit contributions
ighth-0.869
fs-0.864
brates-0.855
umber-0.837
ibia-0.837
Tokenne
Token ID710
Feature activation-2.59e+00

Pos logit contributions
fs+1.304
ighth+1.299
umber+1.258
brates+1.249
ibia+1.202
Neg logit contributions
aye-2.268
ersed-2.072
 tongues-1.987
oing-1.977
 hormones-1.972
Tokenigh
Token ID394
Feature activation-1.17e+00

Pos logit contributions
ither+9.015
fs+8.950
ighth+8.911
ces+8.539
brates+8.454
Neg logit contributions
aye-12.735
 tongues-11.496
ersed-11.208
 antibodies-11.016
 Instruments-10.963
Tokenbor
Token ID2865
Feature activation-2.81e-01

Pos logit contributions
fs+2.124
brates+1.983
ighth+1.946
umber+1.815
ces+1.690
Neg logit contributions
aye-8.445
ersed-7.920
oing-7.548
 hormones-7.543
 tongues-7.520
Tokenhood
Token ID2894
Feature activation-1.88e-01

Pos logit contributions
fs+0.894
ighth+0.890
brates+0.862
umber+0.861
ibia+0.838
Neg logit contributions
aye-1.642
ersed-1.501
oing-1.421
 hormones-1.419
 tongues-1.417
Token in
Token ID287
Feature activation-1.83e-02

Pos logit contributions
fs+0.906
ighth+0.903
brates+0.886
umber+0.880
ibia+0.871
Neg logit contributions
aye-0.792
ersed-0.691
oing-0.646
 tongues-0.645
 hormones-0.640
Token New
Token ID968
Feature activation+5.96e-01

Pos logit contributions
ighth+0.075
fs+0.075
brates+0.074
umber+0.073
ibia+0.072
Neg logit contributions
aye-0.090
ersed-0.081
oing-0.076
 tongues-0.076
 hormones-0.075
Tokenя
Token ID40623
Feature activation-7.34e-01

Pos logit contributions
fs+0.751
ighth+0.751
brates+0.735
umber+0.724
ibia+0.722
Neg logit contributions
aye-0.798
ersed-0.704
oing-0.666
 tongues-0.662
 hormones-0.657
Token.
Token ID13
Feature activation-7.24e-02

Pos logit contributions
fs+2.561
ighth+2.524
brates+2.465
ibia+2.441
umber+2.426
Neg logit contributions
aye-4.020
ersed-3.599
oing-3.477
 tongues-3.429
 hormones-3.415
Token
Token ID12466
Feature activation+1.08e-02

Pos logit contributions
fs+0.261
ighth+0.261
brates+0.255
umber+0.249
ibia+0.248
Neg logit contributions
aye-0.395
ersed-0.355
oing-0.341
 tongues-0.337
 hormones-0.336
Token
Token ID250
Feature activation+4.61e-01

Pos logit contributions
aye+0.060
ersed+0.055
oing+0.052
 tongues+0.052
 hormones+0.052
Neg logit contributions
ighth-0.038
fs-0.038
brates-0.037
umber-0.036
ibia-0.036
Tokenн
Token ID22177
Feature activation-1.58e-01

Pos logit contributions
aye+2.187
ersed+1.948
oing+1.841
 tongues+1.817
 hormones+1.808
Neg logit contributions
ighth-1.962
fs-1.954
brates-1.936
umber-1.896
ibia-1.875
Tokenо�
Token ID25443
Feature activation-2.58e+00

Pos logit contributions
ighth+0.612
fs+0.610
brates+0.594
umber+0.586
ibia+0.581
Neg logit contributions
aye-0.818
ersed-0.733
 tongues-0.696
oing-0.693
 hormones-0.689
Token
Token ID111
Feature activation-1.66e-01

Pos logit contributions
ighth+5.472
fs+5.267
ither+4.740
umber+4.654
brates+4.595
Neg logit contributions
aye-15.516
ersed-14.433
 tongues-14.403
 hormones-14.184
oing-13.876
Tokenи
Token ID18849
Feature activation-8.54e-02

Pos logit contributions
fs+0.550
ighth+0.550
brates+0.530
umber+0.524
ibia+0.523
Neg logit contributions
aye-0.954
ersed-0.860
 tongues-0.821
oing-0.819
astered-0.813
Tokenе
Token ID16843
Feature activation-1.19e+00

Pos logit contributions
fs+0.340
ighth+0.339
brates+0.331
umber+0.326
ibia+0.324
Neg logit contributions
aye-0.434
ersed-0.387
oing-0.368
 tongues-0.365
 hormones-0.363
Token
Token ID12466
Feature activation-3.16e-01

Pos logit contributions
fs+4.002
ighth+3.844
brates+3.785
ibia+3.742
umber+3.692
Neg logit contributions
aye-6.572
ersed-5.885
oing-5.638
astered-5.632
 hormones-5.583
Token
Token ID116
Feature activation+1.76e+00

Pos logit contributions
ighth+0.940
fs+0.937
brates+0.900
umber+0.885
ibia+0.878
Neg logit contributions
aye-1.918
ersed-1.754
oing-1.683
 tongues-1.674
 hormones-1.666
Token k
Token ID479
Feature activation+7.80e-01

Pos logit contributions
aye+2.230
ersed+1.978
oing+1.869
 tongues+1.857
 hormones+1.856
Neg logit contributions
ighth-2.094
fs-2.076
brates-2.057
umber-2.011
ibia-2.006
Tokenart
Token ID433
Feature activation-3.89e-01

Pos logit contributions
aye+3.015
ersed+2.600
oing+2.475
astered+2.407
urst+2.318
Neg logit contributions
ighth-3.962
brates-3.937
fs-3.924
FORE-3.868
ibia-3.835
Tokening
Token ID278
Feature activation-3.19e-01

Pos logit contributions
fs+2.000
ighth+1.996
brates+1.955
umber+1.947
ibia+1.943
Neg logit contributions
aye-1.505
ersed-1.292
oing-1.197
 tongues-1.192
 hormones-1.186
Token track
Token ID2610
Feature activation+1.47e-02

Pos logit contributions
fs+1.314
ighth+1.308
brates+1.282
ibia+1.265
umber+1.265
Neg logit contributions
aye-1.566
ersed-1.402
oing-1.322
 tongues-1.303
 hormones-1.297
Token in
Token ID287
Feature activation+7.37e-02

Pos logit contributions
aye+0.068
ersed+0.060
oing+0.057
 tongues+0.056
 hormones+0.056
Neg logit contributions
ighth-0.065
fs-0.065
brates-0.063
umber-0.062
ibia-0.062
Token Hert
Token ID40030
Feature activation-2.58e+00

Pos logit contributions
aye+0.334
ersed+0.295
oing+0.278
 tongues+0.275
 hormones+0.274
Neg logit contributions
ighth-0.333
fs-0.332
brates-0.327
umber-0.322
ibia-0.320
Tokenford
Token ID3841
Feature activation-4.09e-01

Pos logit contributions
fs+5.408
ibia+5.314
umber+5.084
ters+4.847
ces+4.482
Neg logit contributions
aye-15.047
 antibodies-14.637
 ultras-14.477
 tongues-14.459
 hormones-14.358
Tokenshire
Token ID10932
Feature activation-9.96e-02

Pos logit contributions
fs+1.259
ighth+1.258
umber+1.209
brates+1.206
ibia+1.187
Neg logit contributions
aye-2.427
ersed-2.222
oing-2.131
 tongues-2.116
 hormones-2.106
Token.
Token ID13
Feature activation+1.26e-01

Pos logit contributions
ighth+0.413
fs+0.412
brates+0.404
umber+0.398
ibia+0.397
Neg logit contributions
aye-0.487
ersed-0.436
oing-0.413
 tongues-0.408
 hormones-0.406
Token "
Token ID366
Feature activation+3.74e-01

Pos logit contributions
aye+0.625
ersed+0.558
oing+0.530
 tongues+0.525
 hormones+0.523
Neg logit contributions
ighth-0.515
fs-0.513
brates-0.504
umber-0.495
ibia-0.492
TokenThey
Token ID2990
Feature activation+2.21e-01

Pos logit contributions
aye+1.784
ersed+1.591
oing+1.493
 hormones+1.479
 tongues+1.478
Neg logit contributions
ighth-1.586
fs-1.565
brates-1.559
umber-1.533
ibia-1.516
Token from
Token ID422
Feature activation+1.14e-01

Pos logit contributions
aye+0.443
ersed+0.388
oing+0.367
 tongues+0.359
astered+0.358
Neg logit contributions
ighth-0.474
fs-0.473
brates-0.464
umber-0.458
ibia-0.455
Token a
Token ID257
Feature activation+2.05e-02

Pos logit contributions
aye+0.515
ersed+0.454
oing+0.429
 tongues+0.425
 hormones+0.423
Neg logit contributions
ighth-0.516
fs-0.514
brates-0.506
umber-0.498
ibia-0.496
Token Paris
Token ID6342
Feature activation-2.50e+00

Pos logit contributions
aye+0.095
ersed+0.084
oing+0.079
 tongues+0.079
 hormones+0.078
Neg logit contributions
ighth-0.091
fs-0.090
brates-0.089
umber-0.087
ibia-0.087
Tokenian
Token ID666
Feature activation-2.85e-01

Pos logit contributions
cape+8.449
fs+8.431
ighth+8.357
ces+8.299
umber+8.140
Neg logit contributions
aye-11.334
ersed-10.771
 hormones-10.043
oing-10.009
ptoms-9.994
Token disc
Token ID1221
Feature activation-7.14e-01

Pos logit contributions
ighth+1.194
fs+1.189
brates+1.160
umber+1.151
ibia+1.140
Neg logit contributions
aye-1.378
ersed-1.230
oing-1.166
 tongues-1.147
 hormones-1.145
Tokenothe
Token ID20388
Feature activation-2.57e+00

Pos logit contributions
fs+2.912
ighth+2.870
umber+2.823
brates+2.806
ibia+2.751
Neg logit contributions
aye-3.467
ersed-3.105
 hormones-2.989
 tongues-2.962
 antibodies-2.918
Tokenque
Token ID4188
Feature activation-9.33e-01

Pos logit contributions
que+1.399
fs+0.847
ighth+0.766
brates+0.735
ces+0.525
Neg logit contributions
aye-20.114
ersed-19.782
 hormones-19.207
 tongues-19.198
 antibodies-19.168
Token owned
Token ID6898
Feature activation-2.19e-01

Pos logit contributions
fs+3.830
ighth+3.801
umber+3.727
brates+3.694
ibia+3.676
Neg logit contributions
aye-4.442
ersed-3.961
oing-3.711
 hormones-3.700
 tongues-3.672
Token by
Token ID416
Feature activation+4.22e-01

Pos logit contributions
ighth+0.992
fs+0.986
brates+0.969
umber+0.961
ibia+0.949
Neg logit contributions
aye-0.989
ersed-0.874
oing-0.827
 tongues-0.814
 hormones-0.810
Token a
Token ID257
Feature activation+5.63e-01

Pos logit contributions
aye+1.906
ersed+1.664
 tongues+1.578
oing+1.575
 hormones+1.559
Neg logit contributions
ighth-1.909
fs-1.903
brates-1.876
ibia-1.842
umber-1.833
Token slim
Token ID18862
Feature activation+1.22e+00

Pos logit contributions
aye+2.605
ersed+2.295
 tongues+2.185
oing+2.172
 hormones+2.163
Neg logit contributions
ighth-2.465
fs-2.442
brates-2.424
ibia-2.377
umber-2.362
Token of
Token ID286
Feature activation-4.05e-01

Pos logit contributions
aye+0.193
ersed+0.171
oing+0.161
 tongues+0.160
 hormones+0.159
Neg logit contributions
ighth-0.187
fs-0.186
brates-0.183
umber-0.180
ibia-0.179
Token the
Token ID262
Feature activation-7.79e-01

Pos logit contributions
ighth+1.776
fs+1.764
brates+1.712
umber+1.705
ibia+1.698
Neg logit contributions
aye-1.901
ersed-1.698
oing-1.590
astered-1.559
 tongues-1.554
Token Anaheim
Token ID31100
Feature activation-1.45e+00

Pos logit contributions
ighth+3.195
fs+3.159
umber+3.073
brates+3.031
ibia+3.028
Neg logit contributions
aye-3.816
ersed-3.486
oing-3.206
astered-3.165
yip-3.140
Token Ducks
Token ID27622
Feature activation+8.45e-02

Pos logit contributions
fs+2.998
ighth+2.949
umber+2.694
brates+2.688
ibia+2.639
Neg logit contributions
aye-9.825
ersed-9.183
yip-8.844
astered-8.733
oing-8.666
Token.
Token ID13
Feature activation+1.62e-01

Pos logit contributions
aye+0.409
ersed+0.364
oing+0.345
 tongues+0.342
 hormones+0.341
Neg logit contributions
ighth-0.356
fs-0.354
brates-0.348
umber-0.342
ibia-0.340
Token Justin
Token ID10799
Feature activation-2.57e+00

Pos logit contributions
aye+0.790
ersed+0.705
oing+0.667
 tongues+0.664
 hormones+0.660
Neg logit contributions
ighth-0.676
fs-0.672
brates-0.662
umber-0.650
ibia-0.647
Token Faul
Token ID44760
Feature activation+2.08e-01

Pos logit contributions
umber+9.957
fs+9.940
ighth+9.438
ters+9.236
emis+9.108
Neg logit contributions
aye-11.998
ersed-11.034
astered-10.055
yip-9.911
oing-9.874
Tokenk
Token ID74
Feature activation+9.44e-01

Pos logit contributions
aye+1.285
ersed+1.175
oing+1.128
 tongues+1.119
 hormones+1.114
Neg logit contributions
ighth-0.593
fs-0.590
brates-0.578
umber-0.561
ibia-0.557
Token,
Token ID11
Feature activation+1.85e-01

Pos logit contributions
aye+3.999
ersed+3.525
oing+3.321
 tongues+3.252
 hormones+3.248
Neg logit contributions
ighth-4.385
brates-4.374
fs-4.343
ibia-4.203
FORE-4.182
Token one
Token ID530
Feature activation-5.41e-01

Pos logit contributions
aye+0.843
ersed+0.744
oing+0.703
 tongues+0.697
 hormones+0.693
Neg logit contributions
ighth-0.835
fs-0.832
brates-0.819
umber-0.805
ibia-0.801
Token of
Token ID286
Feature activation-6.75e-01

Pos logit contributions
ighth+2.729
fs+2.688
umber+2.638
brates+2.629
ibia+2.603
Neg logit contributions
aye-2.206
ersed-1.890
oing-1.763
 hormones-1.735
 tongues-1.732
Tokens
Token ID82
Feature activation-7.17e-01

Pos logit contributions
ighth+0.352
fs+0.351
brates+0.346
umber+0.341
ibia+0.339
Neg logit contributions
aye-0.291
ersed-0.253
oing-0.237
 tongues-0.234
 hormones-0.233
Token somewhat
Token ID6454
Feature activation-4.25e-01

Pos logit contributions
ighth+3.177
fs+3.157
umber+3.104
brates+3.073
ibia+3.015
Neg logit contributions
aye-3.239
ersed-2.891
oing-2.716
 tongues-2.665
yip-2.636
Token based
Token ID1912
Feature activation+1.13e+00

Pos logit contributions
ighth+1.891
fs+1.887
umber+1.841
brates+1.824
ibia+1.803
Neg logit contributions
aye-1.954
ersed-1.729
oing-1.635
astered-1.593
 tongues-1.580
Token on
Token ID319
Feature activation-2.81e-01

Pos logit contributions
aye+4.139
ersed+3.550
oing+3.282
 hormones+3.253
 tongues+3.226
Neg logit contributions
ighth-5.927
brates-5.877
fs-5.853
ibia-5.713
umber-5.680
Token the
Token ID262
Feature activation-1.56e-01

Pos logit contributions
ighth+1.252
fs+1.242
umber+1.215
brates+1.214
ibia+1.193
Neg logit contributions
aye-1.291
ersed-1.152
oing-1.084
 tongues-1.066
astered-1.061
Token comic
Token ID9048
Feature activation-2.55e+00

Pos logit contributions
ighth+0.709
fs+0.705
brates+0.690
umber+0.689
ibia+0.680
Neg logit contributions
aye-0.703
ersed-0.626
oing-0.587
 tongues-0.578
astered-0.574
Token book
Token ID1492
Feature activation-1.04e+00

Pos logit contributions
umber+9.180
fs+8.976
ighth+8.719
cape+8.468
brates+8.430
Neg logit contributions
aye-11.740
ersed-10.794
yip-9.998
interstitial-9.963
anchester-9.767
Token,
Token ID11
Feature activation-7.22e-02

Pos logit contributions
ighth+4.553
umber+4.532
fs+4.529
brates+4.381
ibia+4.363
Neg logit contributions
aye-4.671
ersed-4.187
oing-3.868
yip-3.831
astered-3.773
Token
Token ID447
Feature activation-1.27e-01

Pos logit contributions
ighth+0.343
fs+0.341
brates+0.334
umber+0.333
ibia+0.328
Neg logit contributions
aye-0.312
ersed-0.275
oing-0.258
 tongues-0.253
astered-0.251
Token
Token ID251
Feature activation-4.01e-01

Pos logit contributions
ighth+0.580
fs+0.580
brates+0.568
umber+0.562
ibia+0.559
Neg logit contributions
aye-0.566
ersed-0.500
oing-0.471
 tongues-0.465
astered-0.463
Token Ar
Token ID943
Feature activation-1.81e-01

Pos logit contributions
ighth+1.849
fs+1.841
brates+1.793
umber+1.790
FORE+1.766
Neg logit contributions
aye-1.768
ersed-1.579
oing-1.470
 tongues-1.459
astered-1.437
Token Donald
Token ID3759
Feature activation+1.04e+00

Pos logit contributions
ighth+0.970
fs+0.965
brates+0.944
umber+0.941
ibia+0.928
Neg logit contributions
aye-0.901
ersed-0.799
oing-0.744
astered-0.734
 tongues-0.731
Token Trump
Token ID1301
Feature activation+5.58e-01

Pos logit contributions
aye+4.208
ersed+3.625
oing+3.512
 hormones+3.446
 tongues+3.434
Neg logit contributions
ighth-4.948
brates-4.856
fs-4.850
ibia-4.743
emis-4.721
Token is
Token ID318
Feature activation+4.56e-02

Pos logit contributions
aye+2.684
ersed+2.364
oing+2.270
 tongues+2.240
 hormones+2.238
Neg logit contributions
ighth-2.346
fs-2.337
brates-2.327
ibia-2.246
umber-2.234
Token simply
Token ID2391
Feature activation-1.30e+00

Pos logit contributions
aye+0.221
ersed+0.196
oing+0.186
 tongues+0.185
 hormones+0.184
Neg logit contributions
ighth-0.192
fs-0.191
brates-0.188
umber-0.185
ibia-0.184
Token to
Token ID284
Feature activation-9.10e-01

Pos logit contributions
fs+5.102
ighth+5.086
umber+4.984
ibia+4.895
brates+4.888
Neg logit contributions
aye-6.195
ersed-5.680
oing-5.329
astered-5.269
yip-5.217
Token turn
Token ID1210
Feature activation+2.14e+00

Pos logit contributions
fs+3.621
ighth+3.602
umber+3.503
brates+3.470
ibia+3.424
Neg logit contributions
aye-4.448
ersed-4.011
oing-3.757
anchester-3.749
astered-3.739
Token off
Token ID572
Feature activation+1.41e+00

Pos logit contributions
aye+7.832
 tongues+6.749
 hormones+6.649
ersed+6.427
 antibodies+6.419
Neg logit contributions
brates-10.067
ighth-10.047
fs-9.927
ibia-9.898
FORE-9.717
Token the
Token ID262
Feature activation+9.06e-01

Pos logit contributions
aye+5.408
ersed+4.589
 tongues+4.557
 hormones+4.506
oing+4.329
Neg logit contributions
fs-6.970
ighth-6.940
brates-6.923
ibia-6.819
umber-6.770
Token cable
Token ID7862
Feature activation-5.34e-01

Pos logit contributions
aye+3.782
ersed+3.245
 tongues+3.167
 hormones+3.121
oing+3.092
Neg logit contributions
ighth-4.338
fs-4.325
brates-4.282
umber-4.206
ibia-4.206
Token news
Token ID1705
Feature activation-5.14e-01

Pos logit contributions
fs+2.769
ighth+2.769
brates+2.724
umber+2.652
FORE+2.641
Neg logit contributions
aye-2.018
ersed-1.772
oing-1.631
yip-1.580
anchester-1.561
Token shows
Token ID2523
Feature activation+2.91e-01

Pos logit contributions
fs+2.261
ighth+2.244
brates+2.210
umber+2.163
ibia+2.161
Neg logit contributions
aye-2.367
ersed-2.102
oing-1.981
yip-1.927
astered-1.923
Tokens
Token ID82
Feature activation+1.51e-01

Pos logit contributions
aye+0.594
ersed+0.520
oing+0.488
 tongues+0.483
 hormones+0.480
Neg logit contributions
ighth-0.680
fs-0.677
brates-0.667
umber-0.657
ibia-0.654
Token leadership
Token ID5531
Feature activation+4.70e-01

Pos logit contributions
aye+0.590
ersed+0.510
oing+0.476
 tongues+0.470
 hormones+0.466
Neg logit contributions
ighth-0.778
fs-0.776
brates-0.765
umber-0.754
ibia-0.751
Token:
Token ID25
Feature activation-5.51e-01

Pos logit contributions
aye+2.084
ersed+1.830
oing+1.741
 hormones+1.723
 tongues+1.719
Neg logit contributions
ighth-2.168
fs-2.147
brates-2.122
umber-2.076
ibia-2.070
Token
Token ID198
Feature activation+3.59e-01

Pos logit contributions
fs+2.055
ighth+2.033
umber+1.981
brates+1.979
ibia+1.951
Neg logit contributions
aye-2.882
ersed-2.618
oing-2.476
astered-2.449
 tongues-2.441
Token
Token ID198
Feature activation+2.93e-01

Pos logit contributions
aye+1.742
ersed+1.567
oing+1.465
 tongues+1.457
 hormones+1.454
Neg logit contributions
ighth-1.490
fs-1.449
brates-1.448
umber-1.415
ibia-1.406
TokenLib
Token ID25835
Feature activation+2.30e+00

Pos logit contributions
aye+1.391
ersed+1.231
oing+1.169
 tongues+1.151
 hormones+1.147
Neg logit contributions
ighth-1.261
fs-1.247
brates-1.237
umber-1.209
ibia-1.205
Tokenya
Token ID3972
Feature activation+7.39e-01

Pos logit contributions
aye+8.263
oing+6.207
ersed+6.037
oti+5.915
istani+5.848
Neg logit contributions
ighth-11.326
brates-11.318
FORE-10.908
fs-10.788
umber-10.678
Token needs
Token ID2476
Feature activation-3.72e-01

Pos logit contributions
aye+3.306
ersed+2.905
oing+2.767
 tongues+2.738
 hormones+2.730
Neg logit contributions
ighth-3.356
brates-3.296
fs-3.294
umber-3.204
ibia-3.167
Token to
Token ID284
Feature activation-5.35e-01

Pos logit contributions
fs+1.646
ighth+1.627
brates+1.600
ibia+1.582
umber+1.581
Neg logit contributions
aye-1.701
ersed-1.520
oing-1.417
astered-1.405
 tongues-1.400
Token exploit
Token ID14561
Feature activation-3.94e-01

Pos logit contributions
fs+2.242
ighth+2.235
umber+2.186
brates+2.182
ibia+2.170
Neg logit contributions
aye-2.548
ersed-2.308
oing-2.148
astered-2.129
 tongues-2.125
Token its
Token ID663
Feature activation-1.01e+00

Pos logit contributions
fs+1.746
ighth+1.736
brates+1.701
ibia+1.685
umber+1.680
Neg logit contributions
aye-1.802
ersed-1.602
oing-1.504
astered-1.493
 tongues-1.477
Token in
Token ID287
Feature activation+2.46e-01

Pos logit contributions
aye+0.468
ersed+0.410
oing+0.385
 tongues+0.379
astered+0.378
Neg logit contributions
ighth-0.514
fs-0.512
brates-0.501
umber-0.495
ibia-0.493
Token all
Token ID477
Feature activation-3.81e-01

Pos logit contributions
aye+1.142
ersed+1.012
oing+0.958
 tongues+0.951
 hormones+0.946
Neg logit contributions
ighth-1.080
fs-1.076
brates-1.061
umber-1.041
ibia-1.037
Token likelihood
Token ID14955
Feature activation-7.62e-01

Pos logit contributions
ighth+1.398
fs+1.395
brates+1.356
umber+1.349
ibia+1.327
Neg logit contributions
aye-2.045
ersed-1.832
oing-1.733
yip-1.714
astered-1.707
Token,
Token ID11
Feature activation-9.50e-02

Pos logit contributions
fs+3.178
ighth+3.121
ibia+3.040
brates+3.021
umber+3.003
Neg logit contributions
aye-3.626
ersed-3.226
oing-3.028
astered-3.010
 tongues-2.985
Token from
Token ID422
Feature activation+3.73e-01

Pos logit contributions
fs+0.489
ighth+0.489
brates+0.479
umber+0.475
ibia+0.473
Neg logit contributions
aye-0.370
ersed-0.319
oing-0.295
 tongues-0.293
astered-0.292
Token making
Token ID1642
Feature activation+2.24e+00

Pos logit contributions
aye+1.748
ersed+1.544
oing+1.476
 tongues+1.463
 hormones+1.460
Neg logit contributions
ighth-1.625
fs-1.611
brates-1.599
umber-1.566
ibia-1.551
Token the
Token ID262
Feature activation+1.70e+00

Pos logit contributions
aye+8.210
 antibodies+6.781
 tongues+6.718
ersed+6.688
 hormones+6.669
Neg logit contributions
ighth-10.716
fs-10.401
brates-10.387
FORE-10.313
umber-10.307
Token playoffs
Token ID12980
Feature activation+9.35e-01

Pos logit contributions
aye+7.441
 hormones+6.207
ersed+6.170
 tongues+6.104
 antibodies+6.100
Neg logit contributions
ighth-7.257
brates-7.252
fs-7.168
FORE-7.104
ibia-7.086
Token in
Token ID287
Feature activation+3.82e-01

Pos logit contributions
aye+3.982
ersed+3.450
 hormones+3.270
 tongues+3.269
oing+3.269
Neg logit contributions
ighth-4.387
brates-4.339
fs-4.331
umber-4.209
ibia-4.196
Token Kat
Token ID8595
Feature activation-1.33e+00

Pos logit contributions
aye+1.633
ersed+1.429
oing+1.345
 tongues+1.341
 hormones+1.330
Neg logit contributions
ighth-1.816
fs-1.805
brates-1.786
umber-1.753
ibia-1.748
Tokenow
Token ID322
Feature activation-9.38e-02

Pos logit contributions
fs+5.506
ighth+5.449
ibia+5.249
ters+5.146
brates+5.143
Neg logit contributions
aye-5.989
ersed-5.514
 hormones-5.373
oing-5.340
yip-5.291
Token
Token ID198
Feature activation+2.47e-01

Pos logit contributions
aye+0.681
ersed+0.607
oing+0.576
 tongues+0.571
 hormones+0.568
Neg logit contributions
ighth-0.575
fs-0.572
brates-0.562
umber-0.552
ibia-0.549
Token
Token ID198
Feature activation-1.97e-02

Pos logit contributions
aye+1.237
ersed+1.111
oing+1.046
 tongues+1.040
 hormones+1.037
Neg logit contributions
ighth-0.998
fs-0.977
brates-0.971
umber-0.949
ibia-0.945
TokenYou
Token ID1639
Feature activation+3.51e-01

Pos logit contributions
ighth+0.081
fs+0.080
brates+0.079
umber+0.078
ibia+0.077
Neg logit contributions
aye-0.098
ersed-0.087
oing-0.083
 tongues-0.083
 hormones-0.082
Token can
Token ID460
Feature activation+2.62e-01

Pos logit contributions
aye+1.540
ersed+1.355
oing+1.277
 tongues+1.266
 hormones+1.259
Neg logit contributions
ighth-1.632
fs-1.624
brates-1.605
umber-1.576
ibia-1.570
Token't
Token ID470
Feature activation+9.30e-01

Pos logit contributions
aye+1.282
ersed+1.143
oing+1.084
 tongues+1.073
 hormones+1.069
Neg logit contributions
ighth-1.091
fs-1.087
brates-1.067
umber-1.049
ibia-1.043
Token make
Token ID787
Feature activation+2.42e+00

Pos logit contributions
aye+4.434
ersed+3.889
 tongues+3.797
oing+3.761
 hormones+3.741
Neg logit contributions
ighth-3.884
fs-3.822
brates-3.814
ibia-3.704
umber-3.694
Token this
Token ID428
Feature activation+1.60e+00

Pos logit contributions
aye+9.126
 tongues+8.143
 hormones+7.831
 antibodies+7.794
ersed+7.621
Neg logit contributions
brates-10.890
ighth-10.882
fs-10.850
ibia-10.555
umber-10.410
Token up
Token ID510
Feature activation+7.21e-01

Pos logit contributions
aye+6.383
ersed+5.315
 hormones+5.242
 tongues+5.202
oing+5.202
Neg logit contributions
brates-7.595
ighth-7.575
fs-7.562
ibia-7.426
FORE-7.253
Token:
Token ID25
Feature activation-3.26e-01

Pos logit contributions
aye+3.302
ersed+2.915
oing+2.773
 tongues+2.764
 hormones+2.763
Neg logit contributions
ighth-3.192
fs-3.162
brates-3.125
ibia-3.046
umber-3.012
Token
Token ID198
Feature activation-9.38e-02

Pos logit contributions
ighth+1.396
fs+1.392
umber+1.355
brates+1.351
ibia+1.324
Neg logit contributions
aye-1.540
ersed-1.364
oing-1.305
astered-1.273
 tongues-1.263
Token
Token ID198
Feature activation-1.85e-01

Pos logit contributions
ighth+0.354
fs+0.352
brates+0.346
umber+0.339
ibia+0.337
Neg logit contributions
aye-0.496
ersed-0.445
oing-0.425
 tongues-0.421
 hormones-0.419
Token to
Token ID284
Feature activation-5.76e-01

Pos logit contributions
aye+3.129
ersed+2.803
oing+2.650
 tongues+2.625
astered+2.595
Neg logit contributions
ighth-3.063
fs-3.004
brates-2.980
ibia-2.917
FORE-2.896
Token the
Token ID262
Feature activation-4.06e-01

Pos logit contributions
fs+2.330
ighth+2.311
umber+2.282
brates+2.264
ibia+2.231
Neg logit contributions
aye-2.833
ersed-2.556
oing-2.394
astered-2.354
 tongues-2.341
Token required
Token ID2672
Feature activation+6.95e-02

Pos logit contributions
fs+1.565
ighth+1.562
umber+1.516
brates+1.512
ibia+1.499
Neg logit contributions
aye-2.091
ersed-1.892
oing-1.781
astered-1.756
 hormones-1.751
Token
Token ID198
Feature activation+2.58e-01

Pos logit contributions
aye+0.359
ersed+0.322
oing+0.307
 tongues+0.304
 hormones+0.303
Neg logit contributions
ighth-0.270
fs-0.269
brates-0.263
umber-0.259
ibia-0.257
Token
Token ID198
Feature activation+1.18e-01

Pos logit contributions
aye+1.292
ersed+1.160
oing+1.093
 tongues+1.090
 hormones+1.086
Neg logit contributions
ighth-1.039
fs-1.017
brates-1.009
umber-0.987
ibia-0.981
Tokendim
Token ID27740
Feature activation+2.76e+00

Pos logit contributions
aye+0.633
ersed+0.570
oing+0.544
 tongues+0.539
 hormones+0.537
Neg logit contributions
ighth-0.433
fs-0.431
brates-0.422
umber-0.414
ibia-0.411
Tokenensions
Token ID5736
Feature activation-3.02e-01

Pos logit contributions
aye+9.380
ersed+8.198
stone+7.309
oing+7.192
ersion+7.178
Neg logit contributions
FORE-13.092
brates-13.001
ighth-12.753
emis-12.371
fs-12.226
Token,
Token ID11
Feature activation+2.34e-01

Pos logit contributions
fs+1.303
ighth+1.290
umber+1.266
brates+1.265
ibia+1.244
Neg logit contributions
aye-1.425
ersed-1.272
oing-1.203
 tongues-1.186
astered-1.174
Token with
Token ID351
Feature activation+6.14e-01

Pos logit contributions
aye+1.034
ersed+0.908
oing+0.856
 tongues+0.846
 hormones+0.842
Neg logit contributions
ighth-1.089
fs-1.086
brates-1.069
umber-1.052
ibia-1.047
Token a
Token ID257
Feature activation+1.04e-01

Pos logit contributions
aye+2.658
ersed+2.329
 tongues+2.213
oing+2.197
 hormones+2.192
Neg logit contributions
ighth-2.871
fs-2.857
brates-2.829
umber-2.768
ibia-2.755
Token few
Token ID1178
Feature activation+4.64e-01

Pos logit contributions
aye+0.498
ersed+0.442
oing+0.419
 tongues+0.415
 hormones+0.413
Neg logit contributions
ighth-0.441
fs-0.439
brates-0.431
umber-0.424
ibia-0.422
TokenACT
Token ID10659
Feature activation-3.30e-01

Pos logit contributions
aye+0.658
ersed+0.578
oing+0.544
 tongues+0.539
 hormones+0.536
Neg logit contributions
ighth-0.697
fs-0.695
brates-0.684
umber-0.673
ibia-0.670
TokenERS
Token ID4877
Feature activation+8.29e-02

Pos logit contributions
fs+1.493
ighth+1.468
brates+1.463
FORE+1.440
ibia+1.431
Neg logit contributions
aye-1.529
ersed-1.299
oing-1.255
astered-1.240
 tongues-1.234
Token IN
Token ID3268
Feature activation-6.56e-02

Pos logit contributions
aye+0.358
ersed+0.314
oing+0.296
 tongues+0.293
 hormones+0.291
Neg logit contributions
ighth-0.392
fs-0.391
brates-0.385
umber-0.379
ibia-0.377
Token D
Token ID360
Feature activation+2.34e-01

Pos logit contributions
fs+0.290
ighth+0.289
brates+0.285
umber+0.280
FORE+0.279
Neg logit contributions
aye-0.306
ersed-0.269
oing-0.255
 hormones-0.251
 tongues-0.251
TokenANG
Token ID15567
Feature activation-6.18e-02

Pos logit contributions
aye+1.218
ersed+1.102
oing+1.049
 tongues+1.037
 hormones+1.033
Neg logit contributions
ighth-0.879
fs-0.871
brates-0.866
ibia-0.844
umber-0.834
TokenER
Token ID1137
Feature activation+6.03e-01

Pos logit contributions
fs+0.296
ighth+0.294
brates+0.290
umber+0.287
FORE+0.286
Neg logit contributions
aye-0.265
ersed-0.230
oing-0.214
 tongues-0.214
 hormones-0.212
Token.
Token ID13
Feature activation-1.04e-01

Pos logit contributions
aye+2.602
ersed+2.327
 tongues+2.180
oing+2.171
 hormones+2.156
Neg logit contributions
ighth-2.819
fs-2.728
brates-2.712
umber-2.677
ibia-2.669
Token
Token ID198
Feature activation+1.48e-01

Pos logit contributions
ighth+0.422
fs+0.422
brates+0.414
umber+0.406
ibia+0.403
Neg logit contributions
aye-0.522
ersed-0.467
oing-0.445
 tongues-0.439
 hormones-0.437
Token
Token ID198
Feature activation-4.22e-01

Pos logit contributions
aye+0.755
ersed+0.678
oing+0.642
 tongues+0.638
 hormones+0.635
Neg logit contributions
ighth-0.583
fs-0.575
brates-0.568
umber-0.557
ibia-0.553
TokenHow
Token ID2437
Feature activation-5.57e-01

Pos logit contributions
fs+1.725
ighth+1.715
brates+1.658
FORE+1.650
umber+1.641
Neg logit contributions
aye-2.095
ersed-1.883
 tongues-1.801
oing-1.794
 hormones-1.781
Token will
Token ID481
Feature activation+6.30e-01

Pos logit contributions
fs+2.444
ighth+2.413
brates+2.381
umber+2.324
 else+2.314
Neg logit contributions
aye-2.573
ersed-2.297
oing-2.153
astered-2.101
 tongues-2.086
Token Netflix
Token ID12074
Feature activation-4.61e-01

Pos logit contributions
fs+2.272
ighth+2.266
umber+2.212
brates+2.194
ibia+2.176
Neg logit contributions
aye-2.455
ersed-2.207
oing-2.065
astered-2.028
 tongues-2.007
Token)
Token ID8
Feature activation-5.80e-01

Pos logit contributions
fs+1.932
ighth+1.928
brates+1.885
umber+1.864
ibia+1.858
Neg logit contributions
aye-2.225
ersed-1.990
oing-1.854
 tongues-1.847
astered-1.829
Token supported
Token ID4855
Feature activation-4.92e-01

Pos logit contributions
fs+2.759
ighth+2.731
umber+2.673
brates+2.667
ibia+2.638
Neg logit contributions
aye-2.475
ersed-2.137
oing-2.022
 tongues-1.964
 hormones-1.960
Token the
Token ID262
Feature activation-6.79e-01

Pos logit contributions
fs+2.194
ighth+2.192
brates+2.139
umber+2.116
ibia+2.081
Neg logit contributions
aye-2.233
ersed-1.998
oing-1.865
astered-1.837
 tongues-1.830
Token FCC
Token ID15915
Feature activation+1.20e-01

Pos logit contributions
fs+3.104
ighth+3.078
brates+2.997
umber+2.993
ibia+2.928
Neg logit contributions
aye-2.996
ersed-2.657
oing-2.470
 tongues-2.445
astered-2.444
Token power
Token ID1176
Feature activation+3.97e-01

Pos logit contributions
aye+0.556
ersed+0.494
oing+0.465
 tongues+0.462
 hormones+0.459
Neg logit contributions
ighth-0.528
fs-0.527
brates-0.517
umber-0.510
ibia-0.506
Token grab
Token ID5552
Feature activation+5.49e-01

Pos logit contributions
aye+1.842
ersed+1.628
oing+1.541
 tongues+1.524
 hormones+1.519
Neg logit contributions
ighth-1.752
fs-1.732
brates-1.711
ibia-1.682
umber-1.681
Token of
Token ID286
Feature activation+8.80e-02

Pos logit contributions
aye+2.537
ersed+2.240
oing+2.130
 tongues+2.104
 hormones+2.102
Neg logit contributions
ighth-2.418
fs-2.385
brates-2.363
ibia-2.314
umber-2.309
Token net
Token ID2010
Feature activation-3.79e-01

Pos logit contributions
aye+0.403
ersed+0.357
oing+0.337
 tongues+0.333
 hormones+0.332
Neg logit contributions
ighth-0.394
fs-0.392
brates-0.386
umber-0.380
ibia-0.377
Token neutrality
Token ID20723
Feature activation-2.97e-01

Pos logit contributions
ighth+1.568
fs+1.568
brates+1.518
ibia+1.488
umber+1.488
Neg logit contributions
aye-1.865
ersed-1.661
oing-1.572
 hormones-1.554
 tongues-1.551
Token.
Token ID13
Feature activation-1.31e-01

Pos logit contributions
fs+1.251
ighth+1.239
brates+1.214
umber+1.201
ibia+1.191
Neg logit contributions
aye-1.426
ersed-1.277
oing-1.200
 tongues-1.195
 hormones-1.183
Token the
Token ID262
Feature activation+2.29e-01

Pos logit contributions
aye+1.500
ersed+1.310
 tongues+1.239
oing+1.239
 hormones+1.234
Neg logit contributions
fs-1.550
ighth-1.549
brates-1.530
umber-1.502
ibia-1.493
Token Mid
Token ID7215
Feature activation+8.94e-01

Pos logit contributions
aye+1.029
ersed+0.905
oing+0.855
 tongues+0.849
 hormones+0.845
Neg logit contributions
ighth-1.041
fs-1.037
brates-1.023
umber-1.005
ibia-1.000
Tokenas
Token ID292
Feature activation-1.51e-01

Pos logit contributions
aye+3.772
ersed+3.290
oing+3.125
 tongues+3.073
 hormones+3.035
Neg logit contributions
brates-4.197
ighth-4.189
fs-4.182
ibia-4.082
FORE-4.051
Token Touch
Token ID15957
Feature activation+8.81e-01

Pos logit contributions
ighth+0.429
fs+0.425
brates+0.411
umber+0.407
ibia+0.397
Neg logit contributions
aye-0.933
ersed-0.864
oing-0.825
astered-0.812
 tongues-0.812
Token that
Token ID326
Feature activation+3.06e-01

Pos logit contributions
aye+4.126
ersed+3.613
 tongues+3.442
oing+3.426
 hormones+3.422
Neg logit contributions
ighth-3.748
brates-3.717
fs-3.698
ibia-3.628
umber-3.595
Token turns
Token ID4962
Feature activation+9.43e-01

Pos logit contributions
aye+1.343
ersed+1.179
oing+1.110
 tongues+1.103
 hormones+1.096
Neg logit contributions
ighth-1.423
fs-1.416
brates-1.396
umber-1.373
ibia-1.366
Token everything
Token ID2279
Feature activation-1.09e-01

Pos logit contributions
aye+3.992
ersed+3.451
 tongues+3.337
 hormones+3.323
oing+3.285
Neg logit contributions
ighth-4.439
fs-4.431
brates-4.401
ibia-4.290
umber-4.288
Token into
Token ID656
Feature activation+1.23e-01

Pos logit contributions
ighth+0.533
fs+0.529
brates+0.517
umber+0.517
ibia+0.508
Neg logit contributions
aye-0.456
ersed-0.400
oing-0.373
 tongues-0.366
astered-0.365
Token s
Token ID264
Feature activation-6.33e-01

Pos logit contributions
aye+0.540
ersed+0.476
oing+0.447
 tongues+0.438
astered+0.437
Neg logit contributions
ighth-0.572
fs-0.569
brates-0.559
umber-0.551
ibia-0.548
Token***
Token ID8162
Feature activation+5.85e-01

Pos logit contributions
ighth+2.530
fs+2.486
umber+2.398
brates+2.373
ibia+2.361
Neg logit contributions
aye-3.199
ersed-2.881
 tongues-2.721
oing-2.707
 hormones-2.704
Token.
Token ID13
Feature activation+2.04e-01

Pos logit contributions
aye+2.782
ersed+2.472
oing+2.353
 tongues+2.349
 hormones+2.347
Neg logit contributions
ighth-2.485
fs-2.456
brates-2.437
ibia-2.375
umber-2.371
Tokenand
Token ID392
Feature activation+7.09e-01

Pos logit contributions
aye+4.530
ersed+4.006
oing+3.830
 tongues+3.726
astered+3.714
Neg logit contributions
ighth-3.522
brates-3.485
fs-3.440
ibia-3.373
umber-3.339
Token remain
Token ID3520
Feature activation-1.14e+00

Pos logit contributions
aye+3.009
ersed+2.634
oing+2.486
 tongues+2.469
 hormones+2.464
Neg logit contributions
ighth-3.369
fs-3.356
brates-3.324
ibia-3.248
umber-3.245
Token)
Token ID8
Feature activation+3.29e-01

Pos logit contributions
ighth+3.864
fs+3.827
brates+3.767
ibia+3.673
umber+3.663
Neg logit contributions
aye-6.102
ersed-5.434
yip-5.242
oing-5.161
 hormones-5.127
Token good
Token ID922
Feature activation+5.71e-01

Pos logit contributions
aye+1.393
ersed+1.222
oing+1.149
 tongues+1.139
 hormones+1.131
Neg logit contributions
ighth-1.577
fs-1.570
brates-1.549
umber-1.525
ibia-1.519
Token friends
Token ID2460
Feature activation-2.62e-01

Pos logit contributions
aye+2.074
ersed+1.777
oing+1.672
 tongues+1.653
 hormones+1.639
Neg logit contributions
ighth-3.083
fs-3.037
brates-3.015
umber-2.986
ibia-2.963
Token.
Token ID13
Feature activation+4.44e-01

Pos logit contributions
fs+1.111
ighth+1.102
brates+1.085
umber+1.073
ibia+1.064
Neg logit contributions
aye-1.257
ersed-1.119
oing-1.059
 tongues-1.046
astered-1.041
Token I
Token ID314
Feature activation-6.13e-02

Pos logit contributions
aye+1.978
ersed+1.748
oing+1.637
 tongues+1.636
 hormones+1.621
Neg logit contributions
ighth-2.028
fs-2.009
brates-1.987
umber-1.958
ibia-1.944
Token swear
Token ID21192
Feature activation+1.13e+00

Pos logit contributions
ighth+0.269
fs+0.268
brates+0.263
umber+0.259
ibia+0.258
Neg logit contributions
aye-0.287
ersed-0.253
oing-0.240
 tongues-0.237
 hormones-0.236
Token to
Token ID284
Feature activation-2.50e-01

Pos logit contributions
aye+4.751
ersed+4.089
 tongues+3.967
 hormones+3.888
oing+3.819
Neg logit contributions
ighth-5.363
brates-5.287
fs-5.245
umber-5.112
ibia-5.109
Token you
Token ID345
Feature activation+7.52e-01

Pos logit contributions
ighth+1.145
fs+1.143
brates+1.118
umber+1.105
ibia+1.093
Neg logit contributions
aye-1.110
ersed-0.976
oing-0.921
astered-0.903
 hormones-0.898
Token that
Token ID326
Feature activation+7.03e-01

Pos logit contributions
aye+3.386
ersed+2.982
oing+2.814
 tongues+2.808
 hormones+2.791
Neg logit contributions
ighth-3.385
fs-3.354
brates-3.336
umber-3.266
ibia-3.250
Token unexpected
Token ID10059
Feature activation+3.29e-01

Pos logit contributions
fs+0.852
ighth+0.849
brates+0.836
umber+0.822
ibia+0.821
Neg logit contributions
aye-0.842
ersed-0.744
oing-0.699
astered-0.689
 tongues-0.686
Token market
Token ID1910
Feature activation-2.61e-01

Pos logit contributions
aye+1.378
ersed+1.202
oing+1.132
 tongues+1.125
 hormones+1.118
Neg logit contributions
ighth-1.604
fs-1.593
brates-1.573
umber-1.551
ibia-1.540
Token shifts
Token ID15381
Feature activation+6.41e-01

Pos logit contributions
fs+1.274
ighth+1.260
brates+1.248
ibia+1.230
umber+1.221
Neg logit contributions
aye-1.078
ersed-0.932
oing-0.873
astered-0.864
 tongues-0.850
Token never
Token ID1239
Feature activation-4.23e-01

Pos logit contributions
aye+2.684
ersed+2.349
 tongues+2.202
 hormones+2.200
oing+2.193
Neg logit contributions
ighth-3.095
fs-3.063
brates-3.035
umber-2.971
ibia-2.950
Token seems
Token ID2331
Feature activation+4.21e-01

Pos logit contributions
fs+1.828
ighth+1.825
brates+1.790
ibia+1.779
umber+1.746
Neg logit contributions
aye-2.015
ersed-1.764
oing-1.663
astered-1.642
 hormones-1.611
Token to
Token ID284
Feature activation+5.56e-01

Pos logit contributions
aye+1.894
ersed+1.672
oing+1.575
 tongues+1.571
 hormones+1.559
Neg logit contributions
ighth-1.912
fs-1.901
brates-1.871
umber-1.844
ibia-1.829
Token damp
Token ID21151
Feature activation+1.17e+00

Pos logit contributions
aye+2.223
ersed+1.921
oing+1.814
 tongues+1.811
 hormones+1.792
Neg logit contributions
ighth-2.792
fs-2.778
brates-2.736
umber-2.695
ibia-2.675
Tokenen
Token ID268
Feature activation-1.64e-01

Pos logit contributions
aye+5.866
ersed+5.210
oing+4.906
 tongues+4.904
astered+4.877
Neg logit contributions
ighth-4.666
fs-4.646
brates-4.630
ibia-4.543
umber-4.470
Token their
Token ID511
Feature activation+6.74e-01

Pos logit contributions
fs+0.735
ighth+0.733
brates+0.721
umber+0.711
ibia+0.707
Neg logit contributions
aye-0.743
ersed-0.657
oing-0.616
astered-0.610
 hormones-0.606
Token determination
Token ID12123
Feature activation+3.88e-01

Pos logit contributions
aye+3.002
ersed+2.635
 tongues+2.551
oing+2.535
 hormones+2.529
Neg logit contributions
ighth-3.077
fs-3.058
brates-3.024
umber-2.958
ibia-2.932
Token to
Token ID284
Feature activation-2.91e-01

Pos logit contributions
aye+1.698
ersed+1.490
 tongues+1.412
oing+1.406
 hormones+1.404
Neg logit contributions
ighth-1.813
fs-1.782
brates-1.765
umber-1.723
ibia-1.717
Token doing
Token ID1804
Feature activation+3.84e-01

Pos logit contributions
fs+4.566
ighth+4.399
umber+4.354
brates+4.316
ces+4.301
Neg logit contributions
aye-5.176
ersed-4.718
oing-4.366
yip-4.365
astered-4.312
Token something
Token ID1223
Feature activation-5.10e-01

Pos logit contributions
aye+1.802
ersed+1.598
oing+1.512
 tongues+1.497
 hormones+1.489
Neg logit contributions
ighth-1.674
fs-1.667
brates-1.640
umber-1.612
ibia-1.603
Token they
Token ID484
Feature activation-4.93e-01

Pos logit contributions
fs+2.430
ighth+2.395
brates+2.377
umber+2.368
 else+2.318
Neg logit contributions
aye-2.176
ersed-1.915
oing-1.778
astered-1.759
 tongues-1.740
Token enjoy
Token ID2883
Feature activation-9.36e-01

Pos logit contributions
fs+2.096
ighth+2.073
umber+2.026
brates+2.010
ibia+1.980
Neg logit contributions
aye-2.382
ersed-2.098
oing-1.988
 tongues-1.972
yip-1.949
Token,
Token ID11
Feature activation-3.26e-01

Pos logit contributions
ighth+3.904
umber+3.879
fs+3.873
brates+3.725
ibia+3.709
Neg logit contributions
aye-4.437
ersed-3.958
 tongues-3.675
oing-3.667
astered-3.620
Token and
Token ID290
Feature activation+3.05e-01

Pos logit contributions
ighth+1.548
fs+1.545
brates+1.503
umber+1.495
ibia+1.482
Neg logit contributions
aye-1.400
ersed-1.222
oing-1.149
 tongues-1.125
astered-1.120
Token are
Token ID389
Feature activation+1.27e-01

Pos logit contributions
aye+1.404
ersed+1.244
oing+1.177
 tongues+1.168
 hormones+1.166
Neg logit contributions
ighth-1.350
fs-1.342
brates-1.327
umber-1.298
ibia-1.295
Token now
Token ID783
Feature activation-2.02e-01

Pos logit contributions
aye+0.522
ersed+0.453
oing+0.424
 tongues+0.420
 hormones+0.417
Neg logit contributions
ighth-0.630
fs-0.628
brates-0.617
umber-0.610
ibia-0.606
Token far
Token ID1290
Feature activation-6.48e-01

Pos logit contributions
ighth+0.853
fs+0.853
brates+0.829
umber+0.824
ibia+0.813
Neg logit contributions
aye-0.974
ersed-0.859
oing-0.811
 tongues-0.808
astered-0.802
Token happier
Token ID23030
Feature activation-1.39e+00

Pos logit contributions
fs+2.419
ighth+2.402
umber+2.364
brates+2.295
ibia+2.291
Neg logit contributions
aye-3.421
ersed-3.018
 tongues-2.899
oing-2.876
 hormones-2.861
Token.
Token ID13
Feature activation+1.32e-01

Pos logit contributions
fs+5.713
umber+5.599
ighth+5.543
brates+5.437
 else+5.411
Neg logit contributions
aye-6.479
ersed-5.685
yip-5.396
interstitial-5.382
oing-5.366
Token the
Token ID262
Feature activation+6.99e-02

Pos logit contributions
aye+0.122
ersed+0.107
oing+0.101
 tongues+0.099
astered+0.098
Neg logit contributions
ighth-0.132
fs-0.131
brates-0.129
umber-0.127
ibia-0.126
Token H
Token ID367
Feature activation-7.98e-01

Pos logit contributions
aye+0.321
ersed+0.285
oing+0.268
 tongues+0.264
 hormones+0.263
Neg logit contributions
ighth-0.313
fs-0.310
brates-0.305
umber-0.301
ibia-0.300
Token1
Token ID16
Feature activation+9.02e-02

Pos logit contributions
ighth+2.745
umber+2.652
fs+2.625
brates+2.534
ibia+2.492
Neg logit contributions
aye-4.508
ersed-4.143
oing-3.935
 hormones-3.901
 tongues-3.887
TokenN
Token ID45
Feature activation-2.05e-01

Pos logit contributions
aye+0.464
ersed+0.419
oing+0.397
 tongues+0.393
 hormones+0.391
Neg logit contributions
ighth-0.352
fs-0.351
brates-0.344
umber-0.338
ibia-0.334
Token1
Token ID16
Feature activation-3.01e-01

Pos logit contributions
ighth+0.703
fs+0.686
brates+0.674
umber+0.660
ibia+0.655
Neg logit contributions
aye-1.162
ersed-1.059
oing-1.017
 tongues-0.991
astered-0.989
Token strain
Token ID14022
Feature activation+2.22e-01

Pos logit contributions
ighth+1.542
fs+1.534
brates+1.508
umber+1.487
ibia+1.476
Neg logit contributions
aye-1.172
ersed-1.032
oing-0.961
 tongues-0.936
astered-0.934
Token which
Token ID543
Feature activation+9.02e-01

Pos logit contributions
aye+1.022
ersed+0.905
oing+0.855
 tongues+0.845
 hormones+0.841
Neg logit contributions
ighth-0.990
fs-0.986
brates-0.969
umber-0.954
ibia-0.949
Token caused
Token ID4073
Feature activation+5.36e-02

Pos logit contributions
aye+3.667
ersed+3.184
 tongues+3.029
 hormones+2.989
 antibodies+2.965
Neg logit contributions
ighth-4.431
fs-4.391
brates-4.327
umber-4.269
ibia-4.257
Token the
Token ID262
Feature activation+1.28e-01

Pos logit contributions
aye+0.238
ersed+0.210
oing+0.197
astered+0.194
 tongues+0.193
Neg logit contributions
ighth-0.247
fs-0.246
brates-0.242
umber-0.238
ibia-0.238
Token 1918
Token ID25859
Feature activation+4.32e-01

Pos logit contributions
aye+0.588
ersed+0.521
oing+0.491
 tongues+0.486
 hormones+0.484
Neg logit contributions
ighth-0.570
fs-0.567
brates-0.558
umber-0.549
ibia-0.546
Token flu
Token ID6562
Feature activation+3.92e-01

Pos logit contributions
aye+1.961
ersed+1.716
oing+1.639
 tongues+1.629
 hormones+1.619
Neg logit contributions
ighth-1.942
fs-1.931
brates-1.900
umber-1.869
ibia-1.865
Token
Token ID198
Feature activation-1.73e-01

Pos logit contributions
aye+0.253
ersed+0.228
oing+0.216
 tongues+0.214
 hormones+0.213
Neg logit contributions
ighth-0.189
fs-0.187
brates-0.184
umber-0.181
ibia-0.179
Token
Token ID13697
Feature activation+1.29e+00

Pos logit contributions
fs+0.683
ighth+0.683
brates+0.667
umber+0.655
ibia+0.651
Neg logit contributions
aye-0.882
ersed-0.791
oing-0.753
 tongues-0.749
 hormones-0.745
TokenSte
Token ID7447
Feature activation-7.41e-01

Pos logit contributions
aye+5.886
ersed+5.130
oing+4.831
 tongues+4.739
 hormones+4.712
Neg logit contributions
ighth-5.488
brates-5.397
umber-5.332
fs-5.298
ibia-5.268
Tokenlla
Token ID8466
Feature activation+9.12e-01

Pos logit contributions
fs+2.700
ighth+2.585
brates+2.553
ibia+2.516
umber+2.498
Neg logit contributions
aye-3.968
ersed-3.580
 hormones-3.535
 tongues-3.516
 ultras-3.478
Token-
Token ID12
Feature activation-4.19e-01

Pos logit contributions
aye+3.970
ersed+3.449
oing+3.298
 tongues+3.255
 hormones+3.248
Neg logit contributions
ighth-4.199
brates-4.124
fs-4.097
umber-4.013
ibia-4.000
Tokensan
Token ID12807
Feature activation-9.22e-01

Pos logit contributions
fs+1.457
ighth+1.435
brates+1.388
umber+1.373
ibia+1.355
Neg logit contributions
aye-2.309
ersed-2.129
 tongues-2.031
oing-2.022
 hormones-2.017
Token collapsed
Token ID14707
Feature activation+8.81e-03

Pos logit contributions
fs+3.743
ighth+3.662
umber+3.569
brates+3.547
ibia+3.524
Neg logit contributions
aye-4.438
ersed-4.032
 tongues-3.768
astered-3.733
oing-3.722
Token?」
Token ID42943
Feature activation+8.70e-02

Pos logit contributions
aye+0.040
ersed+0.035
oing+0.033
 tongues+0.033
 hormones+0.032
Neg logit contributions
fs-0.040
ighth-0.040
brates-0.039
umber-0.039
ibia-0.038
Token
Token ID198
Feature activation+3.24e-01

Pos logit contributions
aye+0.448
ersed+0.402
oing+0.383
 tongues+0.379
 hormones+0.378
Neg logit contributions
ighth-0.340
fs-0.337
brates-0.332
umber-0.325
ibia-0.324
Token
Token ID198
Feature activation+1.12e-01

Pos logit contributions
aye+1.594
ersed+1.435
oing+1.346
 tongues+1.340
 hormones+1.335
Neg logit contributions
ighth-1.332
fs-1.297
brates-1.291
umber-1.263
ibia-1.253
Token"
Token ID1
Feature activation+7.00e-02

Pos logit contributions
aye+0.666
ersed+0.605
oing+0.580
 tongues+0.574
 hormones+0.572
Neg logit contributions
ighth-0.353
fs-0.348
brates-0.343
umber-0.334
ibia-0.331
Token behavior
Token ID4069
Feature activation-2.98e-01

Pos logit contributions
aye+1.049
ersed+0.917
oing+0.861
 tongues+0.849
astered+0.845
Neg logit contributions
ighth-1.161
fs-1.158
brates-1.138
umber-1.121
ibia-1.117
Token,
Token ID11
Feature activation-2.34e-01

Pos logit contributions
fs+1.307
ighth+1.306
brates+1.282
umber+1.264
ibia+1.261
Neg logit contributions
aye-1.380
ersed-1.222
oing-1.155
 tongues-1.136
astered-1.132
Token the
Token ID262
Feature activation-3.58e-01

Pos logit contributions
ighth+1.001
fs+0.999
brates+0.975
umber+0.963
ibia+0.958
Neg logit contributions
aye-1.120
ersed-0.988
oing-0.938
astered-0.922
 tongues-0.916
Token school
Token ID1524
Feature activation-4.52e-01

Pos logit contributions
fs+1.592
ighth+1.586
brates+1.538
umber+1.530
ibia+1.518
Neg logit contributions
aye-1.634
ersed-1.443
oing-1.350
astered-1.345
yip-1.325
Token's
Token ID338
Feature activation-1.97e-01

Pos logit contributions
fs+1.915
ighth+1.886
brates+1.860
umber+1.819
ibia+1.816
Neg logit contributions
aye-2.160
ersed-1.914
oing-1.825
astered-1.803
 tongues-1.780
Token response
Token ID2882
Feature activation+3.15e-02

Pos logit contributions
fs+0.895
ighth+0.895
brates+0.873
umber+0.859
ibia+0.857
Neg logit contributions
aye-0.876
ersed-0.773
oing-0.725
astered-0.718
 tongues-0.707
Token,
Token ID11
Feature activation-3.21e-01

Pos logit contributions
aye+0.139
ersed+0.122
oing+0.115
 tongues+0.114
 hormones+0.113
Neg logit contributions
ighth-0.147
fs-0.146
brates-0.144
umber-0.142
ibia-0.141
Token and
Token ID290
Feature activation+1.36e-01

Pos logit contributions
ighth+1.487
fs+1.477
brates+1.434
umber+1.427
ibia+1.416
Neg logit contributions
aye-1.417
ersed-1.238
oing-1.171
astered-1.147
 tongues-1.143
Token his
Token ID465
Feature activation+2.47e-01

Pos logit contributions
aye+0.683
ersed+0.610
oing+0.580
 tongues+0.574
 hormones+0.572
Neg logit contributions
ighth-0.546
fs-0.544
brates-0.534
umber-0.524
ibia-0.521
Token dissatisfaction
Token ID41393
Feature activation-5.09e-01

Pos logit contributions
aye+1.116
ersed+0.985
oing+0.928
 tongues+0.918
 hormones+0.912
Neg logit contributions
ighth-1.118
fs-1.114
brates-1.094
umber-1.077
ibia-1.072
Token with
Token ID351
Feature activation+2.79e-02

Pos logit contributions
fs+2.064
ighth+2.044
brates+2.001
umber+1.977
ibia+1.958
Neg logit contributions
aye-2.536
ersed-2.250
oing-2.136
astered-2.098
 tongues-2.095
Token with
Token ID351
Feature activation+7.73e-02

Pos logit contributions
aye+0.527
ersed+0.470
oing+0.447
 tongues+0.442
 hormones+0.440
Neg logit contributions
ighth-0.428
fs-0.426
brates-0.418
umber-0.411
ibia-0.409
Token us
Token ID514
Feature activation+8.23e-01

Pos logit contributions
aye+0.347
ersed+0.306
oing+0.288
 tongues+0.285
 hormones+0.283
Neg logit contributions
ighth-0.353
fs-0.352
brates-0.346
umber-0.341
ibia-0.339
Token.
Token ID13
Feature activation+3.12e-02

Pos logit contributions
aye+3.951
ersed+3.539
 tongues+3.397
oing+3.378
 hormones+3.361
Neg logit contributions
ighth-3.384
fs-3.342
brates-3.332
FORE-3.227
ibia-3.216
Token There
Token ID1318
Feature activation+3.02e-01

Pos logit contributions
aye+0.144
ersed+0.128
oing+0.121
 tongues+0.120
 hormones+0.119
Neg logit contributions
ighth-0.138
fs-0.137
brates-0.135
umber-0.133
ibia-0.132
Token was
Token ID373
Feature activation-4.03e-01

Pos logit contributions
aye+1.245
ersed+1.084
oing+1.016
 tongues+1.005
 hormones+0.999
Neg logit contributions
ighth-1.492
fs-1.486
brates-1.465
umber-1.443
ibia-1.436
Token a
Token ID257
Feature activation-3.53e-01

Pos logit contributions
fs+1.852
ighth+1.847
brates+1.797
umber+1.793
ibia+1.770
Neg logit contributions
aye-1.780
ersed-1.580
oing-1.480
 tongues-1.452
 hormones-1.447
Token sequence
Token ID8379
Feature activation-7.19e-01

Pos logit contributions
fs+1.567
ighth+1.560
brates+1.521
umber+1.511
ibia+1.499
Neg logit contributions
aye-1.608
ersed-1.426
oing-1.344
astered-1.322
 tongues-1.320
Token set
Token ID900
Feature activation-1.04e-01

Pos logit contributions
fs+2.938
ighth+2.910
umber+2.842
brates+2.811
ibia+2.810
Neg logit contributions
aye-3.502
ersed-3.120
oing-2.950
 tongues-2.922
 hormones-2.898
Token in
Token ID287
Feature activation-1.65e-01

Pos logit contributions
ighth+0.486
fs+0.486
brates+0.477
umber+0.472
ibia+0.469
Neg logit contributions
aye-0.449
ersed-0.395
oing-0.369
 tongues-0.366
 hormones-0.365
Token a
Token ID257
Feature activation-2.58e-01

Pos logit contributions
ighth+0.747
fs+0.742
brates+0.727
umber+0.721
ibia+0.716
Neg logit contributions
aye-0.746
ersed-0.661
oing-0.624
 tongues-0.609
 hormones-0.607
Token mall
Token ID17374
Feature activation-7.38e-01

Pos logit contributions
fs+1.163
ighth+1.163
brates+1.134
umber+1.124
ibia+1.119
Neg logit contributions
aye-1.167
ersed-1.032
oing-0.974
 tongues-0.952
 hormones-0.950
Token:
Token ID25
Feature activation+1.68e-01

Pos logit contributions
fs+3.456
ighth+3.375
brates+3.347
FORE+3.263
umber+3.212
Neg logit contributions
aye-3.552
ersed-3.236
oing-2.980
 tongues-2.973
astered-2.959
Token
Token ID198
Feature activation+2.80e-01

Pos logit contributions
aye+0.786
ersed+0.697
oing+0.660
 tongues+0.653
 hormones+0.649
Neg logit contributions
ighth-0.736
fs-0.733
brates-0.719
umber-0.709
ibia-0.703
Token
Token ID198
Feature activation-5.88e-02

Pos logit contributions
aye+1.397
ersed+1.253
oing+1.179
 tongues+1.174
 hormones+1.170
Neg logit contributions
ighth-1.129
fs-1.105
brates-1.099
umber-1.073
ibia-1.069
Tokeni
Token ID72
Feature activation+3.06e-01

Pos logit contributions
ighth+0.248
fs+0.248
brates+0.241
umber+0.239
ibia+0.236
Neg logit contributions
aye-0.284
ersed-0.254
oing-0.241
 tongues-0.241
 hormones-0.239
TokenT
Token ID51
Feature activation-5.81e-01

Pos logit contributions
aye+1.547
ersed+1.385
oing+1.318
 tongues+1.305
 hormones+1.298
Neg logit contributions
ighth-1.225
fs-1.219
brates-1.196
umber-1.176
ibia-1.167
Tokenunes
Token ID4015
Feature activation-1.49e+00

Pos logit contributions
fs+2.009
ighth+1.982
ibia+1.906
umber+1.906
brates+1.904
Neg logit contributions
aye-3.230
ersed-2.923
oing-2.813
 tongues-2.779
 hormones-2.776
Token (
Token ID357
Feature activation+7.75e-01

Pos logit contributions
fs+5.298
ighth+5.139
cape+5.088
ibia+5.073
umber+5.032
Neg logit contributions
aye-7.616
ersed-7.000
oing-6.539
 tongues-6.367
astered-6.313
TokenHD
Token ID10227
Feature activation+6.01e-01

Pos logit contributions
aye+3.856
ersed+3.397
oing+3.262
astered+3.218
 tongues+3.197
Neg logit contributions
brates-3.051
ighth-3.039
fs-3.007
umber-2.926
ibia-2.926
Token)
Token ID8
Feature activation-1.52e-01

Pos logit contributions
aye+2.835
ersed+2.504
oing+2.377
 tongues+2.372
astered+2.360
Neg logit contributions
ighth-2.567
brates-2.534
fs-2.523
umber-2.471
ibia-2.455
Token |
Token ID930
Feature activation-7.89e-02

Pos logit contributions
ighth+0.614
fs+0.614
brates+0.596
umber+0.591
ibia+0.588
Neg logit contributions
aye-0.759
ersed-0.681
oing-0.644
 tongues-0.636
 hormones-0.633
Token iTunes
Token ID4830
Feature activation-1.69e+00

Pos logit contributions
fs+0.369
ighth+0.369
brates+0.361
umber+0.356
ibia+0.355
Neg logit contributions
aye-0.344
ersed-0.305
oing-0.287
 tongues-0.281
 hormones-0.280
Token medicine
Token ID9007
Feature activation-6.50e-01

Pos logit contributions
fs+4.093
ighth+4.074
brates+3.942
umber+3.926
ibia+3.911
Neg logit contributions
aye-3.702
ersed-3.253
yip-3.033
oing-2.991
astered-2.973
Token often
Token ID1690
Feature activation-7.62e-01

Pos logit contributions
fs+2.794
ighth+2.780
brates+2.711
ibia+2.675
umber+2.672
Neg logit contributions
aye-3.058
ersed-2.691
oing-2.553
astered-2.501
yip-2.497
Token did
Token ID750
Feature activation-3.85e-01

Pos logit contributions
ighth+3.214
fs+3.205
brates+3.114
ibia+3.061
umber+3.051
Neg logit contributions
aye-3.666
ersed-3.213
oing-3.065
astered-2.996
yip-2.993
Token more
Token ID517
Feature activation-9.88e-01

Pos logit contributions
ighth+1.718
fs+1.712
brates+1.682
umber+1.655
ibia+1.640
Neg logit contributions
aye-1.768
ersed-1.559
oing-1.466
 tongues-1.446
astered-1.436
Token harm
Token ID4419
Feature activation-1.13e+00

Pos logit contributions
fs+3.650
ighth+3.615
brates+3.546
umber+3.490
 else+3.453
Neg logit contributions
aye-5.108
ersed-4.708
yip-4.423
oing-4.368
astered-4.292
Token than
Token ID621
Feature activation-1.57e+00

Pos logit contributions
ighth+3.234
fs+3.211
umber+3.154
brates+3.125
ibia+2.967
Neg logit contributions
aye-6.768
ersed-6.256
yip-5.923
oing-5.887
 tongues-5.875
Token it
Token ID340
Feature activation-1.14e+00

Pos logit contributions
fs+4.301
ighth+4.199
brates+4.120
emis+4.064
 else+4.029
Neg logit contributions
aye-9.275
ersed-8.517
anchester-8.159
oing-8.157
urst-8.115
Token did
Token ID750
Feature activation-9.53e-01

Pos logit contributions
fs+4.443
ighth+4.396
brates+4.228
umber+4.212
ibia+4.202
Neg logit contributions
aye-5.685
ersed-5.088
yip-4.922
oing-4.829
astered-4.794
Token good
Token ID922
Feature activation-8.33e-01

Pos logit contributions
fs+3.152
ighth+3.075
brates+3.038
umber+2.966
ibia+2.961
Neg logit contributions
aye-5.350
ersed-4.847
oing-4.605
astered-4.574
yip-4.567
Token.
Token ID13
Feature activation-1.41e-02

Pos logit contributions
ighth+3.184
fs+3.176
brates+3.102
umber+3.100
ibia+3.035
Neg logit contributions
aye-4.253
ersed-3.835
oing-3.607
 tongues-3.579
yip-3.550
Token Mol
Token ID17958
Feature activation+1.84e+00

Pos logit contributions
ighth+0.058
fs+0.058
brates+0.057
umber+0.056
ibia+0.056
Neg logit contributions
aye-0.070
ersed-0.062
oing-0.059
 tongues-0.058
 hormones-0.058
Token to
Token ID284
Feature activation-9.49e-01

Pos logit contributions
fs+5.137
 else+5.060
umber+5.041
ighth+5.023
brates+5.004
Neg logit contributions
aye-6.856
ersed-6.105
yip-5.788
oing-5.778
astered-5.648
Token follow
Token ID1061
Feature activation-1.18e+00

Pos logit contributions
fs+3.765
ighth+3.685
brates+3.677
umber+3.672
ibia+3.562
Neg logit contributions
aye-4.630
ersed-4.207
yip-3.977
oing-3.959
interstitial-3.909
Token in
Token ID287
Feature activation-9.13e-01

Pos logit contributions
fs+4.698
umber+4.557
brates+4.539
ighth+4.531
ces+4.459
Neg logit contributions
aye-5.679
ersed-5.041
yip-4.919
oing-4.734
 hormones-4.668
Token the
Token ID262
Feature activation-6.20e-01

Pos logit contributions
fs+3.156
ighth+3.104
umber+3.044
brates+3.028
ibia+2.911
Neg logit contributions
aye-4.998
ersed-4.472
oing-4.254
 hormones-4.234
yip-4.232
Token same
Token ID976
Feature activation-3.81e-01

Pos logit contributions
fs+3.098
ighth+3.085
umber+3.031
brates+3.024
ibia+3.010
Neg logit contributions
aye-2.432
ersed-2.134
yip-1.961
oing-1.954
astered-1.923
Token footsteps
Token ID27146
Feature activation-1.89e+00

Pos logit contributions
fs+1.920
ighth+1.914
brates+1.884
umber+1.871
ibia+1.861
Neg logit contributions
aye-1.510
ersed-1.311
oing-1.216
yip-1.194
astered-1.190
Token.
Token ID13
Feature activation+1.27e-01

Pos logit contributions
fs+6.927
umber+6.834
 else+6.771
cape+6.613
ces+6.607
Neg logit contributions
aye-8.840
ersed-7.778
 tongues-7.569
yip-7.560
oing-7.411
Token He
Token ID679
Feature activation+1.07e-01

Pos logit contributions
aye+0.617
ersed+0.549
oing+0.521
 tongues+0.516
 hormones+0.514
Neg logit contributions
ighth-0.534
fs-0.532
brates-0.523
umber-0.513
ibia-0.510
Token taught
Token ID7817
Feature activation-6.19e-01

Pos logit contributions
aye+0.482
ersed+0.425
oing+0.401
 tongues+0.397
 hormones+0.394
Neg logit contributions
ighth-0.490
fs-0.488
brates-0.481
umber-0.473
ibia-0.470
Token a
Token ID257
Feature activation-1.17e-01

Pos logit contributions
fs+2.116
ighth+2.086
brates+2.076
umber+2.021
ibia+1.989
Neg logit contributions
aye-3.444
ersed-3.152
oing-2.943
astered-2.913
yip-2.907
Token free
Token ID1479
Feature activation-2.97e-02

Pos logit contributions
ighth+0.505
fs+0.504
brates+0.495
umber+0.487
ibia+0.484
Neg logit contributions
aye-0.556
ersed-0.495
oing-0.466
 hormones-0.461
 tongues-0.461
Token play
Token ID711
Feature activation+4.99e-01

Pos logit contributions
aye+2.635
ersed+2.332
 tongues+2.203
oing+2.198
 hormones+2.187
Neg logit contributions
ighth-2.539
fs-2.527
brates-2.484
umber-2.443
ibia-2.435
Token on
Token ID319
Feature activation+3.25e-01

Pos logit contributions
aye+2.310
ersed+2.060
oing+1.937
 tongues+1.929
 hormones+1.916
Neg logit contributions
ighth-2.200
fs-2.179
brates-2.145
umber-2.119
ibia-2.108
Token the
Token ID262
Feature activation+2.82e-01

Pos logit contributions
aye+1.443
ersed+1.273
oing+1.200
 tongues+1.194
 hormones+1.187
Neg logit contributions
ighth-1.497
fs-1.489
brates-1.469
umber-1.441
ibia-1.436
Token themes
Token ID13460
Feature activation-5.61e-01

Pos logit contributions
aye+1.304
ersed+1.154
oing+1.092
 tongues+1.083
 hormones+1.077
Neg logit contributions
ighth-1.245
fs-1.240
brates-1.221
umber-1.199
ibia-1.193
Token of
Token ID286
Feature activation+2.61e-01

Pos logit contributions
fs+2.695
ighth+2.678
brates+2.649
umber+2.611
ibia+2.601
Neg logit contributions
aye-2.335
ersed-2.037
oing-1.925
 tongues-1.864
astered-1.855
Token art
Token ID1242
Feature activation-1.51e+00

Pos logit contributions
aye+1.214
ersed+1.079
oing+1.020
 tongues+1.017
 hormones+1.015
Neg logit contributions
ighth-1.140
fs-1.136
brates-1.120
umber-1.099
ibia-1.095
Token,
Token ID11
Feature activation-4.86e-01

Pos logit contributions
fs+6.315
ighth+6.294
emis+6.198
brates+6.127
ibia+6.115
Neg logit contributions
aye-6.962
ersed-5.815
yip-5.696
oing-5.615
astered-5.519
Token nature
Token ID3450
Feature activation-1.34e+00

Pos logit contributions
ighth+2.116
fs+2.115
brates+2.077
umber+2.042
ibia+2.022
Neg logit contributions
aye-2.301
ersed-2.002
oing-1.887
astered-1.863
yip-1.819
Token and
Token ID290
Feature activation-4.26e-01

Pos logit contributions
ighth+5.553
fs+5.532
brates+5.457
ibia+5.393
cape+5.229
Neg logit contributions
aye-6.224
ersed-5.290
astered-5.119
oing-5.106
yip-5.093
Token diversity
Token ID9573
Feature activation-3.40e-02

Pos logit contributions
ighth+1.862
fs+1.854
brates+1.823
umber+1.785
ibia+1.771
Neg logit contributions
aye-2.013
ersed-1.763
oing-1.661
astered-1.634
 tongues-1.600
Token.
Token ID13
Feature activation+7.02e-01

Pos logit contributions
ighth+0.139
fs+0.139
brates+0.137
umber+0.134
ibia+0.134
Neg logit contributions
aye-0.168
ersed-0.149
oing-0.142
 tongues-0.140
 hormones-0.139
Token Facebook
Token ID3203
Feature activation-7.02e-01

Pos logit contributions
aye+0.184
ersed+0.158
oing+0.146
 tongues+0.143
 hormones+0.143
Neg logit contributions
fs-0.291
ighth-0.289
brates-0.286
ibia-0.281
umber-0.281
Token Share
Token ID8734
Feature activation-8.94e-01

Pos logit contributions
ighth+1.976
fs+1.950
brates+1.887
ibia+1.886
umber+1.854
Neg logit contributions
aye-4.307
ersed-4.020
oing-3.841
 hormones-3.779
 tongues-3.764
Token this
Token ID428
Feature activation+3.54e-01

Pos logit contributions
fs+4.210
ighth+4.177
brates+4.153
ibia+4.116
umber+4.102
Neg logit contributions
aye-3.783
ersed-3.503
oing-3.194
 tongues-3.141
ersion-3.101
Token comment
Token ID2912
Feature activation-3.46e-01

Pos logit contributions
aye+1.027
ersed+0.830
oing+0.751
 tongues+0.747
 hormones+0.731
Neg logit contributions
ighth-2.169
fs-2.160
brates-2.149
umber-2.107
ibia-2.105
Token on
Token ID319
Feature activation-7.64e-01

Pos logit contributions
fs+1.635
ighth+1.630
brates+1.596
umber+1.590
ibia+1.573
Neg logit contributions
aye-1.498
ersed-1.314
oing-1.245
 tongues-1.224
 hormones-1.219
Token Twitter
Token ID3009
Feature activation-1.95e+00

Pos logit contributions
fs+3.001
ighth+2.943
umber+2.884
ibia+2.852
brates+2.831
Neg logit contributions
aye-3.851
ersed-3.568
oing-3.331
yip-3.233
 hormones-3.216
Token
Token ID198
Feature activation-3.64e-01

Pos logit contributions
fs+6.550
umber+6.199
ighth+6.146
 else+6.116
brates+6.066
Neg logit contributions
aye-9.288
ersed-8.783
oing-8.549
yip-8.271
astered-8.261
Token
Token ID198
Feature activation-7.80e-01

Pos logit contributions
fs+1.282
ighth+1.258
brates+1.238
umber+1.219
ibia+1.212
Neg logit contributions
aye-2.027
ersed-1.824
oing-1.764
 tongues-1.735
 hormones-1.731
TokenNY
Token ID12805
Feature activation-3.84e-01

Pos logit contributions
fs+3.318
ighth+3.249
umber+3.191
brates+3.134
ibia+3.126
Neg logit contributions
aye-3.638
ersed-3.308
 tongues-3.187
 hormones-3.184
oing-3.132
TokenT
Token ID51
Feature activation-2.10e-01

Pos logit contributions
ighth+1.305
fs+1.291
FORE+1.235
umber+1.235
ibia+1.232
Neg logit contributions
aye-2.172
ersed-1.968
oing-1.887
 tongues-1.885
 hormones-1.880
Token Pick
Token ID12346
Feature activation-3.33e-01

Pos logit contributions
fs+1.080
ighth+1.076
brates+1.057
umber+1.055
ibia+1.047
Neg logit contributions
aye-0.812
ersed-0.719
oing-0.665
 hormones-0.657
 tongues-0.654