TOP ACTIVATIONS
MAX = 1.680

set aside . 2 . Steam the broccoli for
have it : 2 . Dis connect it and
proved Donkey Kong Country 2 : D iddy Kong
at week 4 ; ( 2 ) the real ta V
Ste lla - san collapsed ?」 " Yes .
sing - bird - of - pre y - cry stal
T AK EN DOWN UP ON PAR AM OUNT P ICT
into beat ings in Yar oslav l Colony No 1
She did things that even men in India didn
x 001 f 1 f 4 b be 66 IEEE 802
ation - rather - than - bre xit - is -
Stella 's body t umbled down . With her body in
administrative offices . Ichigo walked out of the Second Division Barr
aut om ation - rather - than - bre xit -
02 : 41 2 . Micro s atellite 03
x 001 f 1 f 4 b be 66 power off
only in its being , but also in its process .
18 59 . 11 ) When Dancing the Walt
Chern obyl . P fl ug be il also said some
a knife . 3 . In a large bowl

MIN ACTIVATIONS
MIN = -1.660

. 01 . 29 "] [ Round " 11 "] [
TS C OU NS EL : We can say
. im read ( image Path ) image = im utils
interesting session to say the least . I consider myself an
to address sab er - r att ling and the annexation
c v 2 . Ga ussian Bl ur ( gray ,
wary , to say the least . Some implanted
i - le $ art ext . Count ; $ i
2 . im read ( image Path ) image = im
a an Z ee "] [ Date " 2016 . 01
It 's basically a Hip st am atic for Android .
guise of Drew , Te ess ide Crown Court was told
I 'll Feel a Whole Lot Better " ( B yr
, _ P ulse Per iod ] to to rem ap
with a few nasty head f akes and shoulder dips ,
mass . It is , then , not at all unlikely
girls up to for prom isc uous behaviour .
s Format . Line Al ignment = [ system . draw
le Benn ati ( Radio Sh ack - Nissan ) s
So here it is , without further ado : one man

INTERVAL 0.845 - 1.680
CONTAINS 0.350%

4 { ( 0 s )} N f 6 { (
selfish request . Even if the God of this
firm that special ises in proxy battles in a move which
originally passed in 1970 ,[ 2 ] and is enforced by
seven heavens . 6 Daniel C .

INTERVAL 0.010 - 0.845
CONTAINS 48.117%

in an email to several colleagues , Sub prime
markets are in a death spiral and
Orange head , wasn t the Captain here just
I did manage to answer a few questions in the chat
of their own . His overhaul saw the dismissal of veterans

INTERVAL -0.825 - 0.010
CONTAINS 51.120%

the Washington Beer Mobile App for a few weeks : feedback
company , which has since been sold to the video start
Department , could be seen drawing his gun and yelling for
word ed editorial this week that Britain had " black mail
the Mid as Touch that turns everything into s *** .

INTERVAL -1.660 - -0.825
CONTAINS 0.412%

Man hattan ( 1979 ) Grade
] :: From Image ($ b mp ) $ SR =
away from Kirby 's bomb astic style of storytelling . According
Draw ing . Font ($ Font Name , $ FS 2
Far Away that stuck in the mind
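
The four INTERVAL ranges above appear to be equal-width quarters of the span from MIN to MAX. This is inferred from the printed numbers and is not stated by the dashboard itself. A minimal Python sketch (the names `MIN_ACT`, `MAX_ACT`, `CONTAINS_PCT`, and `equal_width_edges` are hypothetical) that reproduces the interval edges under that assumption:

```python
# Values copied from the MIN / MAX and CONTAINS lines above.
MIN_ACT = -1.660
MAX_ACT = 1.680
CONTAINS_PCT = [0.350, 48.117, 51.120, 0.412]  # top interval first


def equal_width_edges(lo, hi, n_bins):
    """Return n_bins + 1 edges that split [lo, hi] into equal-width bins.

    Assumption: the dashboard bins activations this way; the dump only
    shows the resulting edges, not how they were computed.
    """
    width = (hi - lo) / n_bins
    # Round to three decimals to match the precision printed in the dump.
    return [round(lo + i * width, 3) for i in range(n_bins + 1)]


edges = equal_width_edges(MIN_ACT, MAX_ACT, 4)
total_pct = round(sum(CONTAINS_PCT), 3)

print(edges)      # interval boundaries, lowest first
print(total_pct)  # share of samples covered by the four intervals
```

If the assumption holds, the computed edges match the printed INTERVAL bounds, and the CONTAINS percentages sum to just under 100, with the small remainder plausibly due to rounding.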
Token set
Token ID900
Feature activation-5.37e-01

Pos logit contributions
avorite+1.231
ucc+1.220
ury+1.176
avour+1.145
ÃÂ+1.107
Neg logit contributions
peat-1.047
beit-0.920
wide-0.877
 primates-0.823
 elect-0.804
Token aside
Token ID7263
Feature activation+7.32e-02

Pos logit contributions
ucc+3.515
avorite+3.477
ury+3.363
avour+3.312
ÃÂ+3.166
Neg logit contributions
peat-2.565
beit-2.219
wide-2.105
 primates-1.972
 elect-1.936
Token.
Token ID13
Feature activation+1.74e-02

Pos logit contributions
peat+0.406
beit+0.358
wide+0.344
 primates+0.326
 elect+0.324
Neg logit contributions
avorite-0.439
ucc-0.429
ury-0.416
avour-0.402
ÃÂ-0.396
Token
Token ID198
Feature activation-1.10e-02

Pos logit contributions
peat+0.103
beit+0.092
wide+0.089
 primates+0.084
 elect+0.084
Neg logit contributions
avorite-0.099
ucc-0.095
ury-0.092
avour-0.089
ÃÂ-0.088
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
avorite+0.073
ucc+0.070
ury+0.068
avour+0.066
ÃÂ+0.066
Neg logit contributions
peat-0.056
beit-0.048
wide-0.047
 primates-0.043
 elect-0.043
Token2
Token ID17
Feature activation+1.68e+00

Pos logit contributions
avorite+0.420
ucc+0.406
ury+0.385
avour+0.365
ÃÂ+0.351
Neg logit contributions
peat-0.908
beit-0.834
wide-0.811
 primates-0.786
 elect-0.782
Token.
Token ID13
Feature activation+2.01e-01

Pos logit contributions
peat+6.633
wide+5.775
beit+5.513
wise+4.885
 elect+4.533
Neg logit contributions
avorite-13.238
ÃÂ-12.619
-11.501
-11.477
ThumbnailImage-11.446
Token Steam
Token ID9094
Feature activation-3.88e-01

Pos logit contributions
peat+1.289
beit+1.160
wide+1.133
 elect+1.080
 primates+1.067
Neg logit contributions
avorite-1.048
ucc-0.993
ury-0.963
ÃÂ-0.932
avour-0.922
Token the
Token ID262
Feature activation-2.29e-01

Pos logit contributions
avorite+2.389
ucc+2.370
ury+2.297
avour+2.242
ÃÂ+2.176
Neg logit contributions
peat-2.029
beit-1.807
wide-1.723
 primates-1.621
 elect-1.584
Token broccoli
Token ID44653
Feature activation-6.41e-01

Pos logit contributions
avorite+1.455
ucc+1.428
ury+1.383
avour+1.346
ÃÂ+1.319
Neg logit contributions
peat-1.177
beit-1.031
wide-0.983
 primates-0.924
 elect-0.909
Token for
Token ID329
Feature activation-2.59e-01

Pos logit contributions
ucc+3.830
avorite+3.779
ury+3.649
avour+3.552
ÃÂ+3.406
Neg logit contributions
peat-3.427
beit-3.023
wide-2.894
 primates-2.738
 elect-2.631
Token have
Token ID423
Feature activation+1.79e-01

Pos logit contributions
peat+0.162
beit+0.143
wide+0.139
 elect+0.133
 primates+0.132
Neg logit contributions
avorite-0.168
ucc-0.163
ury-0.157
avour-0.152
ÃÂ-0.152
Token it
Token ID340
Feature activation+1.79e-01

Pos logit contributions
peat+0.887
beit+0.769
wide+0.738
 primates+0.694
 elect+0.693
Neg logit contributions
avorite-1.180
ucc-1.153
ury-1.120
avour-1.090
ÃÂ-1.078
Token:
Token ID25
Feature activation+3.11e-01

Pos logit contributions
peat+0.953
beit+0.836
wide+0.805
 elect+0.759
 primates+0.757
Neg logit contributions
avorite-1.123
ucc-1.094
ury-1.061
avour-1.030
ÃÂ-1.016
Token
Token ID198
Feature activation+2.97e-01

Pos logit contributions
peat+1.666
beit+1.457
wide+1.408
 elect+1.339
 primates+1.325
Neg logit contributions
avorite-1.926
ucc-1.876
ury-1.812
avour-1.761
ÃÂ-1.753
Token
Token ID198
Feature activation+9.69e-02

Pos logit contributions
peat+1.427
beit+1.226
wide+1.208
 elect+1.094
 primates+1.086
Neg logit contributions
avorite-2.070
ucc-1.953
ÃÂ-1.909
ury-1.895
avour-1.854
Token2
Token ID17
Feature activation+1.52e+00

Pos logit contributions
peat+0.690
beit+0.624
wide+0.609
 primates+0.578
 elect+0.578
Neg logit contributions
avorite-0.434
ucc-0.415
ury-0.397
avour-0.380
ÃÂ-0.380
Token.
Token ID13
Feature activation+2.05e-01

Pos logit contributions
peat+6.462
wide+5.643
beit+5.484
wise+4.791
 elect+4.695
Neg logit contributions
avorite-11.462
ÃÂ-10.656
ucc-9.887
ury-9.529
avour-9.467
Token Dis
Token ID3167
Feature activation-4.10e-01

Pos logit contributions
peat+1.359
beit+1.227
wide+1.198
 elect+1.150
 primates+1.129
Neg logit contributions
avorite-1.014
ucc-0.969
ury-0.925
ÃÂ-0.900
avour-0.894
Tokenconnect
Token ID8443
Feature activation-3.66e-01

Pos logit contributions
avorite+1.445
ucc+1.436
ury+1.336
avour+1.309
ÃÂ+1.171
Neg logit contributions
peat-3.210
beit-2.957
wide-2.874
 primates-2.824
 elect-2.757
Token it
Token ID340
Feature activation-3.50e-01

Pos logit contributions
avorite+2.203
ucc+2.161
ury+2.109
avour+2.035
ÃÂ+1.966
Neg logit contributions
peat-1.998
beit-1.756
wide-1.688
 primates-1.613
 elect-1.564
Token and
Token ID290
Feature activation-1.38e-01

Pos logit contributions
avorite+1.985
ucc+1.944
ury+1.887
avour+1.820
ÃÂ+1.782
Neg logit contributions
peat-2.040
beit-1.808
wide-1.739
 primates-1.660
 elect-1.640
Token proved
Token ID8302
Feature activation-5.12e-01

Pos logit contributions
avorite+1.940
ucc+1.906
ury+1.860
avour+1.793
ÃÂ+1.751
Neg logit contributions
peat-1.640
beit-1.450
wide-1.350
 primates-1.276
 elect-1.261
Token Donkey
Token ID43823
Feature activation+2.26e-01

Pos logit contributions
avorite+3.273
ucc+3.215
ury+3.155
avour+3.061
ÃÂ+2.968
Neg logit contributions
peat-2.508
beit-2.202
wide-2.063
 primates-1.935
 elect-1.855
Token Kong
Token ID9071
Feature activation+2.91e-01

Pos logit contributions
peat+1.339
beit+1.186
wide+1.149
 primates+1.093
 elect+1.085
Neg logit contributions
avorite-1.295
ucc-1.253
ury-1.202
avour-1.159
ÃÂ-1.150
Token Country
Token ID12946
Feature activation+2.46e-01

Pos logit contributions
peat+2.155
beit+1.965
wide+1.910
 primates+1.839
 elect+1.830
Neg logit contributions
avorite-1.201
ucc-1.161
ury-1.108
avour-1.057
ÃÂ-1.031
Token 2
Token ID362
Feature activation+7.34e-01

Pos logit contributions
peat+1.431
beit+1.270
wide+1.225
 primates+1.165
 elect+1.157
Neg logit contributions
avorite-1.413
ucc-1.378
ury-1.333
avour-1.289
ÃÂ-1.267
Token:
Token ID25
Feature activation+1.51e+00

Pos logit contributions
peat+3.876
beit+3.419
wide+3.292
 elect+3.096
 primates+3.093
Neg logit contributions
avorite-4.560
ucc-4.424
ury-4.291
avour-4.156
ÃÂ-4.038
Token D
Token ID360
Feature activation+5.62e-01

Pos logit contributions
peat+6.762
beit+5.558
wide+5.504
 primates+5.082
 elect+5.035
Neg logit contributions
avorite-10.320
ucc-10.119
ury-9.724
ÃÂ-9.436
avour-9.415
Tokeniddy
Token ID34208
Feature activation-1.62e-01

Pos logit contributions
peat+2.217
beit+1.832
wide+1.737
 primates+1.570
 elect+1.562
Neg logit contributions
avorite-4.329
ucc-4.197
ury-4.036
ÃÂ-4.016
avour-3.971
Token Kong
Token ID9071
Feature activation+1.36e-01

Pos logit contributions
avorite+0.908
ucc+0.890
ury+0.870
avour+0.841
ÃÂ+0.825
Neg logit contributions
peat-0.942
beit-0.844
wide-0.804
 primates-0.769
 elect-0.764
Token
Token ID447
Feature activation-4.40e-01

Pos logit contributions
peat+0.796
beit+0.710
wide+0.679
 primates+0.646
 elect+0.646
Neg logit contributions
avorite-0.769
ucc-0.755
ury-0.735
avour-0.709
ÃÂ-0.698
Token
Token ID247
Feature activation-2.45e-01

Pos logit contributions
avorite+1.876
ucc+1.839
ury+1.766
avour+1.672
ÃÂ+1.632
Neg logit contributions
peat-3.164
beit-2.881
wide-2.793
 primates-2.690
 elect-2.675
Token at
Token ID379
Feature activation-1.86e-01

Pos logit contributions
avorite+2.604
ucc+2.586
ury+2.493
avour+2.414
ÃÂ+2.336
Neg logit contributions
peat-2.427
beit-2.136
wide-2.017
 primates-1.924
 elect-1.874
Token week
Token ID1285
Feature activation-2.86e-01

Pos logit contributions
avorite+1.148
ucc+1.128
ury+1.092
avour+1.059
ÃÂ+1.037
Neg logit contributions
peat-0.989
beit-0.865
wide-0.826
 primates-0.782
 elect-0.767
Token 4
Token ID604
Feature activation-1.03e-01

Pos logit contributions
avorite+1.588
ucc+1.560
ury+1.506
avour+1.457
ÃÂ+1.416
Neg logit contributions
peat-1.694
beit-1.505
wide-1.440
 primates-1.376
 elect-1.349
Token;
Token ID26
Feature activation+4.22e-01

Pos logit contributions
avorite+0.620
ucc+0.609
ury+0.590
avour+0.571
ÃÂ+0.560
Neg logit contributions
peat-0.572
beit-0.503
wide-0.482
 primates-0.458
 elect-0.453
Token (
Token ID357
Feature activation+8.47e-02

Pos logit contributions
peat+2.252
beit+1.989
wide+1.910
 primates+1.807
 elect+1.800
Neg logit contributions
avorite-2.611
ucc-2.545
ury-2.469
avour-2.393
ÃÂ-2.364
Token2
Token ID17
Feature activation+1.50e+00

Pos logit contributions
peat+0.647
beit+0.593
wide+0.584
 primates+0.547
 elect+0.545
Neg logit contributions
avorite-0.346
ucc-0.315
ury-0.303
ÃÂ-0.301
avour-0.288
Token)
Token ID8
Feature activation+6.18e-01

Pos logit contributions
peat+4.186
wide+3.542
beit+3.281
wise+2.694
 elect+2.454
Neg logit contributions
avorite-13.551
ÃÂ-12.758
ucc-11.880
avour-11.691
ury-11.681
Token the
Token ID262
Feature activation-1.65e-01

Pos logit contributions
peat+3.026
beit+2.694
wide+2.590
 elect+2.503
 primates+2.463
Neg logit contributions
avorite-4.047
ucc-3.938
ury-3.821
ÃÂ-3.742
avour-3.724
Token real
Token ID1103
Feature activation-4.96e-01

Pos logit contributions
avorite+0.996
ucc+0.975
ury+0.946
avour+0.916
ÃÂ+0.897
Neg logit contributions
peat-0.910
beit-0.798
wide-0.766
 primates-0.726
 elect-0.718
Token ta
Token ID20486
Feature activation-3.88e-01

Pos logit contributions
avorite+2.934
ucc+2.883
ury+2.826
avour+2.710
ÃÂ+2.630
Neg logit contributions
peat-2.725
beit-2.378
wide-2.259
 primates-2.140
 elect-2.085
TokenV
Token ID53
Feature activation-1.27e-01

Pos logit contributions
avorite+1.938
ucc+1.896
ury+1.827
avour+1.755
ÃÂ+1.703
Neg logit contributions
peat-2.524
beit-2.270
wide-2.193
 primates-2.106
 elect-2.090
TokenSte
Token ID7447
Feature activation-4.15e-01

Pos logit contributions
peat+1.869
beit+1.708
wide+1.667
 primates+1.551
 elect+1.548
Neg logit contributions
avorite-1.440
ucc-1.339
ury-1.292
ÃÂ-1.261
avour-1.243
Tokenlla
Token ID8466
Feature activation-4.42e-01

Pos logit contributions
avorite+1.281
ucc+1.275
ury+1.230
avour+1.145
uri+1.023
Neg logit contributions
peat-3.425
beit-3.169
wide-3.066
 primates-3.019
 elect-2.995
Token-
Token ID12
Feature activation-5.78e-02

Pos logit contributions
avorite+2.598
ucc+2.561
ury+2.501
avour+2.418
ÃÂ+2.350
Neg logit contributions
peat-2.438
beit-2.155
wide-2.075
 primates-1.973
 elect-1.938
Tokensan
Token ID12807
Feature activation-2.39e-01

Pos logit contributions
avorite+0.354
ucc+0.345
ury+0.334
avour+0.324
ÃÂ+0.320
Neg logit contributions
peat-0.314
beit-0.277
wide-0.266
 primates-0.251
 elect-0.250
Token collapsed
Token ID14707
Feature activation+4.96e-01

Pos logit contributions
avorite+1.413
ucc+1.387
ury+1.347
avour+1.302
ÃÂ+1.278
Neg logit contributions
peat-1.331
beit-1.172
wide-1.129
 primates-1.075
 elect-1.062
Token?」
Token ID42943
Feature activation+1.48e+00

Pos logit contributions
peat+2.555
beit+2.203
wide+2.141
 elect+2.042
 candid+2.009
Neg logit contributions
avorite-3.176
ucc-3.084
ury-2.939
avour-2.891
ÃÂ-2.866
Token
Token ID198
Feature activation+6.66e-01

Pos logit contributions
peat+6.641
beit+5.877
wide+5.739
 elect+5.285
 candid+5.201
Neg logit contributions
avorite-9.671
ucc-9.327
ury-9.086
ÃÂ-8.893
avour-8.846
Token
Token ID198
Feature activation+6.09e-01

Pos logit contributions
peat+2.919
wide+2.531
beit+2.497
 elect+2.197
wise+2.167
Neg logit contributions
avorite-5.022
ÃÂ-4.667
ucc-4.566
ury-4.437
avour-4.365
Token"
Token ID1
Feature activation+2.84e-01

Pos logit contributions
peat+3.397
beit+3.025
wide+2.967
 elect+2.678
wise+2.660
Neg logit contributions
avorite-3.680
ucc-3.428
ury-3.350
ÃÂ-3.322
avour-3.238
TokenYes
Token ID5297
Feature activation-1.06e-01

Pos logit contributions
peat+1.889
beit+1.708
wide+1.674
 elect+1.559
 primates+1.554
Neg logit contributions
avorite-1.428
ucc-1.319
ury-1.278
ÃÂ-1.252
avour-1.235
Token.
Token ID13
Feature activation+4.33e-01

Pos logit contributions
avorite+0.594
ucc+0.582
ury+0.563
avour+0.543
ÃÂ+0.533
Neg logit contributions
peat-0.630
beit-0.558
wide-0.539
 primates-0.514
 elect-0.509
Tokensing
Token ID12215
Feature activation-2.02e-01

Pos logit contributions
peat+1.984
beit+1.765
wide+1.703
 primates+1.621
 elect+1.611
Neg logit contributions
avorite-1.873
ucc-1.826
ury-1.766
avour-1.706
ÃÂ-1.677
Token-
Token ID12
Feature activation+3.68e-01

Pos logit contributions
avorite+1.460
ucc+1.451
ury+1.425
avour+1.374
uri+1.323
Neg logit contributions
peat-0.839
beit-0.730
wide-0.658
 primates-0.653
 candid-0.635
Tokenbird
Token ID16944
Feature activation+1.14e-01

Pos logit contributions
peat+2.167
beit+1.935
wide+1.860
 primates+1.788
 elect+1.770
Neg logit contributions
avorite-2.069
ucc-2.024
ury-1.959
avour-1.893
ÃÂ-1.835
Token-
Token ID12
Feature activation+5.65e-01

Pos logit contributions
peat+0.432
beit+0.361
wide+0.335
 primates+0.314
 elect+0.308
Neg logit contributions
avorite-0.868
ucc-0.861
ury-0.842
avour-0.819
ÃÂ-0.796
Tokenof
Token ID1659
Feature activation+2.98e-01

Pos logit contributions
peat+3.596
beit+3.239
wide+3.123
 primates+3.005
 elect+2.980
Neg logit contributions
avorite-2.917
ucc-2.839
ury-2.741
avour-2.637
ÃÂ-2.563
Token-
Token ID12
Feature activation+1.46e+00

Pos logit contributions
peat+1.083
beit+0.881
wide+0.832
 primates+0.751
 elect+0.747
Neg logit contributions
avorite-2.376
ucc-2.320
ury-2.260
avour-2.213
ÃÂ-2.209
Tokenpre
Token ID3866
Feature activation+1.70e-01

Pos logit contributions
peat+7.145
beit+5.708
wide+5.662
wise+5.169
 elect+4.962
Neg logit contributions
avorite-10.070
ÃÂ-9.796
ucc-9.302
Magikarp-8.993
ury-8.881
Tokeny
Token ID88
Feature activation-5.60e-01

Pos logit contributions
peat+0.704
beit+0.585
wide+0.561
 elect+0.509
 primates+0.508
Neg logit contributions
avorite-1.278
ucc-1.241
ury-1.200
ÃÂ-1.192
avour-1.180
Token-
Token ID12
Feature activation+3.39e-01

Pos logit contributions
avorite+3.836
ucc+3.794
ury+3.747
avour+3.601
uri+3.454
Neg logit contributions
peat-2.443
beit-2.131
wide-1.963
 primates-1.923
 candid-1.823
Tokencry
Token ID20470
Feature activation-1.23e-01

Pos logit contributions
peat+2.051
beit+1.824
wide+1.766
 primates+1.674
 elect+1.667
Neg logit contributions
avorite-1.861
ucc-1.810
ury-1.747
avour-1.688
ÃÂ-1.671
Tokenstal
Token ID7757
Feature activation-1.07e-01

Pos logit contributions
avorite+0.795
ucc+0.783
ury+0.759
avour+0.735
ÃÂ+0.709
Neg logit contributions
peat-0.616
beit-0.542
wide-0.504
 primates-0.488
 elect-0.483
Token T
Token ID309
Feature activation+1.29e-01

Pos logit contributions
avorite+0.076
ucc+0.074
ury+0.071
avour+0.069
ÃÂ+0.068
Neg logit contributions
peat-0.072
beit-0.064
wide-0.062
 primates-0.058
 elect-0.058
TokenAK
Token ID10206
Feature activation-5.67e-01

Pos logit contributions
peat+0.740
beit+0.655
wide+0.631
 primates+0.600
 elect+0.596
Neg logit contributions
avorite-0.747
ucc-0.729
ury-0.705
avour-0.683
ÃÂ-0.672
TokenEN
Token ID1677
Feature activation+3.01e-02

Pos logit contributions
ucc+2.476
avorite+2.471
ury+2.371
avour+2.256
uri+2.113
Neg logit contributions
peat-3.984
beit-3.584
wide-3.451
 primates-3.381
 elect-3.331
Token DOWN
Token ID30320
Feature activation+3.29e-01

Pos logit contributions
peat+0.168
beit+0.148
wide+0.143
 primates+0.135
 elect+0.134
Neg logit contributions
avorite-0.180
ucc-0.175
ury-0.170
avour-0.165
ÃÂ-0.162
Token UP
Token ID15958
Feature activation+3.05e-01

Pos logit contributions
peat+2.113
beit+1.907
wide+1.869
 elect+1.764
 primates+1.760
Neg logit contributions
avorite-1.672
ucc-1.608
ury-1.551
avour-1.496
ÃÂ-1.480
TokenON
Token ID1340
Feature activation+1.45e+00

Pos logit contributions
peat+2.054
beit+1.856
wide+1.809
 primates+1.723
 elect+1.721
Neg logit contributions
avorite-1.463
ucc-1.412
ury-1.358
avour-1.306
ÃÂ-1.285
Token PAR
Token ID29463
Feature activation+4.47e-01

Pos logit contributions
peat+7.146
beit+6.249
wide+6.139
 elect+5.747
 primates+5.691
Neg logit contributions
avorite-9.105
ucc-8.665
ury-8.437
ÃÂ-8.266
avour-8.192
TokenAM
Token ID2390
Feature activation+7.40e-01

Pos logit contributions
peat+2.517
beit+2.265
wide+2.212
 elect+2.018
 primates+1.997
Neg logit contributions
avorite-2.744
ÃÂ-2.524
ucc-2.521
ury-2.414
avour-2.381
TokenOUNT
Token ID28270
Feature activation+7.95e-01

Pos logit contributions
peat+3.335
beit+2.877
wide+2.805
 elect+2.505
 rats+2.432
Neg logit contributions
avorite-5.227
ucc-4.989
ÃÂ-4.867
ury-4.754
avour-4.703
Token P
Token ID350
Feature activation+2.02e-01

Pos logit contributions
peat+4.274
beit+3.766
wide+3.691
 primates+3.466
 elect+3.440
Neg logit contributions
avorite-4.853
ucc-4.655
ury-4.517
avour-4.418
ÃÂ-4.396
TokenICT
Token ID18379
Feature activation-2.52e-01

Pos logit contributions
peat+1.386
beit+1.261
wide+1.231
 elect+1.164
 primates+1.163
Neg logit contributions
avorite-0.964
ucc-0.905
ÃÂ-0.862
ury-0.858
avour-0.831
Token into
Token ID656
Feature activation-1.29e-01

Pos logit contributions
peat+1.765
beit+1.553
wide+1.509
 primates+1.423
 elect+1.413
Neg logit contributions
avorite-1.995
ucc-1.944
ury-1.877
avour-1.826
ÃÂ-1.801
Token beat
Token ID4405
Feature activation+2.90e-01

Pos logit contributions
avorite+0.794
ucc+0.779
ury+0.763
avour+0.733
ÃÂ+0.715
Neg logit contributions
peat-0.698
beit-0.606
wide-0.581
 primates-0.545
 elect-0.538
Tokenings
Token ID654
Feature activation-8.35e-03

Pos logit contributions
peat+1.483
beit+1.294
wide+1.241
 primates+1.170
 elect+1.161
Neg logit contributions
avorite-1.861
ucc-1.819
ury-1.766
avour-1.715
ÃÂ-1.690
Token in
Token ID287
Feature activation+2.34e-01

Pos logit contributions
avorite+0.051
ucc+0.050
ury+0.049
avour+0.047
ÃÂ+0.046
Neg logit contributions
peat-0.045
beit-0.039
wide-0.038
 primates-0.036
 elect-0.035
Token Yar
Token ID45184
Feature activation+2.66e-01

Pos logit contributions
peat+1.257
beit+1.104
wide+1.060
 primates+1.004
 elect+0.997
Neg logit contributions
avorite-1.449
ucc-1.416
ury-1.373
avour-1.332
ÃÂ-1.312
Tokenoslav
Token ID26388
Feature activation+1.45e+00

Pos logit contributions
peat+1.065
beit+0.899
wide+0.856
 elect+0.779
 primates+0.773
Neg logit contributions
avorite-2.013
ucc-1.949
ury-1.890
ÃÂ-1.867
avour-1.856
Tokenl
Token ID75
Feature activation+2.97e-02

Pos logit contributions
peat+5.533
beit+5.081
wide+4.854
 elect+4.077
 rats+4.066
Neg logit contributions
avorite-11.082
ÃÂ-10.070
ucc-10.063
ury-9.735
avour-9.526
Token Colony
Token ID34480
Feature activation+3.40e-01

Pos logit contributions
peat+0.168
beit+0.148
wide+0.142
 primates+0.135
 elect+0.134
Neg logit contributions
avorite-0.174
ucc-0.171
ury-0.166
avour-0.160
ÃÂ-0.158
Token No
Token ID1400
Feature activation-2.68e-01

Pos logit contributions
peat+1.844
beit+1.629
wide+1.582
 primates+1.491
 elect+1.484
Neg logit contributions
avorite-2.087
ucc-2.013
ury-1.950
avour-1.898
ÃÂ-1.866
Token 1
Token ID352
Feature activation+2.15e-01

Pos logit contributions
avorite+1.444
ucc+1.424
ury+1.386
avour+1.336
ÃÂ+1.305
Neg logit contributions
peat-1.638
beit-1.451
wide-1.389
 primates-1.337
 elect-1.313
Token
Token ID851
Feature activation+6.73e-02

Pos logit contributions
peat+1.190
beit+1.050
wide+1.011
 primates+0.958
 elect+0.952
Neg logit contributions
avorite-1.288
ucc-1.257
ury-1.218
avour-1.180
ÃÂ-1.161
Token
Token ID564
Feature activation-9.53e-02

Pos logit contributions
peat+0.736
beit+0.649
wide+0.624
 primates+0.590
 elect+0.585
Neg logit contributions
avorite-0.844
ucc-0.821
ury-0.795
avour-0.772
ÃÂ-0.761
Token
Token ID250
Feature activation+4.83e-03

Pos logit contributions
avorite+0.420
ucc+0.406
ury+0.392
avour+0.373
ÃÂ+0.368
Neg logit contributions
peat-0.677
beit-0.612
wide-0.596
 primates-0.573
 elect-0.571
TokenShe
Token ID3347
Feature activation+2.33e-01

Pos logit contributions
peat+0.033
beit+0.030
wide+0.029
 primates+0.028
 elect+0.028
Neg logit contributions
avorite-0.023
ucc-0.022
ury-0.021
ÃÂ-0.020
avour-0.020
Token did
Token ID750
Feature activation+4.24e-01

Pos logit contributions
peat+1.261
beit+1.118
wide+1.075
 elect+1.021
 primates+1.015
Neg logit contributions
avorite-1.422
ucc-1.382
ury-1.329
avour-1.295
ÃÂ-1.274
Token things
Token ID1243
Feature activation+5.97e-01

Pos logit contributions
peat+2.081
beit+1.810
wide+1.729
 primates+1.645
 elect+1.643
Neg logit contributions
avorite-2.798
ucc-2.731
ury-2.660
avour-2.580
ÃÂ-2.550
Token that
Token ID326
Feature activation+1.43e+00

Pos logit contributions
peat+2.716
beit+2.308
wide+2.218
 primates+2.111
 elect+2.099
Neg logit contributions
avorite-4.147
ucc-4.052
ury-3.913
ÃÂ-3.809
avour-3.808
Token even
Token ID772
Feature activation+6.71e-01

Pos logit contributions
peat+6.556
beit+5.758
wide+5.536
 elect+5.287
 primates+5.232
Neg logit contributions
ucc-9.052
avorite-9.034
ury-8.513
avour-8.376
ÃÂ-8.293
Token men
Token ID1450
Feature activation+1.15e+00

Pos logit contributions
peat+3.375
beit+2.942
wide+2.826
 primates+2.688
 elect+2.654
Neg logit contributions
avorite-4.333
ucc-4.246
ury-4.104
avour-4.000
ÃÂ-3.959
Token in
Token ID287
Feature activation+5.81e-01

Pos logit contributions
peat+4.572
beit+3.783
wide+3.627
 primates+3.456
 elect+3.433
Neg logit contributions
avorite-8.397
ucc-8.305
ury-8.005
avour-7.841
ÃÂ-7.740
Token India
Token ID3794
Feature activation+7.87e-01

Pos logit contributions
peat+3.171
beit+2.797
wide+2.701
 primates+2.609
 elect+2.598
Neg logit contributions
avorite-3.551
ucc-3.404
ÃÂ-3.232
ury-3.221
avour-3.166
Token didn
Token ID1422
Feature activation+2.22e-01

Pos logit contributions
peat+3.413
beit+2.912
wide+2.803
 primates+2.619
 elect+2.614
Neg logit contributions
avorite-5.557
ucc-5.457
ury-5.304
avour-5.148
ÃÂ-5.080
Tokenx
Token ID87
Feature activation-7.73e-01

Pos logit contributions
avorite+1.143
ucc+1.142
ury+1.083
avour+1.012
ÃÂ+0.936
Neg logit contributions
peat-2.595
beit-2.383
wide-2.302
 primates-2.263
 elect-2.243
Token001
Token ID8298
Feature activation+3.35e-01

Pos logit contributions
avorite+3.147
ucc+3.015
ury+2.905
avour+2.722
ÃÂ+2.534
Neg logit contributions
peat-5.616
beit-5.118
wide-4.943
 primates-4.927
 elect-4.785
Tokenf
Token ID69
Feature activation-4.80e-02

Pos logit contributions
peat+2.117
beit+1.894
wide+1.830
 primates+1.755
 elect+1.741
Neg logit contributions
avorite-1.754
ucc-1.709
ury-1.649
avour-1.586
ÃÂ-1.554
Token1
Token ID16
Feature activation+9.92e-01

Pos logit contributions
avorite+0.205
ucc+0.200
ury+0.191
avour+0.183
ÃÂ+0.174
Neg logit contributions
peat-0.349
beit-0.316
wide-0.307
 primates-0.298
 elect-0.295
Tokenf
Token ID69
Feature activation+4.59e-01

Pos logit contributions
peat+5.280
beit+4.779
wide+4.665
 elect+4.211
 primates+4.158
Neg logit contributions
avorite-6.156
ÃÂ-5.726
ucc-5.587
ury-5.452
avour-5.414
Token4
Token ID19
Feature activation+1.41e+00

Pos logit contributions
peat+3.237
beit+2.936
wide+2.851
 primates+2.736
 elect+2.724
Neg logit contributions
avorite-2.062
ucc-1.995
ury-1.912
avour-1.832
ÃÂ-1.795
Tokenb
Token ID65
Feature activation+1.01e+00

Pos logit contributions
peat+6.427
beit+5.648
wide+5.346
 elect+4.722
 primates+4.714
Neg logit contributions
avorite-9.479
ucc-8.956
ury-8.916
ÃÂ-8.737
avour-8.664
Tokenbe
Token ID1350
Feature activation+3.86e-01

Pos logit contributions
peat+3.803
beit+3.238
wide+2.934
 elect+2.509
wise+2.409
Neg logit contributions
avorite-8.068
ÃÂ-7.703
ucc-7.408
ury-7.296
Magikarp-7.261
Token66
Token ID2791
Feature activation+2.07e-03

Pos logit contributions
peat+3.048
beit+2.795
wide+2.726
 primates+2.629
 elect+2.621
Neg logit contributions
avorite-1.403
ucc-1.346
ury-1.278
avour-1.211
ÃÂ-1.179
Token IEEE
Token ID40552
Feature activation-5.29e-02

Pos logit contributions
peat+0.012
beit+0.010
wide+0.010
 primates+0.009
 elect+0.009
Neg logit contributions
avorite-0.012
ucc-0.012
ury-0.012
avour-0.011
ÃÂ-0.011
Token 802
Token ID33121
Feature activation+5.13e-01

Pos logit contributions
avorite+0.254
ucc+0.247
ury+0.237
avour+0.228
ÃÂ+0.223
Neg logit contributions
peat-0.356
beit-0.321
wide-0.311
 primates-0.299
 elect-0.297
Tokenation
Token ID341
Feature activation+2.68e-01

Pos logit contributions
ucc+2.274
avorite+2.224
ury+2.129
avour+2.054
ÃÂ+1.889
Neg logit contributions
peat-3.792
beit-3.455
wide-3.350
 primates-3.271
 candid-3.209
Token-
Token ID12
Feature activation+9.63e-01

Pos logit contributions
peat+0.970
beit+0.797
wide+0.752
 elect+0.679
 primates+0.677
Neg logit contributions
avorite-2.132
ucc-2.084
ury-2.032
avour-1.985
ÃÂ-1.983
Tokenrather
Token ID34330
Feature activation+5.49e-01

Pos logit contributions
peat+4.847
beit+4.266
wide+4.155
 elect+3.769
wise+3.724
Neg logit contributions
avorite-6.274
ucc-5.895
ÃÂ-5.818
ury-5.763
avour-5.606
Token-
Token ID12
Feature activation+1.39e+00

Pos logit contributions
peat+1.864
beit+1.513
wide+1.455
 elect+1.258
 primates+1.223
Neg logit contributions
avorite-4.587
ucc-4.379
ÃÂ-4.299
ury-4.239
avour-4.157
Tokenthan
Token ID14813
Feature activation+4.27e-01

Pos logit contributions
peat+9.206
beit+8.243
wide+8.127
 elect+7.455
wise+7.439
Neg logit contributions
avorite-7.127
ÃÂ-6.595
ucc-6.044
ury-5.808
Magikarp-5.796
Token-
Token ID12
Feature activation+1.40e+00

Pos logit contributions
peat+1.492
beit+1.204
wide+1.150
 elect+1.012
 primates+0.996
Neg logit contributions
avorite-3.502
ucc-3.367
ury-3.288
ÃÂ-3.277
avour-3.212
Tokenbre
Token ID4679
Feature activation-3.99e-01

Pos logit contributions
peat+9.283
beit+8.235
wide+8.192
 elect+7.415
wise+7.362
Neg logit contributions
avorite-7.337
ÃÂ-6.842
ucc-6.007
Magikarp-5.912
ury-5.792
Tokenxit
Token ID10198
Feature activation+4.04e-01

Pos logit contributions
avorite+1.663
ucc+1.593
ury+1.518
avour+1.450
ÃÂ+1.432
Neg logit contributions
peat-2.965
beit-2.703
wide-2.627
 primates-2.522
 elect-2.517
Token-
Token ID12
Feature activation+9.66e-01

Pos logit contributions
peat+1.385
beit+1.119
wide+1.070
 elect+0.926
 primates+0.905
Neg logit contributions
avorite-3.363
ucc-3.200
ÃÂ-3.165
ury-3.126
avour-3.068
Tokenis
Token ID271
Feature activation+5.87e-01

Pos logit contributions
peat+5.586
beit+4.937
wide+4.833
 elect+4.428
wise+4.366
Neg logit contributions
avorite-5.711
ÃÂ-5.250
ucc-5.176
ury-5.019
avour-4.899
Token-
Token ID12
Feature activation+8.25e-01

Pos logit contributions
peat+1.919
beit+1.515
wide+1.483
 elect+1.203
wise+1.175
Neg logit contributions
avorite-5.047
ÃÂ-4.813
ucc-4.727
ury-4.591
avour-4.507
Token Stella
Token ID45856
Feature activation+6.94e-02

Pos logit contributions
peat+4.020
beit+3.610
wide+3.509
 elect+3.341
 primates+3.318
Neg logit contributions
avorite-3.642
ucc-3.532
ury-3.406
avour-3.309
ÃÂ-3.281
Token's
Token ID338
Feature activation+4.05e-01

Pos logit contributions
peat+0.367
beit+0.321
wide+0.309
 primates+0.292
 elect+0.290
Neg logit contributions
avorite-0.434
ucc-0.424
ury-0.412
avour-0.400
ÃÂ-0.394
Token body
Token ID1767
Feature activation+4.33e-01

Pos logit contributions
peat+1.956
beit+1.690
wide+1.650
 elect+1.550
 primates+1.541
Neg logit contributions
avorite-2.726
ucc-2.645
ury-2.554
avour-2.497
ÃÂ-2.495
Token t
Token ID256
Feature activation-2.40e-02

Pos logit contributions
peat+2.138
beit+1.857
wide+1.819
 elect+1.708
 primates+1.687
Neg logit contributions
avorite-2.890
ucc-2.792
ury-2.680
ÃÂ-2.634
avour-2.628
Tokenumbled
Token ID11137
Feature activation+5.33e-01

Pos logit contributions
avorite+0.145
ucc+0.143
ury+0.138
avour+0.134
ÃÂ+0.131
Neg logit contributions
peat-0.132
beit-0.116
wide-0.111
 primates-0.106
 elect-0.105
Token down
Token ID866
Feature activation+1.40e+00

Pos logit contributions
peat+2.678
beit+2.317
wide+2.293
 elect+2.154
 primates+2.122
Neg logit contributions
avorite-3.513
ucc-3.389
ÃÂ-3.218
ury-3.211
avour-3.187
Token.
Token ID13
Feature activation+8.34e-01

Pos logit contributions
peat+6.435
wide+5.431
beit+5.366
 elect+5.051
 candid+4.990
Neg logit contributions
avorite-9.364
ucc-8.881
ÃÂ-8.717
avour-8.428
ury-8.399
Token With
Token ID2080
Feature activation+7.14e-01

Pos logit contributions
peat+4.654
beit+4.261
wide+4.228
 elect+3.843
wise+3.759
Neg logit contributions
avorite-4.994
ucc-4.507
ÃÂ-4.439
ury-4.397
avour-4.308
Token her
Token ID607
Feature activation+5.08e-01

Pos logit contributions
peat+3.347
beit+2.903
wide+2.839
 elect+2.678
 primates+2.643
Neg logit contributions
avorite-4.827
ucc-4.631
ury-4.503
ÃÂ-4.423
avour-4.419
Token body
Token ID1767
Feature activation+5.11e-01

Pos logit contributions
peat+2.497
beit+2.160
wide+2.121
 elect+1.991
 primates+1.970
Neg logit contributions
avorite-3.374
ucc-3.272
ury-3.147
avour-3.081
ÃÂ-3.072
Token in
Token ID287
Feature activation+5.33e-01

Pos logit contributions
peat+2.539
beit+2.188
wide+2.178
 elect+2.033
 primates+1.998
Neg logit contributions
avorite-3.406
ucc-3.274
ury-3.128
ÃÂ-3.087
avour-3.085
Token administrative
Token ID11553
Feature activation+2.97e-01

Pos logit contributions
peat+1.807
beit+1.571
wide+1.510
 primates+1.421
 elect+1.411
Neg logit contributions
avorite-2.442
ucc-2.381
ury-2.313
avour-2.250
ÃÂ-2.228
Token offices
Token ID9730
Feature activation+5.28e-01

Pos logit contributions
peat+1.417
beit+1.223
wide+1.169
 primates+1.096
 elect+1.088
Neg logit contributions
avorite-2.016
ucc-1.973
ury-1.919
avour-1.867
ÃÂ-1.841
Token.
Token ID13
Feature activation+7.61e-01

Pos logit contributions
peat+2.736
beit+2.410
wide+2.363
 elect+2.205
 primates+2.193
Neg logit contributions
avorite-3.338
ucc-3.215
ury-3.114
avour-3.045
ÃÂ-3.008
Token Ichigo
Token ID44547
Feature activation+3.08e-01

Pos logit contributions
peat+4.319
beit+3.927
wide+3.811
 elect+3.560
 primates+3.526
Neg logit contributions
avorite-4.445
ucc-4.151
ury-4.037
avour-3.929
ÃÂ-3.910
Token walked
Token ID6807
Feature activation+9.00e-01

Pos logit contributions
peat+1.559
beit+1.372
wide+1.316
 elect+1.244
 primates+1.232
Neg logit contributions
avorite-2.000
ucc-1.942
ury-1.873
avour-1.831
ÃÂ-1.808
Token out
Token ID503
Feature activation+1.39e+00

Pos logit contributions
peat+4.040
beit+3.455
wide+3.413
 elect+3.203
 candid+3.193
Neg logit contributions
avorite-6.343
ucc-6.050
ÃÂ-5.821
ury-5.770
avour-5.729
Token of
Token ID286
Feature activation+7.96e-01

Pos logit contributions
peat+5.917
wide+5.105
beit+5.081
 elect+4.767
 candid+4.724
Neg logit contributions
avorite-9.742
ucc-9.041
ÃÂ-8.846
avour-8.730
ury-8.717
Token the
Token ID262
Feature activation+1.00e+00

Pos logit contributions
peat+3.884
beit+3.441
wide+3.299
 elect+3.087
 primates+3.069
Neg logit contributions
avorite-5.208
ucc-5.031
ury-4.861
avour-4.779
ÃÂ-4.749
Token Second
Token ID5498
Feature activation+6.60e-01

Pos logit contributions
peat+4.716
beit+4.147
wide+4.089
 primates+3.871
 elect+3.840
Neg logit contributions
avorite-6.649
ucc-6.342
ury-6.173
ÃÂ-6.114
avour-6.068
Token Division
Token ID7458
Feature activation+5.18e-01

Pos logit contributions
peat+3.668
beit+3.253
wide+3.174
 elect+2.972
 primates+2.969
Neg logit contributions
avorite-3.915
ucc-3.774
ury-3.654
avour-3.546
ÃÂ-3.516
Token Barr
Token ID27883
Feature activation+4.32e-01

Pos logit contributions
peat+2.815
beit+2.500
wide+2.435
 primates+2.290
 elect+2.281
Neg logit contributions
avorite-3.147
ucc-3.033
ury-2.949
avour-2.869
ÃÂ-2.837
Tokenaut
Token ID2306
Feature activation-3.80e-01

Pos logit contributions
peat+5.802
beit+5.374
wide+5.288
 elect+4.937
wise+4.858
Neg logit contributions
avorite-3.029
ÃÂ-2.732
ucc-2.557
ury-2.463
Magikarp-2.404
Tokenom
Token ID296
Feature activation-5.31e-01

Pos logit contributions
ucc+2.328
avorite+2.287
ury+2.264
avour+2.175
uri+2.076
Neg logit contributions
peat-2.017
beit-1.746
wide-1.663
 primates-1.618
 elect-1.586
Tokenation
Token ID341
Feature activation+2.68e-01

Pos logit contributions
ucc+2.274
avorite+2.224
ury+2.129
avour+2.054
ÃÂ+1.889
Neg logit contributions
peat-3.792
beit-3.455
wide-3.350
 primates-3.271
 candid-3.209
Token-
Token ID12
Feature activation+9.63e-01

Pos logit contributions
peat+0.970
beit+0.797
wide+0.752
 elect+0.679
 primates+0.677
Neg logit contributions
avorite-2.132
ucc-2.084
ury-2.032
avour-1.985
ÃÂ-1.983
Tokenrather
Token ID34330
Feature activation+5.49e-01

Pos logit contributions
peat+4.847
beit+4.266
wide+4.155
 elect+3.769
wise+3.724
Neg logit contributions
avorite-6.274
ucc-5.895
ÃÂ-5.818
ury-5.763
avour-5.606
Token-
Token ID12
Feature activation+1.39e+00

Pos logit contributions
peat+1.864
beit+1.513
wide+1.455
 elect+1.258
 primates+1.223
Neg logit contributions
avorite-4.587
ucc-4.379
ÃÂ-4.299
ury-4.239
avour-4.157
Tokenthan
Token ID14813
Feature activation+4.27e-01

Pos logit contributions
peat+9.206
beit+8.243
wide+8.127
 elect+7.455
wise+7.439
Neg logit contributions
avorite-7.127
ÃÂ-6.595
ucc-6.044
ury-5.808
Magikarp-5.796
Token-
Token ID12
Feature activation+1.40e+00

Pos logit contributions
peat+1.492
beit+1.204
wide+1.150
 elect+1.012
 primates+0.996
Neg logit contributions
avorite-3.502
ucc-3.367
ury-3.288
ÃÂ-3.277
avour-3.212
Tokenbre
Token ID4679
Feature activation-3.99e-01

Pos logit contributions
peat+9.283
beit+8.235
wide+8.192
 elect+7.415
wise+7.362
Neg logit contributions
avorite-7.337
ÃÂ-6.842
ucc-6.007
Magikarp-5.912
ury-5.792
Tokenxit
Token ID10198
Feature activation+4.04e-01

Pos logit contributions
avorite+1.663
ucc+1.593
ury+1.518
avour+1.450
ÃÂ+1.432
Neg logit contributions
peat-2.965
beit-2.703
wide-2.627
 primates-2.522
 elect-2.517
Token-
Token ID12
Feature activation+9.66e-01

Pos logit contributions
peat+1.385
beit+1.119
wide+1.070
 elect+0.926
 primates+0.905
Neg logit contributions
avorite-3.363
ucc-3.200
ÃÂ-3.165
ury-3.126
avour-3.068
Token 02
Token ID7816
Feature activation+1.50e-02

Pos logit contributions
avorite+0.184
ucc+0.181
ury+0.175
avour+0.169
ÃÂ+0.166
Neg logit contributions
peat-0.173
beit-0.153
wide-0.147
 primates-0.140
 elect-0.139
Token:
Token ID25
Feature activation+5.15e-01

Pos logit contributions
peat+0.081
beit+0.071
wide+0.068
 primates+0.065
 elect+0.064
Neg logit contributions
avorite-0.091
ucc-0.090
ury-0.087
avour-0.084
ÃÂ-0.083
Token41
Token ID3901
Feature activation+1.11e-01

Pos logit contributions
peat+3.108
beit+2.778
wide+2.706
 elect+2.548
 primates+2.547
Neg logit contributions
avorite-2.832
ucc-2.730
ury-2.620
avour-2.542
ÃÂ-2.524
Token
Token ID198
Feature activation+2.30e-01

Pos logit contributions
peat+0.593
beit+0.520
wide+0.498
 primates+0.473
 elect+0.469
Neg logit contributions
avorite-0.690
ucc-0.675
ury-0.657
avour-0.636
ÃÂ-0.626
Token
Token ID198
Feature activation-1.33e-01

Pos logit contributions
peat+1.126
beit+0.972
wide+0.951
 elect+0.869
 primates+0.865
Neg logit contributions
avorite-1.574
ucc-1.497
ury-1.453
ÃÂ-1.445
avour-1.419
Token2
Token ID17
Feature activation+1.38e+00

Pos logit contributions
avorite+0.480
ucc+0.470
ury+0.447
avour+0.422
ÃÂ+0.402
Neg logit contributions
peat-1.037
beit-0.952
wide-0.922
 primates-0.904
 elect-0.896
Token.
Token ID13
Feature activation+2.87e-01

Pos logit contributions
peat+5.995
wide+5.309
beit+5.133
wise+4.426
 elect+4.400
Neg logit contributions
avorite-10.337
ÃÂ-9.514
ucc-8.927
ury-8.591
avour-8.567
Token Micro
Token ID4527
Feature activation-1.60e-01

Pos logit contributions
peat+1.783
beit+1.597
wide+1.548
 primates+1.473
 elect+1.468
Neg logit contributions
avorite-1.533
ucc-1.486
ury-1.432
avour-1.384
ÃÂ-1.363
Tokens
Token ID82
Feature activation+2.39e-02

Pos logit contributions
avorite+0.899
ucc+0.882
ury+0.854
avour+0.823
ÃÂ+0.805
Neg logit contributions
peat-0.942
beit-0.835
wide-0.803
 primates-0.770
 elect-0.763
Tokenatellite
Token ID26493
Feature activation-2.51e-01

Pos logit contributions
peat+0.144
beit+0.128
wide+0.124
 primates+0.118
 elect+0.117
Neg logit contributions
avorite-0.131
ucc-0.129
ury-0.124
avour-0.120
ÃÂ-0.117
Token 03
Token ID7643
Feature activation+5.34e-01

Pos logit contributions
avorite+1.388
ucc+1.372
ury+1.340
avour+1.284
ÃÂ+1.247
Neg logit contributions
peat-1.477
beit-1.310
wide-1.253
 primates-1.212
 elect-1.189
Tokenx
Token ID87
Feature activation-3.03e-01

Pos logit contributions
peat+0.531
beit+0.486
wide+0.468
 primates+0.460
 elect+0.452
Neg logit contributions
avorite-0.257
ucc-0.252
ury-0.240
avour-0.226
ÃÂ-0.219
Token001
Token ID8298
Feature activation+3.69e-01

Pos logit contributions
avorite+1.310
ucc+1.267
ury+1.228
avour+1.151
ÃÂ+1.100
Neg logit contributions
peat-2.181
beit-1.984
wide-1.912
 primates-1.887
 elect-1.844
Tokenf
Token ID69
Feature activation+2.12e-02

Pos logit contributions
peat+2.339
beit+2.093
wide+2.021
 primates+1.940
 elect+1.922
Neg logit contributions
avorite-1.932
ucc-1.883
ury-1.817
avour-1.746
ÃÂ-1.712
Token1
Token ID16
Feature activation+9.32e-01

Pos logit contributions
peat+0.154
beit+0.140
wide+0.136
 primates+0.132
 elect+0.130
Neg logit contributions
avorite-0.091
ucc-0.088
ury-0.085
avour-0.081
ÃÂ-0.077
Tokenf
Token ID69
Feature activation+4.41e-01

Pos logit contributions
peat+4.913
beit+4.427
wide+4.361
wise+3.882
 elect+3.875
Neg logit contributions
avorite-5.952
ÃÂ-5.572
ucc-5.318
ury-5.158
avour-5.146
Token4
Token ID19
Feature activation+1.38e+00

Pos logit contributions
peat+3.148
beit+2.857
wide+2.775
 primates+2.674
 elect+2.656
Neg logit contributions
avorite-1.948
ucc-1.891
ury-1.812
avour-1.731
ÃÂ-1.685
Tokenb
Token ID65
Feature activation+8.20e-01

Pos logit contributions
peat+5.843
beit+5.115
wide+4.922
wise+4.109
 rats+3.976
Neg logit contributions
avorite-10.190
ÃÂ-9.535
ucc-9.049
ury-9.014
avour-8.833
Tokenbe
Token ID1350
Feature activation+3.67e-01

Pos logit contributions
peat+3.283
beit+2.806
wide+2.593
 elect+2.237
wise+2.166
Neg logit contributions
avorite-6.404
ÃÂ-6.045
ucc-5.961
ury-5.835
avour-5.748
Token66
Token ID2791
Feature activation+1.71e-01

Pos logit contributions
peat+2.720
beit+2.479
wide+2.446
 elect+2.318
 primates+2.296
Neg logit contributions
avorite-1.551
ÃÂ-1.392
ucc-1.384
ury-1.344
avour-1.305
Token power
Token ID1176
Feature activation-3.00e-01

Pos logit contributions
peat+0.953
beit+0.840
wide+0.808
 primates+0.768
 elect+0.759
Neg logit contributions
avorite-1.021
ucc-1.001
ury-0.970
avour-0.937
ÃÂ-0.923
Token off
Token ID572
Feature activation+1.00e-01

Pos logit contributions
avorite+1.566
ucc+1.542
ury+1.484
avour+1.418
ÃÂ+1.398
Neg logit contributions
peat-1.874
beit-1.686
wide-1.615
 primates-1.557
 elect-1.523
Token only
Token ID691
Feature activation+2.59e-02

Pos logit contributions
peat+2.530
beit+2.237
wide+2.160
 elect+2.047
 primates+2.030
Neg logit contributions
avorite-2.769
ucc-2.660
ury-2.569
avour-2.501
ÃÂ-2.483
Token in
Token ID287
Feature activation+1.52e-02

Pos logit contributions
peat+0.132
beit+0.115
wide+0.110
 primates+0.104
 elect+0.103
Neg logit contributions
avorite-0.167
ucc-0.163
ury-0.158
avour-0.154
ÃÂ-0.151
Token its
Token ID663
Feature activation-3.18e-02

Pos logit contributions
peat+0.083
beit+0.073
wide+0.070
 primates+0.066
 elect+0.066
Neg logit contributions
avorite-0.093
ucc-0.091
ury-0.088
avour-0.085
ÃÂ-0.084
Token being
Token ID852
Feature activation+3.46e-01

Pos logit contributions
avorite+0.197
ucc+0.193
ury+0.187
avour+0.182
ÃÂ+0.178
Neg logit contributions
peat-0.170
beit-0.149
wide-0.142
 primates-0.134
 elect-0.133
Token,
Token ID11
Feature activation+7.14e-01

Pos logit contributions
peat+1.734
beit+1.509
wide+1.448
 primates+1.358
 elect+1.354
Neg logit contributions
avorite-2.266
ucc-2.206
ury-2.141
avour-2.082
ÃÂ-2.061
Tokenbut
Token ID4360
Feature activation+1.36e+00

Pos logit contributions
peat+3.432
beit+2.991
wide+2.865
 elect+2.645
 primates+2.633
Neg logit contributions
avorite-4.808
ucc-4.626
ury-4.507
avour-4.382
ÃÂ-4.374
Token also
Token ID635
Feature activation+5.34e-01

Pos logit contributions
peat+6.316
beit+5.424
wide+5.341
 elect+4.884
 candid+4.840
Neg logit contributions
avorite-9.020
ucc-8.659
ÃÂ-8.302
ury-8.282
avour-8.123
Token in
Token ID287
Feature activation+1.69e-01

Pos logit contributions
peat+2.498
beit+2.142
wide+2.055
 elect+1.902
 primates+1.902
Neg logit contributions
avorite-3.683
ucc-3.583
ury-3.475
avour-3.391
ÃÂ-3.370
Token its
Token ID663
Feature activation-1.46e-02

Pos logit contributions
peat+0.901
beit+0.790
wide+0.759
 primates+0.718
 elect+0.713
Neg logit contributions
avorite-1.049
ucc-1.025
ury-0.994
avour-0.964
ÃÂ-0.949
Token process
Token ID1429
Feature activation-1.24e-01

Pos logit contributions
avorite+0.089
ucc+0.087
ury+0.085
avour+0.082
ÃÂ+0.080
Neg logit contributions
peat-0.079
beit-0.069
wide-0.066
 primates-0.063
 elect-0.062
Token.
Token ID13
Feature activation+1.37e-01

Pos logit contributions
avorite+0.740
ucc+0.734
ury+0.713
avour+0.687
ÃÂ+0.674
Neg logit contributions
peat-0.674
beit-0.597
wide-0.570
 primates-0.545
 elect-0.537
Token 18
Token ID1248
Feature activation-4.05e-01

Pos logit contributions
peat+0.823
beit+0.739
wide+0.716
 primates+0.681
 elect+0.679
Neg logit contributions
avorite-0.720
ucc-0.697
ury-0.672
avour-0.649
ÃÂ-0.643
Token59
Token ID3270
Feature activation+2.93e-01

Pos logit contributions
avorite+1.902
ucc+1.891
ury+1.819
avour+1.741
ÃÂ+1.657
Neg logit contributions
peat-2.726
beit-2.451
wide-2.358
 primates-2.295
 elect-2.277
Token.
Token ID13
Feature activation+3.20e-01

Pos logit contributions
peat+1.741
beit+1.544
wide+1.483
 primates+1.420
 elect+1.405
Neg logit contributions
avorite-1.646
ucc-1.612
ury-1.560
avour-1.506
ÃÂ-1.476
Token
Token ID198
Feature activation-8.94e-02

Pos logit contributions
peat+1.699
beit+1.488
wide+1.423
 primates+1.352
 elect+1.337
Neg logit contributions
avorite-2.000
ucc-1.958
ury-1.898
avour-1.842
ÃÂ-1.815
Token
Token ID198
Feature activation-1.68e-01

Pos logit contributions
avorite+0.561
ucc+0.554
ury+0.538
avour+0.521
ÃÂ+0.508
Neg logit contributions
peat-0.465
beit-0.407
wide-0.388
 primates-0.369
 elect-0.365
Token11
Token ID1157
Feature activation+1.36e+00

Pos logit contributions
avorite+0.840
ucc+0.830
ury+0.801
avour+0.771
ÃÂ+0.727
Neg logit contributions
peat-1.083
beit-0.970
wide-0.931
 primates-0.909
 elect-0.902
Token)
Token ID8
Feature activation+1.95e-01

Pos logit contributions
peat+3.925
wide+3.251
beit+3.235
wise+2.455
 elect+2.353
Neg logit contributions
avorite-12.290
ÃÂ-11.335
ucc-10.711
ury-10.473
avour-10.461
Token When
Token ID1649
Feature activation-1.72e-01

Pos logit contributions
peat+1.195
beit+1.076
wide+1.044
 elect+0.988
 primates+0.984
Neg logit contributions
avorite-1.067
ucc-1.014
ury-0.975
ÃÂ-0.946
avour-0.945
Token Dancing
Token ID31799
Feature activation-4.86e-01

Pos logit contributions
avorite+0.991
ucc+0.971
ury+0.942
avour+0.910
ÃÂ+0.893
Neg logit contributions
peat-0.991
beit-0.872
wide-0.842
 primates-0.801
 elect-0.792
Token the
Token ID262
Feature activation-1.40e-01

Pos logit contributions
avorite+2.664
ucc+2.656
ury+2.597
avour+2.509
ÃÂ+2.416
Neg logit contributions
peat-2.845
beit-2.473
wide-2.401
 primates-2.310
 elect-2.247
Token Walt
Token ID18935
Feature activation-6.49e-01

Pos logit contributions
avorite+0.725
ucc+0.708
ury+0.685
avour+0.660
ÃÂ+0.643
Neg logit contributions
peat-0.885
beit-0.792
wide-0.765
 primates-0.731
 elect-0.724
Token Chern
Token ID37243
Feature activation+1.88e-01

Pos logit contributions
peat+2.787
beit+2.443
wide+2.364
 primates+2.237
 elect+2.236
Neg logit contributions
avorite-3.371
ucc-3.291
ury-3.183
avour-3.110
ÃÂ-3.070
Tokenobyl
Token ID46666
Feature activation-5.35e-02

Pos logit contributions
peat+1.254
beit+1.133
wide+1.104
 elect+1.044
 primates+1.037
Neg logit contributions
avorite-0.937
ucc-0.876
ury-0.843
ÃÂ-0.831
avour-0.823
Token.
Token ID13
Feature activation+1.15e-01

Pos logit contributions
avorite+0.313
ucc+0.307
ury+0.297
avour+0.287
ÃÂ+0.283
Neg logit contributions
peat-0.306
beit-0.269
wide-0.257
 primates-0.245
 elect-0.242
Token P
Token ID350
Feature activation+5.35e-01

Pos logit contributions
peat+0.614
beit+0.538
wide+0.516
 primates+0.489
 elect+0.485
Neg logit contributions
avorite-0.717
ucc-0.702
ury-0.681
avour-0.660
ÃÂ-0.650
Tokenfl
Token ID2704
Feature activation+1.05e-01

Pos logit contributions
peat+2.767
beit+2.435
wide+2.333
 primates+2.182
 elect+2.180
Neg logit contributions
avorite-3.359
ucc-3.256
ury-3.152
ÃÂ-3.080
avour-3.071
Tokenug
Token ID1018
Feature activation+1.35e+00

Pos logit contributions
peat+0.772
beit+0.703
wide+0.686
 elect+0.656
 primates+0.646
Neg logit contributions
avorite-0.446
ucc-0.404
ÃÂ-0.401
ury-0.388
avour-0.376
Tokenbe
Token ID1350
Feature activation+3.15e-01

Pos logit contributions
peat+4.893
beit+4.107
wide+3.792
wise+3.092
 elect+3.048
Neg logit contributions
avorite-11.066
ÃÂ-10.466
ucc-10.039
ury-9.831
Magikarp-9.791
Tokenil
Token ID346
Feature activation+3.80e-01

Pos logit contributions
peat+2.226
beit+2.039
wide+1.953
 elect+1.830
 primates+1.812
Neg logit contributions
avorite-1.486
ÃÂ-1.349
ucc-1.344
ury-1.291
avour-1.227
Token also
Token ID635
Feature activation-4.95e-02

Pos logit contributions
peat+1.920
beit+1.685
wide+1.613
 primates+1.519
 elect+1.517
Neg logit contributions
avorite-2.465
ucc-2.381
ury-2.309
avour-2.256
ÃÂ-2.225
Token said
Token ID531
Feature activation+1.56e-01

Pos logit contributions
avorite+0.323
ucc+0.317
ury+0.308
avour+0.299
ÃÂ+0.294
Neg logit contributions
peat-0.248
beit-0.215
wide-0.206
 primates-0.193
 elect-0.192
Token some
Token ID617
Feature activation+4.38e-02

Pos logit contributions
peat+0.803
beit+0.700
wide+0.672
 primates+0.634
 elect+0.629
Neg logit contributions
avorite-1.004
ucc-0.981
ury-0.953
avour-0.926
ÃÂ-0.912
Token a
Token ID257
Feature activation+1.39e-01

Pos logit contributions
avorite+0.149
ucc+0.146
ury+0.142
avour+0.138
ÃÂ+0.136
Neg logit contributions
peat-0.109
beit-0.094
wide-0.090
 primates-0.084
 elect-0.083
Token knife
Token ID9845
Feature activation+2.53e-01

Pos logit contributions
peat+0.736
beit+0.645
wide+0.620
 primates+0.586
 elect+0.582
Neg logit contributions
avorite-0.873
ucc-0.853
ury-0.827
avour-0.803
ÃÂ-0.792
Token.
Token ID13
Feature activation+8.81e-02

Pos logit contributions
peat+1.352
beit+1.192
wide+1.149
 elect+1.085
 primates+1.081
Neg logit contributions
avorite-1.567
ucc-1.521
ury-1.472
avour-1.429
ÃÂ-1.424
Token
Token ID198
Feature activation-5.28e-02

Pos logit contributions
peat+0.519
beit+0.463
wide+0.450
 elect+0.424
 primates+0.421
Neg logit contributions
avorite-0.505
ucc-0.480
ury-0.466
ÃÂ-0.455
avour-0.450
Token
Token ID198
Feature activation-1.15e-01

Pos logit contributions
avorite+0.335
ucc+0.329
ury+0.319
avour+0.310
ÃÂ+0.304
Neg logit contributions
peat-0.273
beit-0.238
wide-0.228
 primates-0.216
 elect-0.214
Token3
Token ID18
Feature activation+1.35e+00

Pos logit contributions
avorite+0.380
ucc+0.373
ury+0.348
avour+0.331
ÃÂ+0.306
Neg logit contributions
peat-0.934
beit-0.862
wide-0.838
 primates-0.820
 elect-0.814
Token.
Token ID13
Feature activation+8.33e-02

Pos logit contributions
peat+5.803
wide+5.062
beit+4.887
wise+4.322
 elect+4.164
Neg logit contributions
avorite-10.465
ÃÂ-9.994
ucc-8.894
-8.855
 srf-8.846
Token In
Token ID554
Feature activation-2.72e-02

Pos logit contributions
peat+0.543
beit+0.491
wide+0.479
 elect+0.456
 primates+0.450
Neg logit contributions
avorite-0.426
ucc-0.402
ury-0.388
ÃÂ-0.379
avour-0.372
Token a
Token ID257
Feature activation-1.62e-01

Pos logit contributions
avorite+0.185
ucc+0.183
ury+0.177
avour+0.174
ÃÂ+0.169
Neg logit contributions
peat-0.127
beit-0.110
wide-0.104
 primates-0.098
 elect-0.096
Token large
Token ID1588
Feature activation-8.86e-02

Pos logit contributions
avorite+1.191
ucc+1.174
ury+1.146
avour+1.119
ÃÂ+1.097
Neg logit contributions
peat-0.674
beit-0.570
wide-0.529
 primates-0.496
 elect-0.488
Token bowl
Token ID9396
Feature activation+2.57e-02

Pos logit contributions
avorite+0.559
ucc+0.549
ury+0.535
avour+0.518
ÃÂ+0.507
Neg logit contributions
peat-0.461
beit-0.402
wide-0.381
 primates-0.362
 elect-0.357
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
avorite+1.125
ucc+1.123
ury+1.093
avour+1.047
ÃÂ+1.003
Neg logit contributions
peat-1.282
beit-1.136
wide-1.095
 primates-1.061
 elect-1.041
Token01
Token ID486
Feature activation-5.65e-01

Pos logit contributions
avorite+1.319
ucc+1.319
ury+1.269
avour+1.219
ÃÂ+1.167
Neg logit contributions
peat-1.542
beit-1.369
wide-1.316
 primates-1.279
 elect-1.265
Token.
Token ID13
Feature activation-1.58e-01

Pos logit contributions
ucc+2.876
ury+2.790
avorite+2.725
avour+2.643
uri+2.528
Neg logit contributions
peat-3.536
beit-3.141
wide-2.996
 primates-2.978
 elect-2.915
Token29
Token ID1959
Feature activation-7.66e-01

Pos logit contributions
avorite+0.861
ucc+0.839
ury+0.811
avour+0.782
ÃÂ+0.768
Neg logit contributions
peat-0.961
beit-0.857
wide-0.828
 primates-0.790
 elect-0.785
Token"]
Token ID8973
Feature activation+3.60e-01

Pos logit contributions
ucc+3.738
ury+3.736
avorite+3.525
avour+3.512
uri+3.382
Neg logit contributions
peat-4.860
beit-4.098
 primates-4.032
wide-3.987
 candid-3.952
Token [
Token ID685
Feature activation-1.66e+00

Pos logit contributions
peat+1.614
beit+1.413
wide+1.369
 elect+1.281
 primates+1.241
Neg logit contributions
avorite-2.611
ucc-2.418
ÃÂ-2.386
ury-2.348
avour-2.285
TokenRound
Token ID22685
Feature activation-4.91e-01

Pos logit contributions
ury+2.782
ucc+2.394
avorite+2.279
avour+2.194
Torrent+1.891
Neg logit contributions
peat-15.688
beit-13.898
 primates-13.778
 candid-13.226
wide-13.224
Token "
Token ID366
Feature activation-4.43e-01

Pos logit contributions
avorite+2.959
ucc+2.934
ury+2.879
avour+2.792
uri+2.697
Neg logit contributions
peat-2.616
beit-2.273
wide-2.117
 primates-2.050
 elect-1.994
Token11
Token ID1157
Feature activation-2.18e-01

Pos logit contributions
avorite+1.955
ucc+1.938
ury+1.862
avour+1.782
ÃÂ+1.681
Neg logit contributions
peat-3.092
beit-2.792
wide-2.683
 primates-2.629
 elect-2.593
Token"]
Token ID8973
Feature activation+8.70e-01

Pos logit contributions
avorite+1.184
ucc+1.180
ury+1.161
avour+1.118
ÃÂ+1.065
Neg logit contributions
peat-1.295
beit-1.123
wide-1.091
 primates-1.050
 candid-1.028
Token [
Token ID685
Feature activation-1.26e+00

Pos logit contributions
peat+3.538
beit+3.045
wide+2.963
 elect+2.769
 rats+2.617
Neg logit contributions
avorite-6.724
ÃÂ-6.185
ucc-6.010
ury-5.887
avour-5.756
TokenTS
Token ID4694
Feature activation+1.77e-02

Pos logit contributions
peat+1.473
beit+1.338
wide+1.303
 elect+1.202
 primates+1.200
Neg logit contributions
avorite-1.635
ucc-1.534
ÃÂ-1.493
ury-1.492
avour-1.454
Token
Token ID447
Feature activation-2.40e-01

Pos logit contributions
peat+0.102
beit+0.089
wide+0.086
 primates+0.082
 elect+0.081
Neg logit contributions
avorite-0.102
ucc-0.101
ury-0.097
avour-0.094
ÃÂ-0.093
Token
Token ID247
Feature activation-4.37e-02

Pos logit contributions
avorite+1.095
ucc+1.060
ury+1.017
avour+0.975
ÃÂ+0.954
Neg logit contributions
peat-1.675
beit-1.518
wide-1.473
 primates-1.415
 elect-1.408
Token C
Token ID327
Feature activation+2.48e-02

Pos logit contributions
avorite+0.224
ucc+0.222
ury+0.212
avour+0.205
ÃÂ+0.200
Neg logit contributions
peat-0.278
beit-0.248
wide-0.239
 primates-0.229
 elect-0.227
TokenOU
Token ID2606
Feature activation-5.65e-01

Pos logit contributions
peat+0.112
beit+0.095
wide+0.091
 primates+0.084
 elect+0.084
Neg logit contributions
avorite-0.175
ucc-0.171
ury-0.166
avour-0.162
ÃÂ-0.160
TokenNS
Token ID8035
Feature activation-1.57e+00

Pos logit contributions
ucc+2.155
avorite+2.094
ury+2.042
avour+1.925
uri+1.767
Neg logit contributions
peat-4.284
beit-3.880
wide-3.749
 primates-3.668
 elect-3.649
TokenEL
Token ID3698
Feature activation-4.75e-01

Pos logit contributions
ucc+5.733
ury+5.394
avour+5.062
Language+4.818
avorite+4.773
Neg logit contributions
peat-11.463
beit-10.550
 primates-10.344
 rats-9.990
 warr-9.875
Token:
Token ID25
Feature activation-1.41e-01

Pos logit contributions
ucc+2.768
avorite+2.726
ury+2.691
avour+2.580
ÃÂ+2.480
Neg logit contributions
peat-2.636
beit-2.302
wide-2.189
 primates-2.130
 elect-2.097
Token We
Token ID775
Feature activation-9.67e-02

Pos logit contributions
avorite+0.782
ucc+0.750
ury+0.726
avour+0.702
ÃÂ+0.695
Neg logit contributions
peat-0.852
beit-0.767
wide-0.742
 primates-0.705
 elect-0.701
Token can
Token ID460
Feature activation-5.27e-01

Pos logit contributions
avorite+0.607
ucc+0.593
ury+0.575
avour+0.558
ÃÂ+0.550
Neg logit contributions
peat-0.509
beit-0.446
wide-0.429
 primates-0.405
 elect-0.403
Token say
Token ID910
Feature activation+1.94e-01

Pos logit contributions
avorite+2.847
ucc+2.819
ury+2.726
avour+2.612
ÃÂ+2.558
Neg logit contributions
peat-3.182
beit-2.830
wide-2.713
 primates-2.585
 elect-2.524
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
ucc+2.137
avorite+2.136
ury+2.068
avour+1.987
ÃÂ+1.896
Neg logit contributions
peat-2.349
beit-2.091
wide-1.994
 primates-1.937
 elect-1.910
Tokenim
Token ID320
Feature activation+1.35e-01

Pos logit contributions
avorite+1.763
ucc+1.721
ury+1.624
avour+1.530
ÃÂ+1.448
Neg logit contributions
peat-4.159
beit-3.826
wide-3.714
 primates-3.614
 elect-3.594
Tokenread
Token ID961
Feature activation-3.01e-01

Pos logit contributions
peat+0.592
beit+0.500
wide+0.482
 elect+0.438
 primates+0.434
Neg logit contributions
avorite-0.977
ucc-0.953
ury-0.925
ÃÂ-0.906
avour-0.902
Token(
Token ID7
Feature activation-5.86e-01

Pos logit contributions
avorite+1.668
ucc+1.654
ury+1.599
avour+1.543
ÃÂ+1.491
Neg logit contributions
peat-1.775
beit-1.575
wide-1.508
 primates-1.458
 elect-1.441
Tokenimage
Token ID9060
Feature activation-1.45e+00

Pos logit contributions
ucc+2.985
avorite+2.958
ury+2.882
avour+2.763
uri+2.665
Neg logit contributions
peat-3.667
beit-3.296
wide-3.115
 primates-3.105
 elect-3.045
TokenPath
Token ID15235
Feature activation-1.55e+00

Pos logit contributions
ucc+1.984
ury+1.710
avour+1.528
Distance+1.389
avorite+1.368
Neg logit contributions
peat-13.742
beit-12.958
 primates-12.562
 elect-12.417
 rats-12.315
Token)
Token ID8
Feature activation-2.75e-01

Pos logit contributions
ucc+8.788
ury+8.306
uri+7.938
avour+7.897
Distance+7.604
Neg logit contributions
peat-8.066
beit-7.010
 primates-6.855
 warr-6.743
 elect-6.471
Token image
Token ID2939
Feature activation-1.12e+00

Pos logit contributions
avorite+1.642
ucc+1.616
ury+1.568
avour+1.517
ÃÂ+1.479
Neg logit contributions
peat-1.523
beit-1.340
wide-1.280
 primates-1.221
 elect-1.203
Token =
Token ID796
Feature activation-3.17e-01

Pos logit contributions
ucc+4.178
ury+4.028
avorite+3.805
avour+3.660
uri+3.654
Neg logit contributions
peat-8.299
beit-7.539
 primates-7.257
 elect-7.186
wide-7.158
Token im
Token ID545
Feature activation-6.42e-01

Pos logit contributions
ucc+1.652
avorite+1.645
ury+1.610
avour+1.547
uri+1.480
Neg logit contributions
peat-1.923
beit-1.716
wide-1.645
 primates-1.555
 elect-1.528
Tokenutils
Token ID26791
Feature activation-4.64e-01

Pos logit contributions
ucc+2.471
avorite+2.415
ury+2.339
avour+2.201
uri+2.076
Neg logit contributions
peat-4.822
beit-4.436
wide-4.224
 primates-4.168
 elect-4.112
Token interesting
Token ID3499
Feature activation-1.10e-01

Pos logit contributions
avorite+1.007
ucc+0.990
ury+0.966
avour+0.941
ÃÂ+0.917
Neg logit contributions
peat-0.700
beit-0.597
wide-0.568
 primates-0.533
 elect-0.523
Token session
Token ID6246
Feature activation-1.00e-01

Pos logit contributions
avorite+0.725
ucc+0.713
ury+0.693
avour+0.674
ÃÂ+0.661
Neg logit contributions
peat-0.543
beit-0.470
wide-0.447
 primates-0.420
 elect-0.415
Token to
Token ID284
Feature activation-9.24e-02

Pos logit contributions
avorite+0.601
ucc+0.588
ury+0.570
avour+0.552
ÃÂ+0.543
Neg logit contributions
peat-0.552
beit-0.486
wide-0.467
 primates-0.443
 elect-0.440
Token say
Token ID910
Feature activation-5.62e-02

Pos logit contributions
avorite+0.580
ucc+0.570
ury+0.552
avour+0.536
ÃÂ+0.524
Neg logit contributions
peat-0.485
beit-0.422
wide-0.406
 primates-0.381
 elect-0.375
Token the
Token ID262
Feature activation-6.76e-01

Pos logit contributions
avorite+0.353
ucc+0.346
ury+0.335
avour+0.325
ÃÂ+0.320
Neg logit contributions
peat-0.295
beit-0.258
wide-0.248
 primates-0.234
 elect-0.232
Token least
Token ID1551
Feature activation-1.54e+00

Pos logit contributions
ucc+3.794
avorite+3.792
ury+3.698
avour+3.554
ÃÂ+3.401
Neg logit contributions
peat-3.812
beit-3.352
wide-3.267
 primates-3.052
 elect-2.976
Token.
Token ID13
Feature activation-2.64e-01

Pos logit contributions
ucc+7.173
avorite+6.994
ury+6.982
avour+6.628
ÃÂ+6.613
Neg logit contributions
peat-9.283
beit-8.014
wide-7.875
 primates-7.494
 elect-7.323
Token I
Token ID314
Feature activation-3.35e-01

Pos logit contributions
avorite+1.473
ucc+1.440
ury+1.391
avour+1.344
ÃÂ+1.321
Neg logit contributions
peat-1.567
beit-1.392
wide-1.343
 primates-1.280
 elect-1.271
Token consider
Token ID2074
Feature activation-6.11e-01

Pos logit contributions
avorite+2.077
ucc+2.035
ury+1.982
avour+1.921
ÃÂ+1.877
Neg logit contributions
peat-1.772
beit-1.552
wide-1.485
 primates-1.403
 elect-1.386
Token myself
Token ID3589
Feature activation-1.81e-02

Pos logit contributions
ucc+3.597
avorite+3.597
ury+3.466
avour+3.325
ÃÂ+3.235
Neg logit contributions
peat-3.344
beit-2.937
wide-2.799
 primates-2.623
 elect-2.608
Token an
Token ID281
Feature activation+6.69e-02

Pos logit contributions
avorite+0.121
ucc+0.119
ury+0.116
avour+0.112
ÃÂ+0.111
Neg logit contributions
peat-0.088
beit-0.076
wide-0.073
 primates-0.068
 elect-0.068
Token to
Token ID284
Feature activation-2.39e-01

Pos logit contributions
avorite+0.135
ucc+0.132
ury+0.128
avour+0.124
ÃÂ+0.122
Neg logit contributions
peat-0.112
beit-0.097
wide-0.093
 primates-0.088
 elect-0.087
Token address
Token ID2209
Feature activation-2.61e-01

Pos logit contributions
avorite+1.510
ucc+1.488
ury+1.449
avour+1.403
ÃÂ+1.364
Neg logit contributions
peat-1.239
beit-1.074
wide-1.026
 primates-0.965
 elect-0.937
Token sab
Token ID17463
Feature activation-3.79e-01

Pos logit contributions
avorite+1.640
ucc+1.621
ury+1.580
avour+1.509
ÃÂ+1.470
Neg logit contributions
peat-1.364
beit-1.176
wide-1.111
 primates-1.052
 elect-1.028
Tokener
Token ID263
Feature activation-2.95e-01

Pos logit contributions
ucc+1.704
avorite+1.701
ury+1.616
avour+1.509
uri+1.467
Neg logit contributions
peat-2.641
beit-2.375
wide-2.296
 primates-2.215
 candid-2.180
Token-
Token ID12
Feature activation-9.77e-01

Pos logit contributions
avorite+1.967
ucc+1.954
ury+1.897
avour+1.827
ÃÂ+1.790
Neg logit contributions
peat-1.399
beit-1.215
wide-1.146
 primates-1.078
 elect-1.050
Tokenr
Token ID81
Feature activation-1.50e+00

Pos logit contributions
ucc+5.042
ury+4.870
avorite+4.771
uri+4.524
avour+4.393
Neg logit contributions
peat-5.788
beit-5.208
wide-4.917
 primates-4.852
 elect-4.808
Tokenatt
Token ID1078
Feature activation-3.79e-01

Pos logit contributions
ury+1.157
ucc+1.081
avorite+0.705
uri+0.518
avour+0.479
Neg logit contributions
peat-15.537
beit-14.581
wide-14.201
 primates-13.979
 numbered-13.840
Tokenling
Token ID1359
Feature activation-1.42e-01

Pos logit contributions
ucc+1.893
avorite+1.860
ury+1.808
avour+1.722
ÃÂ+1.662
Neg logit contributions
peat-2.446
beit-2.185
wide-2.135
 primates-2.061
 elect-2.024
Token and
Token ID290
Feature activation+8.00e-02

Pos logit contributions
avorite+0.870
ucc+0.858
ury+0.831
avour+0.799
ÃÂ+0.785
Neg logit contributions
peat-0.764
beit-0.668
wide-0.632
 primates-0.600
 elect-0.597
Token the
Token ID262
Feature activation+1.69e-01

Pos logit contributions
peat+0.430
beit+0.376
wide+0.360
 primates+0.341
 elect+0.337
Neg logit contributions
avorite-0.493
ucc-0.484
ury-0.469
avour-0.454
ÃÂ-0.446
Token annexation
Token ID41329
Feature activation-1.85e-02

Pos logit contributions
peat+0.911
beit+0.800
wide+0.764
 primates+0.723
 elect+0.716
Neg logit contributions
avorite-1.043
ucc-1.023
ury-0.992
avour-0.960
ÃÂ-0.942
Token c
Token ID269
Feature activation-7.17e-01

Pos logit contributions
peat+0.077
beit+0.067
wide+0.063
 primates+0.060
 elect+0.059
Neg logit contributions
avorite-0.108
ucc-0.107
ury-0.104
avour-0.101
ÃÂ-0.099
Tokenv
Token ID85
Feature activation-7.99e-01

Pos logit contributions
ucc+4.274
ury+4.135
avorite+4.078
avour+3.976
uri+3.814
Neg logit contributions
peat-3.841
beit-3.441
 primates-3.216
wide-3.179
 elect-3.155
Token2
Token ID17
Feature activation-3.22e-01

Pos logit contributions
ucc+2.240
ury+2.100
avorite+1.972
avour+1.900
uri+1.793
Neg logit contributions
peat-6.774
beit-6.288
 primates-6.067
wide-5.980
 elect-5.929
Token.
Token ID764
Feature activation-3.34e-01

Pos logit contributions
avorite+1.759
ucc+1.750
ury+1.693
avour+1.629
ÃÂ+1.563
Neg logit contributions
peat-1.920
beit-1.707
wide-1.633
 primates-1.578
 elect-1.557
Token Ga
Token ID12822
Feature activation-1.02e+00

Pos logit contributions
avorite+1.698
ucc+1.686
ury+1.619
avour+1.546
ÃÂ+1.471
Neg logit contributions
peat-2.128
beit-1.893
wide-1.797
 primates-1.736
 rats-1.714
Tokenussian
Token ID31562
Feature activation-1.50e+00

Pos logit contributions
ucc+3.073
ury+2.987
avorite+2.937
avour+2.722
uri+2.462
Neg logit contributions
peat-8.413
beit-7.724
 primates-7.428
wide-7.394
 elect-7.302
TokenBl
Token ID3629
Feature activation-1.20e+00

Pos logit contributions
ucc+3.282
ury+2.990
avorite+2.682
avour+2.621
Language+2.396
Neg logit contributions
peat-13.099
beit-12.041
 primates-11.817
 elect-11.639
 rats-11.521
Tokenur
Token ID333
Feature activation-4.45e-01

Pos logit contributions
ury+4.169
ucc+4.124
avorite+3.788
avour+3.657
uri+3.500
Neg logit contributions
peat-9.316
beit-8.617
 primates-8.439
wide-8.164
 rats-8.144
Token (
Token ID357
Feature activation-4.25e-01

Pos logit contributions
ucc+2.533
avorite+2.503
ury+2.458
avour+2.364
uri+2.249
Neg logit contributions
peat-2.519
beit-2.218
wide-2.108
 primates-2.074
 elect-2.025
Token gray
Token ID12768
Feature activation-7.45e-01

Pos logit contributions
ucc+2.569
avorite+2.568
ury+2.522
avour+2.411
uri+2.324
Neg logit contributions
peat-2.228
beit-1.933
wide-1.830
 primates-1.764
 elect-1.706
Token,
Token ID837
Feature activation+5.46e-03

Pos logit contributions
ucc+4.595
ury+4.468
avorite+4.391
avour+4.279
uri+4.137
Neg logit contributions
peat-3.786
beit-3.281
 primates-3.102
wide-3.062
 elect-2.986
Token wary
Token ID23300
Feature activation-6.64e-02

Pos logit contributions
avorite+2.948
ucc+2.901
ury+2.849
avour+2.725
ÃÂ+2.697
Neg logit contributions
peat-2.721
beit-2.385
wide-2.275
 primates-2.178
 elect-2.114
Token,
Token ID11
Feature activation-2.23e-01

Pos logit contributions
avorite+0.413
ucc+0.406
ury+0.396
avour+0.379
ÃÂ+0.377
Neg logit contributions
peat-0.351
beit-0.306
wide-0.290
 primates-0.274
 elect-0.271
Token to
Token ID284
Feature activation-4.92e-01

Pos logit contributions
avorite+1.403
ucc+1.373
ury+1.341
avour+1.293
ÃÂ+1.276
Neg logit contributions
peat-1.168
beit-1.006
wide-0.964
 primates-0.906
 elect-0.897
Token say
Token ID910
Feature activation-2.46e-01

Pos logit contributions
ucc+2.644
avorite+2.643
ury+2.523
avour+2.429
ÃÂ+2.383
Neg logit contributions
peat-2.964
beit-2.606
wide-2.553
 primates-2.403
 elect-2.358
Token the
Token ID262
Feature activation-9.26e-01

Pos logit contributions
avorite+1.481
ucc+1.457
ury+1.413
avour+1.362
ÃÂ+1.337
Neg logit contributions
peat-1.346
beit-1.173
wide-1.134
 primates-1.071
 elect-1.058
Token least
Token ID1551
Feature activation-1.48e+00

Pos logit contributions
ucc+5.076
ury+4.977
avorite+4.975
avour+4.730
uri+4.565
Neg logit contributions
peat-5.253
beit-4.647
wide-4.488
 primates-4.249
 elect-4.071
Token.
Token ID13
Feature activation-4.51e-01

Pos logit contributions
avorite+6.119
ucc+6.093
ury+6.006
ÃÂ+5.651
uri+5.488
Neg logit contributions
peat-9.722
beit-8.530
wide-8.202
 primates-7.893
 elect-7.717
Token
Token ID564
Feature activation-2.34e-01

Pos logit contributions
avorite+2.604
ucc+2.598
ury+2.517
avour+2.420
ÃÂ+2.366
Neg logit contributions
peat-2.555
beit-2.215
wide-2.107
 primates-2.029
 elect-1.998
Token
Token ID250
Feature activation-5.36e-01

Pos logit contributions
avorite+1.047
ucc+1.017
ury+0.975
avour+0.932
ÃÂ+0.912
Neg logit contributions
peat-1.645
beit-1.491
wide-1.448
 primates-1.392
 elect-1.385
TokenSome
Token ID4366
Feature activation+3.59e-01

Pos logit contributions
avorite+2.441
ucc+2.419
ury+2.320
avour+2.220
ÃÂ+2.117
Neg logit contributions
peat-3.684
beit-3.310
wide-3.184
 primates-3.102
 elect-3.082
Token implanted
Token ID40740
Feature activation+1.74e-01

Pos logit contributions
peat+1.661
beit+1.430
wide+1.379
 primates+1.306
 elect+1.305
Neg logit contributions
avorite-2.466
ucc-2.402
ury-2.341
avour-2.278
ÃÂ-2.258
Tokeni
Token ID72
Feature activation+4.55e-02

Pos logit contributions
avorite+1.780
ucc+1.776
ury+1.709
avour+1.652
uri+1.588
Neg logit contributions
peat-1.944
beit-1.733
wide-1.655
 primates-1.616
 elect-1.602
Token -
Token ID532
Feature activation+5.79e-02

Pos logit contributions
peat+0.243
beit+0.214
wide+0.203
 primates+0.196
 elect+0.191
Neg logit contributions
avorite-0.278
ucc-0.275
ury-0.266
avour-0.258
ÃÂ-0.252
Tokenle
Token ID293
Feature activation-1.88e-01

Pos logit contributions
peat+0.403
beit+0.365
wide+0.354
 primates+0.340
 elect+0.338
Neg logit contributions
avorite-0.265
ucc-0.257
ury-0.247
avour-0.236
ÃÂ-0.231
Token $
Token ID720
Feature activation-5.53e-01

Pos logit contributions
avorite+1.128
ucc+1.118
ury+1.087
avour+1.054
ÃÂ+1.027
Neg logit contributions
peat-1.013
beit-0.892
wide-0.849
 primates-0.816
 elect-0.800
Tokenart
Token ID433
Feature activation-5.65e-01

Pos logit contributions
avorite+3.031
ucc+3.014
ury+2.908
avour+2.802
uri+2.700
Neg logit contributions
peat-3.262
beit-2.883
wide-2.766
 primates-2.716
 elect-2.694
Tokenext
Token ID2302
Feature activation-1.46e+00

Pos logit contributions
ucc+2.011
avorite+1.951
ury+1.892
avour+1.803
uri+1.685
Neg logit contributions
peat-4.374
beit-4.016
wide-3.896
 primates-3.877
 elect-3.800
Token.
Token ID13
Feature activation-4.30e-01

Pos logit contributions
ucc+8.388
avorite+8.169
ury+8.154
avour+7.812
uri+7.640
Neg logit contributions
peat-7.216
beit-6.378
 primates-5.936
wide-5.721
 elect-5.671
TokenCount
Token ID12332
Feature activation-2.95e-01

Pos logit contributions
avorite+1.935
ucc+1.912
ury+1.830
avour+1.758
ÃÂ+1.675
Neg logit contributions
peat-2.968
beit-2.696
wide-2.575
 primates-2.536
 elect-2.511
Token ;
Token ID2162
Feature activation+2.52e-01

Pos logit contributions
ucc+1.811
avorite+1.793
ury+1.758
avour+1.682
ÃÂ+1.646
Neg logit contributions
peat-1.540
beit-1.367
wide-1.263
 primates-1.227
 elect-1.208
Token $
Token ID720
Feature activation-3.05e-01

Pos logit contributions
peat+1.287
beit+1.124
wide+1.074
 primates+1.018
 elect+1.004
Neg logit contributions
avorite-1.610
ucc-1.585
ury-1.537
avour-1.488
ÃÂ-1.471
Tokeni
Token ID72
Feature activation-1.94e-01

Pos logit contributions
avorite+1.663
ucc+1.658
ury+1.596
avour+1.534
uri+1.472
Neg logit contributions
peat-1.799
beit-1.611
wide-1.544
 primates-1.508
 elect-1.493
Token2
Token ID17
Feature activation-3.94e-01

Pos logit contributions
ucc+2.468
ury+2.296
avorite+2.097
avour+2.075
uri+1.974
Neg logit contributions
peat-7.729
beit-7.176
 primates-6.937
wide-6.800
 elect-6.771
Token.
Token ID13
Feature activation-5.16e-01

Pos logit contributions
ucc+2.137
avorite+2.136
ury+2.068
avour+1.987
ÃÂ+1.896
Neg logit contributions
peat-2.349
beit-2.091
wide-1.994
 primates-1.937
 elect-1.910
Tokenim
Token ID320
Feature activation+1.35e-01

Pos logit contributions
avorite+1.763
ucc+1.721
ury+1.624
avour+1.530
ÃÂ+1.448
Neg logit contributions
peat-4.159
beit-3.826
wide-3.714
 primates-3.614
 elect-3.594
Tokenread
Token ID961
Feature activation-3.01e-01

Pos logit contributions
peat+0.592
beit+0.500
wide+0.482
 elect+0.438
 primates+0.434
Neg logit contributions
avorite-0.977
ucc-0.953
ury-0.925
ÃÂ-0.906
avour-0.902
Token(
Token ID7
Feature activation-5.86e-01

Pos logit contributions
avorite+1.668
ucc+1.654
ury+1.599
avour+1.543
ÃÂ+1.491
Neg logit contributions
peat-1.775
beit-1.575
wide-1.508
 primates-1.458
 elect-1.441
Tokenimage
Token ID9060
Feature activation-1.45e+00

Pos logit contributions
ucc+2.985
avorite+2.958
ury+2.882
avour+2.763
uri+2.665
Neg logit contributions
peat-3.667
beit-3.296
wide-3.115
 primates-3.105
 elect-3.045
TokenPath
Token ID15235
Feature activation-1.55e+00

Pos logit contributions
ucc+1.984
ury+1.710
avour+1.528
Distance+1.389
avorite+1.368
Neg logit contributions
peat-13.742
beit-12.958
 primates-12.562
 elect-12.417
 rats-12.315
Token)
Token ID8
Feature activation-2.75e-01

Pos logit contributions
ucc+8.788
ury+8.306
uri+7.938
avour+7.897
Distance+7.604
Neg logit contributions
peat-8.066
beit-7.010
 primates-6.855
 warr-6.743
 elect-6.471
Token image
Token ID2939
Feature activation-1.12e+00

Pos logit contributions
avorite+1.642
ucc+1.616
ury+1.568
avour+1.517
ÃÂ+1.479
Neg logit contributions
peat-1.523
beit-1.340
wide-1.280
 primates-1.221
 elect-1.203
Token =
Token ID796
Feature activation-3.17e-01

Pos logit contributions
ucc+4.178
ury+4.028
avorite+3.805
avour+3.660
uri+3.654
Neg logit contributions
peat-8.299
beit-7.539
 primates-7.257
 elect-7.186
wide-7.158
Token im
Token ID545
Feature activation-6.42e-01

Pos logit contributions
ucc+1.652
avorite+1.645
ury+1.610
avour+1.547
uri+1.480
Neg logit contributions
peat-1.923
beit-1.716
wide-1.645
 primates-1.555
 elect-1.528
Token a
Token ID257
Feature activation-5.38e-01

Pos logit contributions
peat+0.098
beit+0.084
wide+0.080
 primates+0.075
 elect+0.074
Neg logit contributions
avorite-0.139
ucc-0.138
ury-0.135
avour-0.130
ÃÂ-0.128
Tokenan
Token ID272
Feature activation-3.16e-01

Pos logit contributions
ucc+2.462
avorite+2.435
ury+2.363
avour+2.263
uri+2.126
Neg logit contributions
peat-3.686
beit-3.327
wide-3.190
 primates-3.101
 elect-3.051
Token Z
Token ID1168
Feature activation-6.89e-01

Pos logit contributions
ucc+1.971
avorite+1.970
ury+1.919
avour+1.856
ÃÂ+1.814
Neg logit contributions
peat-1.640
beit-1.398
wide-1.339
 primates-1.277
 elect-1.232
Tokenee
Token ID1453
Feature activation-4.82e-01

Pos logit contributions
ucc+2.690
avorite+2.688
ury+2.524
avour+2.392
uri+2.231
Neg logit contributions
peat-5.207
beit-4.755
wide-4.629
 primates-4.506
 elect-4.426
Token"]
Token ID8973
Feature activation+3.04e-01

Pos logit contributions
ucc+2.720
avorite+2.696
ury+2.639
avour+2.547
ÃÂ+2.429
Neg logit contributions
peat-2.794
beit-2.420
wide-2.332
 primates-2.252
 elect-2.205
Token [
Token ID685
Feature activation-1.43e+00

Pos logit contributions
peat+1.368
beit+1.204
wide+1.155
 elect+1.087
 primates+1.065
Neg logit contributions
avorite-2.177
ucc-2.047
ury-1.988
ÃÂ-1.984
avour-1.936
TokenDate
Token ID10430
Feature activation-6.79e-01

Pos logit contributions
ucc+6.270
ury+6.222
avorite+6.120
avour+5.969
Distance+5.629
Neg logit contributions
peat-9.553
beit-8.499
 primates-8.116
wide-7.925
 candid-7.924
Token "
Token ID366
Feature activation-8.23e-01

Pos logit contributions
ucc+4.047
avorite+3.995
ury+3.953
avour+3.839
uri+3.679
Neg logit contributions
peat-3.658
beit-3.149
wide-2.970
 primates-2.908
 elect-2.798
Token2016
Token ID5304
Feature activation-2.11e-01

Pos logit contributions
ucc+3.811
ury+3.600
avorite+3.527
avour+3.397
uri+3.245
Neg logit contributions
peat-5.559
beit-4.932
 primates-4.713
wide-4.668
 candid-4.611
Token.
Token ID13
Feature activation-2.50e-01

Pos logit contributions
avorite+1.125
ucc+1.123
ury+1.093
avour+1.047
ÃÂ+1.003
Neg logit contributions
peat-1.282
beit-1.136
wide-1.095
 primates-1.061
 elect-1.041
Token01
Token ID486
Feature activation-5.65e-01

Pos logit contributions
avorite+1.319
ucc+1.319
ury+1.269
avour+1.219
ÃÂ+1.167
Neg logit contributions
peat-1.542
beit-1.369
wide-1.316
 primates-1.279
 elect-1.265
Token It
Token ID632
Feature activation-3.56e-01

Pos logit contributions
avorite+2.571
ucc+2.552
ury+2.464
avour+2.376
ÃÂ+2.291
Neg logit contributions
peat-2.869
beit-2.533
wide-2.423
 primates-2.347
 elect-2.314
Token's
Token ID338
Feature activation-3.96e-01

Pos logit contributions
avorite+1.954
ucc+1.911
ury+1.853
avour+1.785
ÃÂ+1.742
Neg logit contributions
peat-2.144
beit-1.898
wide-1.822
 primates-1.744
 elect-1.717
Token basically
Token ID6209
Feature activation+2.03e-01

Pos logit contributions
avorite+2.543
ucc+2.482
ury+2.421
avour+2.353
ÃÂ+2.278
Neg logit contributions
peat-1.994
beit-1.711
wide-1.607
 primates-1.524
 elect-1.488
Token a
Token ID257
Feature activation-1.77e-02

Pos logit contributions
peat+0.970
beit+0.838
wide+0.802
 primates+0.753
 elect+0.748
Neg logit contributions
avorite-1.371
ucc-1.342
ury-1.305
avour-1.269
ÃÂ-1.254
Token Hip
Token ID29437
Feature activation-3.02e-01

Pos logit contributions
avorite+0.114
ucc+0.111
ury+0.108
avour+0.105
ÃÂ+0.103
Neg logit contributions
peat-0.090
beit-0.079
wide-0.075
 primates-0.071
 elect-0.070
Tokenst
Token ID301
Feature activation-1.42e+00

Pos logit contributions
avorite+1.740
ucc+1.717
ury+1.669
avour+1.603
ÃÂ+1.560
Neg logit contributions
peat-1.712
beit-1.534
wide-1.450
 primates-1.394
 elect-1.378
Tokenam
Token ID321
Feature activation+1.56e-01

Pos logit contributions
ucc+6.926
avorite+6.642
ury+6.597
avour+6.266
uri+5.944
Neg logit contributions
peat-8.765
beit-8.177
wide-7.782
 primates-7.587
 rats-7.208
Tokenatic
Token ID1512
Feature activation-1.61e-01

Pos logit contributions
peat+0.747
beit+0.644
wide+0.616
 primates+0.578
 elect+0.574
Neg logit contributions
avorite-1.051
ucc-1.029
ury-1.000
avour-0.973
ÃÂ-0.959
Token for
Token ID329
Feature activation+9.38e-02

Pos logit contributions
avorite+0.957
ucc+0.939
ury+0.914
avour+0.880
ÃÂ+0.861
Neg logit contributions
peat-0.891
beit-0.788
wide-0.751
 primates-0.717
 elect-0.705
Token Android
Token ID5565
Feature activation-3.64e-01

Pos logit contributions
peat+0.508
beit+0.447
wide+0.431
 primates+0.408
 elect+0.406
Neg logit contributions
avorite-0.575
ucc-0.561
ury-0.543
avour-0.527
ÃÂ-0.520
Token.
Token ID13
Feature activation-4.07e-01

Pos logit contributions
avorite+2.162
ucc+2.158
ury+2.096
avour+2.021
ÃÂ+1.938
Neg logit contributions
peat-1.990
beit-1.760
wide-1.646
 primates-1.586
 elect-1.556
Token guise
Token ID34731
Feature activation-1.18e-01

Pos logit contributions
avorite+0.922
ucc+0.902
ury+0.877
avour+0.848
ÃÂ+0.826
Neg logit contributions
peat-0.863
beit-0.760
wide-0.728
 primates-0.690
 elect-0.684
Token of
Token ID286
Feature activation-1.39e-01

Pos logit contributions
avorite+0.757
ucc+0.747
ury+0.739
avour+0.710
ÃÂ+0.685
Neg logit contributions
peat-0.598
beit-0.519
wide-0.485
 primates-0.464
 elect-0.449
Token Drew
Token ID18624
Feature activation-4.10e-01

Pos logit contributions
avorite+0.851
ucc+0.833
ury+0.811
avour+0.787
ÃÂ+0.763
Neg logit contributions
peat-0.757
beit-0.663
wide-0.630
 primates-0.596
 elect-0.590
Token,
Token ID11
Feature activation-5.29e-01

Pos logit contributions
avorite+2.253
ucc+2.213
ury+2.170
avour+2.081
ÃÂ+1.984
Neg logit contributions
peat-2.443
beit-2.191
wide-2.082
 primates-1.995
 elect-1.953
Token Te
Token ID1665
Feature activation-3.44e-01

Pos logit contributions
avorite+3.186
ucc+3.114
ury+3.071
avour+2.950
ÃÂ+2.843
Neg logit contributions
peat-2.898
beit-2.527
wide-2.413
 primates-2.277
 elect-2.252
Tokeness
Token ID408
Feature activation-1.41e+00

Pos logit contributions
avorite+1.817
ucc+1.783
ury+1.720
avour+1.659
ÃÂ+1.604
Neg logit contributions
peat-2.139
beit-1.920
wide-1.858
 primates-1.778
 elect-1.761
Tokenide
Token ID485
Feature activation-3.66e-01

Pos logit contributions
ucc+5.336
avorite+5.231
ury+5.129
avour+4.710
uri+4.549
Neg logit contributions
peat-10.698
beit-9.485
wide-9.080
 primates-8.892
 elect-8.747
Token Crown
Token ID12223
Feature activation-3.87e-03

Pos logit contributions
avorite+1.947
ucc+1.902
ury+1.854
avour+1.778
ÃÂ+1.720
Neg logit contributions
peat-2.269
beit-2.029
wide-1.942
 primates-1.859
 elect-1.844
Token Court
Token ID3078
Feature activation-2.23e-01

Pos logit contributions
avorite+0.018
ucc+0.018
ury+0.017
avour+0.016
ÃÂ+0.016
Neg logit contributions
peat-0.027
beit-0.024
wide-0.023
 primates-0.022
 elect-0.022
Token was
Token ID373
Feature activation-1.87e-01

Pos logit contributions
avorite+1.337
ucc+1.310
ury+1.282
avour+1.228
ÃÂ+1.203
Neg logit contributions
peat-1.234
beit-1.082
wide-1.037
 primates-0.986
 elect-0.974
Token told
Token ID1297
Feature activation-6.29e-01

Pos logit contributions
avorite+1.171
ucc+1.147
ury+1.114
avour+1.080
ÃÂ+1.062
Neg logit contributions
peat-0.989
beit-0.865
wide-0.830
 primates-0.784
 elect-0.777
TokenI
Token ID40
Feature activation-4.15e-01

Pos logit contributions
avorite+2.011
ucc+2.000
ury+1.928
avour+1.829
ÃÂ+1.733
Neg logit contributions
peat-3.301
beit-3.010
wide-2.882
 primates-2.818
 elect-2.810
Token'll
Token ID1183
Feature activation-4.97e-01

Pos logit contributions
avorite+2.331
ucc+2.286
ury+2.219
avour+2.143
ÃÂ+2.083
Neg logit contributions
peat-2.435
beit-2.166
wide-2.073
 primates-1.983
 elect-1.958
Token Feel
Token ID18571
Feature activation-1.20e-01

Pos logit contributions
avorite+2.833
ucc+2.801
ury+2.708
avour+2.617
ÃÂ+2.527
Neg logit contributions
peat-2.847
beit-2.530
wide-2.420
 primates-2.304
 elect-2.258
Token a
Token ID257
Feature activation-1.47e-01

Pos logit contributions
avorite+0.649
ucc+0.635
ury+0.613
avour+0.592
ÃÂ+0.579
Neg logit contributions
peat-0.728
beit-0.650
wide-0.626
 primates-0.598
 elect-0.594
Token Whole
Token ID23431
Feature activation-3.47e-01

Pos logit contributions
avorite+0.764
ucc+0.744
ury+0.718
avour+0.691
ÃÂ+0.677
Neg logit contributions
peat-0.928
beit-0.832
wide-0.803
 primates-0.769
 elect-0.765
Token Lot
Token ID15099
Feature activation-1.41e+00

Pos logit contributions
avorite+1.996
ucc+1.965
ury+1.911
avour+1.839
ÃÂ+1.789
Neg logit contributions
peat-1.973
beit-1.743
wide-1.668
 primates-1.614
 elect-1.596
Token Better
Token ID11625
Feature activation-2.80e-01

Pos logit contributions
ucc+6.184
ury+5.854
avorite+5.744
avour+5.535
uri+5.226
Neg logit contributions
peat-9.193
beit-8.321
 primates-8.058
wide-7.897
 elect-7.682
Token"
Token ID1
Feature activation-3.78e-01

Pos logit contributions
avorite+1.491
ucc+1.472
ury+1.427
avour+1.364
ÃÂ+1.336
Neg logit contributions
peat-1.719
beit-1.528
wide-1.472
 primates-1.418
 elect-1.400
Token (
Token ID357
Feature activation-6.21e-01

Pos logit contributions
avorite+2.141
ucc+2.104
ury+2.048
avour+1.969
ÃÂ+1.922
Neg logit contributions
peat-2.179
beit-1.937
wide-1.846
 primates-1.761
 elect-1.744
TokenB
Token ID33
Feature activation-2.38e-01

Pos logit contributions
ucc+2.915
avorite+2.914
ury+2.810
avour+2.657
ÃÂ+2.559
Neg logit contributions
peat-4.092
beit-3.690
wide-3.504
 primates-3.468
 elect-3.461
Tokenyr
Token ID2417
Feature activation+3.37e-01

Pos logit contributions
avorite+1.284
ucc+1.283
ury+1.243
avour+1.177
ÃÂ+1.140
Neg logit contributions
peat-1.457
beit-1.293
wide-1.250
 primates-1.200
 elect-1.190
Token,
Token ID11
Feature activation+2.51e-01

Pos logit contributions
avorite+0.499
ucc+0.494
ury+0.481
avour+0.460
ÃÂ+0.452
Neg logit contributions
peat-0.525
beit-0.465
wide-0.448
 primates-0.428
 elect-0.422
Token _
Token ID4808
Feature activation-4.51e-01

Pos logit contributions
peat+1.484
beit+1.322
wide+1.279
 primates+1.213
 elect+1.210
Neg logit contributions
avorite-1.418
ucc-1.372
ury-1.327
avour-1.285
ÃÂ-1.267
TokenP
Token ID47
Feature activation-3.15e-01

Pos logit contributions
avorite+1.714
ucc+1.673
ury+1.622
avour+1.482
uri+1.407
Neg logit contributions
peat-3.388
beit-3.124
wide-3.013
 primates-2.995
 candid-2.949
Tokenulse
Token ID9615
Feature activation-7.64e-01

Pos logit contributions
avorite+1.087
ucc+1.055
ury+1.001
avour+0.939
ÃÂ+0.889
Neg logit contributions
peat-2.537
beit-2.332
wide-2.271
 primates-2.204
 elect-2.188
TokenPer
Token ID5990
Feature activation+9.30e-02

Pos logit contributions
ucc+2.377
avorite+2.296
ury+2.288
avour+2.071
uri+1.914
Neg logit contributions
peat-6.210
beit-5.746
wide-5.544
 primates-5.529
 elect-5.459
Tokeniod
Token ID2101
Feature activation-1.41e+00

Pos logit contributions
peat+0.555
beit+0.492
wide+0.475
 primates+0.446
 elect+0.444
Neg logit contributions
avorite-0.532
ucc-0.508
ury-0.488
ÃÂ-0.481
avour-0.476
Token]
Token ID60
Feature activation-7.15e-01

Pos logit contributions
ucc+6.584
ury+6.507
avorite+6.435
avour+5.945
ÃÂ+5.914
Neg logit contributions
peat-8.583
beit-7.814
wide-7.241
 primates-7.037
 elect-6.957
Token to
Token ID284
Feature activation-5.05e-01

Pos logit contributions
avorite+4.030
ucc+4.002
ury+3.890
avour+3.744
ÃÂ+3.649
Neg logit contributions
peat-4.068
beit-3.657
wide-3.434
 primates-3.300
 elect-3.246
Token to
Token ID284
Feature activation-5.36e-02

Pos logit contributions
avorite+3.063
ucc+3.035
ury+2.935
avour+2.854
ÃÂ+2.752
Neg logit contributions
peat-2.708
beit-2.386
wide-2.269
 primates-2.139
 elect-2.091
Token rem
Token ID816
Feature activation-1.08e-01

Pos logit contributions
avorite+0.336
ucc+0.326
ury+0.317
avour+0.306
ÃÂ+0.304
Neg logit contributions
peat-0.284
beit-0.249
wide-0.240
 elect-0.227
 primates-0.227
Tokenap
Token ID499
Feature activation+6.82e-02

Pos logit contributions
avorite+0.592
ucc+0.578
ury+0.560
avour+0.539
ÃÂ+0.526
Neg logit contributions
peat-0.653
beit-0.587
wide-0.562
 primates-0.540
 elect-0.533
Token with
Token ID351
Feature activation-7.08e-02

Pos logit contributions
ucc+4.511
ury+4.377
avorite+4.366
avour+4.191
ÃÂ+4.014
Neg logit contributions
peat-4.086
beit-3.560
wide-3.319
 primates-3.248
 elect-3.178
Token a
Token ID257
Feature activation-6.64e-02

Pos logit contributions
avorite+0.452
ucc+0.443
ury+0.429
avour+0.417
ÃÂ+0.410
Neg logit contributions
peat-0.364
beit-0.317
wide-0.302
 primates-0.287
 elect-0.282
Token few
Token ID1178
Feature activation+9.04e-03

Pos logit contributions
avorite+0.446
ucc+0.437
ury+0.425
avour+0.413
ÃÂ+0.407
Neg logit contributions
peat-0.320
beit-0.276
wide-0.264
 primates-0.248
 elect-0.246
Token nasty
Token ID17166
Feature activation+1.16e-01

Pos logit contributions
peat+0.045
beit+0.039
wide+0.037
 primates+0.035
 elect+0.035
Neg logit contributions
avorite-0.060
ucc-0.058
ury-0.057
avour-0.055
ÃÂ-0.054
Token head
Token ID1182
Feature activation-3.04e-02

Pos logit contributions
peat+0.606
beit+0.530
wide+0.512
 primates+0.484
 elect+0.482
Neg logit contributions
avorite-0.733
ucc-0.715
ury-0.693
avour-0.674
ÃÂ-0.665
Token f
Token ID277
Feature activation-1.40e+00

Pos logit contributions
avorite+0.194
ucc+0.193
ury+0.186
avour+0.181
ÃÂ+0.177
Neg logit contributions
peat-0.156
beit-0.135
wide-0.128
 primates-0.121
 elect-0.120
Tokenakes
Token ID1124
Feature activation+4.16e-01

Pos logit contributions
ucc+6.234
ury+5.977
avour+5.961
avorite+5.939
uri+5.511
Neg logit contributions
peat-9.225
beit-8.368
 primates-7.830
wide-7.636
 elect-7.552
Token and
Token ID290
Feature activation+2.84e-01

Pos logit contributions
peat+2.176
beit+1.902
wide+1.856
 elect+1.740
 primates+1.732
Neg logit contributions
avorite-2.627
ucc-2.556
ury-2.451
avour-2.405
ÃÂ-2.374
Token shoulder
Token ID8163
Feature activation+3.19e-01

Pos logit contributions
peat+1.403
beit+1.218
wide+1.171
 primates+1.101
 elect+1.096
Neg logit contributions
avorite-1.870
ucc-1.827
ury-1.777
avour-1.728
ÃÂ-1.703
Token dips
Token ID48904
Feature activation+1.74e-01

Pos logit contributions
peat+1.689
beit+1.484
wide+1.430
 primates+1.348
 elect+1.346
Neg logit contributions
avorite-1.992
ucc-1.938
ury-1.883
avour-1.827
ÃÂ-1.802
Token,
Token ID11
Feature activation-1.52e-01

Pos logit contributions
peat+0.926
beit+0.812
wide+0.784
 primates+0.739
 elect+0.737
Neg logit contributions
avorite-1.079
ucc-1.051
ury-1.016
avour-0.990
ÃÂ-0.974
Token mass
Token ID2347
Feature activation-1.65e-01

Pos logit contributions
avorite+1.619
ucc+1.603
ury+1.566
avour+1.513
ÃÂ+1.472
Neg logit contributions
peat-1.112
beit-0.964
wide-0.905
 primates-0.851
 elect-0.848
Token.
Token ID13
Feature activation-2.87e-01

Pos logit contributions
avorite+0.955
ucc+0.936
ury+0.909
avour+0.876
ÃÂ+0.857
Neg logit contributions
peat-0.940
beit-0.834
wide-0.799
 primates-0.760
 elect-0.755
Token It
Token ID632
Feature activation-1.93e-01

Pos logit contributions
avorite+1.634
ucc+1.599
ury+1.549
avour+1.496
ÃÂ+1.466
Neg logit contributions
peat-1.676
beit-1.485
wide-1.430
 primates-1.364
 elect-1.355
Token is
Token ID318
Feature activation-1.19e-01

Pos logit contributions
avorite+1.148
ucc+1.124
ury+1.097
avour+1.059
ÃÂ+1.037
Neg logit contributions
peat-1.069
beit-0.938
wide-0.903
 primates-0.858
 elect-0.848
Token,
Token ID11
Feature activation-5.20e-02

Pos logit contributions
avorite+0.777
ucc+0.760
ury+0.738
avour+0.717
ÃÂ+0.707
Neg logit contributions
peat-0.600
beit-0.522
wide-0.500
 primates-0.471
 elect-0.468
Token then
Token ID788
Feature activation-1.40e+00

Pos logit contributions
avorite+0.348
ucc+0.340
ury+0.333
avour+0.323
ÃÂ+0.316
Neg logit contributions
peat-0.250
beit-0.215
wide-0.205
 primates-0.192
 elect-0.189
Token,
Token ID11
Feature activation-7.80e-01

Pos logit contributions
ury+6.232
ucc+6.119
avour+5.807
uri+5.647
avorite+5.630
Neg logit contributions
peat-8.685
beit-7.792
 primates-7.360
 elect-7.226
wide-7.169
Token not
Token ID407
Feature activation-1.91e-01

Pos logit contributions
avorite+4.817
ucc+4.705
ury+4.668
avour+4.479
uri+4.321
Neg logit contributions
peat-3.962
beit-3.465
wide-3.255
 primates-3.087
 elect-3.019
Token at
Token ID379
Feature activation+2.51e-01

Pos logit contributions
avorite+1.257
ucc+1.233
ury+1.199
avour+1.161
ÃÂ+1.140
Neg logit contributions
peat-0.940
beit-0.815
wide-0.776
 primates-0.731
 elect-0.724
Token all
Token ID477
Feature activation-3.79e-01

Pos logit contributions
peat+1.337
beit+1.176
wide+1.132
 primates+1.070
 elect+1.065
Neg logit contributions
avorite-1.560
ucc-1.519
ury-1.477
avour-1.428
ÃÂ-1.412
Token unlikely
Token ID7485
Feature activation+2.82e-01

Pos logit contributions
avorite+2.485
ucc+2.454
ury+2.391
avour+2.314
ÃÂ+2.232
Neg logit contributions
peat-1.837
beit-1.604
wide-1.502
 primates-1.411
 elect-1.392
Token girls
Token ID4813
Feature activation+3.99e-02

Pos logit contributions
avorite+4.409
ucc+4.351
ury+4.291
avour+4.128
ÃÂ+3.982
Neg logit contributions
peat-3.519
beit-3.090
wide-2.873
 primates-2.670
 elect-2.659
Token up
Token ID510
Feature activation+3.01e-01

Pos logit contributions
peat+0.217
beit+0.191
wide+0.186
 primates+0.176
 elect+0.175
Neg logit contributions
avorite-0.243
ucc-0.237
ury-0.228
avour-0.222
ÃÂ-0.220
Token to
Token ID284
Feature activation+1.59e-01

Pos logit contributions
peat+1.408
beit+1.212
wide+1.169
 elect+1.094
 primates+1.091
Neg logit contributions
avorite-2.067
ucc-2.008
ury-1.942
avour-1.897
ÃÂ-1.871
Token for
Token ID329
Feature activation+2.94e-01

Pos logit contributions
peat+0.825
beit+0.717
wide+0.691
 elect+0.657
 primates+0.657
Neg logit contributions
avorite-1.020
ucc-0.994
ury-0.962
avour-0.935
ÃÂ-0.928
Token prom
Token ID1552
Feature activation-5.97e-02

Pos logit contributions
peat+1.852
beit+1.649
wide+1.605
 primates+1.531
 elect+1.531
Neg logit contributions
avorite-1.543
ucc-1.498
ury-1.429
avour-1.375
ÃÂ-1.370
Tokenisc
Token ID2304
Feature activation-1.39e+00

Pos logit contributions
avorite+0.298
ucc+0.291
ury+0.280
avour+0.269
ÃÂ+0.264
Neg logit contributions
peat-0.391
beit-0.352
wide-0.341
 primates-0.326
 elect-0.324
Tokenuous
Token ID5623
Feature activation-2.44e-01

Pos logit contributions
ury+1.972
ucc+1.955
avorite+1.924
avour+1.533
uri+1.213
Neg logit contributions
peat-13.278
beit-12.620
wide-12.112
 primates-12.011
 candid-11.785
Token behaviour
Token ID9172
Feature activation+1.92e-01

Pos logit contributions
avorite+1.344
ucc+1.310
ury+1.276
avour+1.227
ÃÂ+1.202
Neg logit contributions
peat-1.467
beit-1.309
wide-1.257
 primates-1.196
 elect-1.189
Token.
Token ID13
Feature activation-1.17e-01

Pos logit contributions
peat+1.031
beit+0.902
wide+0.876
 primates+0.830
 elect+0.823
Neg logit contributions
avorite-1.188
ucc-1.159
ury-1.113
avour-1.085
ÃÂ-1.071
Token
Token ID447
Feature activation+1.24e-01

Pos logit contributions
avorite+0.715
ucc+0.699
ury+0.678
avour+0.657
ÃÂ+0.647
Neg logit contributions
peat-0.630
beit-0.554
wide-0.532
 primates-0.504
 elect-0.500
Token
Token ID251
Feature activation-3.14e-01

Pos logit contributions
peat+0.875
beit+0.792
wide+0.773
 primates+0.738
 elect+0.736
Neg logit contributions
avorite-0.555
ucc-0.539
ury-0.517
avour-0.491
ÃÂ-0.486
Tokens
Token ID82
Feature activation-4.26e-01

Pos logit contributions
avorite+2.317
ucc+2.307
ury+2.247
avour+2.182
ÃÂ+2.113
Neg logit contributions
peat-1.653
beit-1.422
wide-1.339
 primates-1.291
 elect-1.273
TokenFormat
Token ID26227
Feature activation-1.71e-03

Pos logit contributions
ucc+1.531
avorite+1.516
ury+1.446
avour+1.358
ÃÂ+1.259
Neg logit contributions
peat-3.294
beit-3.032
wide-2.936
 primates-2.886
 elect-2.862
Token.
Token ID13
Feature activation-3.31e-01

Pos logit contributions
avorite+0.010
ucc+0.010
ury+0.010
ÃÂ+0.009
avour+0.009
Neg logit contributions
peat-0.010
beit-0.009
wide-0.008
 elect-0.008
 primates-0.008
TokenLine
Token ID13949
Feature activation-5.12e-01

Pos logit contributions
avorite+1.308
ucc+1.299
ury+1.239
avour+1.173
ÃÂ+1.107
Neg logit contributions
peat-2.466
beit-2.256
wide-2.156
 primates-2.137
 elect-2.116
TokenAl
Token ID2348
Feature activation-6.56e-01

Pos logit contributions
ucc+2.149
avorite+2.132
ury+2.046
avour+1.943
uri+1.840
Neg logit contributions
peat-3.640
beit-3.307
 primates-3.145
wide-3.133
 elect-3.095
Tokenignment
Token ID16747
Feature activation-1.39e+00

Pos logit contributions
ucc+3.759
ury+3.693
avorite+3.686
avour+3.571
uri+3.351
Neg logit contributions
peat-3.630
beit-3.157
wide-3.037
 primates-3.008
 elect-2.970
Token =
Token ID796
Feature activation+1.06e-01

Pos logit contributions
ucc+4.870
ury+4.674
uri+4.194
avour+4.185
avorite+4.127
Neg logit contributions
peat-10.204
beit-9.289
 primates-9.098
 elect-8.932
wide-8.827
Token [
Token ID685
Feature activation-3.03e-01

Pos logit contributions
peat+0.538
beit+0.469
wide+0.453
 primates+0.425
 elect+0.424
Neg logit contributions
avorite-0.692
ucc-0.669
ury-0.650
avour-0.633
ÃÂ-0.630
Tokensystem
Token ID10057
Feature activation-1.95e-01

Pos logit contributions
ucc+1.636
avorite+1.635
ury+1.576
avour+1.514
ÃÂ+1.446
Neg logit contributions
peat-1.813
beit-1.620
wide-1.543
 primates-1.506
 elect-1.482
Token.
Token ID13
Feature activation-7.85e-02

Pos logit contributions
avorite+1.093
ucc+1.077
ury+1.042
avour+1.005
ÃÂ+0.977
Neg logit contributions
peat-1.143
beit-1.016
wide-0.975
 primates-0.935
 elect-0.927
Tokendraw
Token ID19334
Feature activation+1.69e-01

Pos logit contributions
avorite+0.474
ucc+0.464
ury+0.449
avour+0.435
ÃÂ+0.428
Neg logit contributions
peat-0.431
beit-0.380
wide-0.365
 primates-0.346
 elect-0.344
Tokenle
Token ID293
Feature activation-3.92e-01

Pos logit contributions
avorite+2.403
ucc+2.371
ury+2.278
avour+2.182
ÃÂ+2.120
Neg logit contributions
peat-3.610
beit-3.248
wide-3.160
 primates-3.041
 elect-3.023
Token Benn
Token ID14025
Feature activation-6.98e-01

Pos logit contributions
avorite+1.458
ucc+1.376
ury+1.296
avour+1.240
ÃÂ+1.227
Neg logit contributions
peat-3.085
beit-2.825
wide-2.755
 primates-2.670
 elect-2.665
Tokenati
Token ID7246
Feature activation+2.81e-03

Pos logit contributions
ucc+2.583
avorite+2.383
ury+2.349
avour+2.253
uri+2.101
Neg logit contributions
peat-5.511
beit-4.982
wide-4.801
 primates-4.752
 elect-4.651
Token (
Token ID357
Feature activation-5.87e-01

Pos logit contributions
peat+0.016
beit+0.014
wide+0.014
 primates+0.013
 elect+0.013
Neg logit contributions
ucc-0.016
avorite-0.016
ury-0.015
avour-0.015
ÃÂ-0.014
TokenRadio
Token ID26093
Feature activation-4.99e-01

Pos logit contributions
ucc+2.409
avorite+2.333
ury+2.280
avour+2.238
uri+2.030
Neg logit contributions
peat-4.258
beit-3.873
wide-3.680
 primates-3.667
 elect-3.623
TokenSh
Token ID2484
Feature activation-1.38e+00

Pos logit contributions
ucc+1.955
avorite+1.853
ury+1.803
avour+1.763
ÃÂ+1.627
Neg logit contributions
peat-3.755
beit-3.332
wide-3.252
 primates-3.185
 elect-3.176
Tokenack
Token ID441
Feature activation-4.93e-01

Pos logit contributions
ucc+5.905
ury+5.792
avorite+5.578
avour+5.325
uri+5.258
Neg logit contributions
peat-9.486
 primates-8.407
wide-8.268
beit-8.179
 numbered-8.075
Token -
Token ID532
Feature activation-4.11e-01

Pos logit contributions
ucc+1.440
avorite+1.387
ury+1.341
avour+1.278
ÃÂ+1.145
Neg logit contributions
peat-4.149
beit-3.827
wide-3.669
 primates-3.639
 elect-3.574
Token Nissan
Token ID28190
Feature activation-4.46e-01

Pos logit contributions
avorite+2.079
ucc+2.050
ury+1.971
avour+1.926
ÃÂ+1.851
Neg logit contributions
peat-2.633
beit-2.353
wide-2.259
 primates-2.182
 elect-2.150
Token)
Token ID8
Feature activation+2.89e-01

Pos logit contributions
ucc+2.704
ury+2.636
avorite+2.634
avour+2.578
ÃÂ+2.429
Neg logit contributions
peat-2.344
beit-2.019
wide-1.914
 primates-1.854
 candid-1.777
Token s
Token ID264
Feature activation-4.40e-02

Pos logit contributions
peat+1.435
beit+1.241
wide+1.186
 primates+1.116
 elect+1.098
Neg logit contributions
avorite-1.900
ucc-1.861
ury-1.809
avour-1.768
ÃÂ-1.738
TokenSo
Token ID2396
Feature activation-1.17e-02

Pos logit contributions
avorite+0.954
ucc+0.929
ury+0.891
avour+0.855
ÃÂ+0.833
Neg logit contributions
peat-1.413
beit-1.278
wide-1.239
 primates-1.192
 elect-1.186
Token here
Token ID994
Feature activation-3.68e-02

Pos logit contributions
avorite+0.076
ucc+0.075
ury+0.073
avour+0.071
ÃÂ+0.070
Neg logit contributions
peat-0.059
beit-0.051
wide-0.049
 primates-0.046
 elect-0.046
Token it
Token ID340
Feature activation-5.02e-01

Pos logit contributions
avorite+0.220
ucc+0.216
ury+0.209
avour+0.202
ÃÂ+0.200
Neg logit contributions
peat-0.205
beit-0.180
wide-0.173
 primates-0.164
 elect-0.163
Token is
Token ID318
Feature activation-4.48e-01

Pos logit contributions
ucc+2.677
avorite+2.676
ury+2.554
ÃÂ+2.484
avour+2.477
Neg logit contributions
peat-3.017
beit-2.677
wide-2.584
 primates-2.456
 elect-2.433
Token,
Token ID11
Feature activation-7.19e-02

Pos logit contributions
avorite+2.566
ucc+2.555
ury+2.475
avour+2.376
ÃÂ+2.350
Neg logit contributions
peat-2.528
beit-2.223
wide-2.138
 primates-2.028
 elect-2.016
Token without
Token ID1231
Feature activation-1.37e+00

Pos logit contributions
avorite+0.460
ucc+0.451
ury+0.438
avour+0.425
ÃÂ+0.418
Neg logit contributions
peat-0.368
beit-0.321
wide-0.307
 primates-0.289
 elect-0.287
Token further
Token ID2252
Feature activation-8.56e-01

Pos logit contributions
avorite+7.694
ucc+7.519
ury+7.238
avour+6.935
ÃÂ+6.597
Neg logit contributions
peat-7.415
beit-6.823
wide-6.301
 primates-5.881
 elect-5.819
Token ado
Token ID50129
Feature activation-4.51e-01

Pos logit contributions
avorite+4.025
ucc+3.962
ury+3.799
avour+3.678
ÃÂ+3.424
Neg logit contributions
peat-5.610
beit-5.175
wide-4.924
 elect-4.608
 primates-4.576
Token:
Token ID25
Feature activation-2.66e-02

Pos logit contributions
ucc+2.497
avorite+2.486
ury+2.434
avour+2.306
ÃÂ+2.276
Neg logit contributions
peat-2.634
beit-2.320
wide-2.216
 primates-2.106
 elect-2.075
Token one
Token ID530
Feature activation-3.69e-02

Pos logit contributions
avorite+0.163
ucc+0.159
ury+0.155
avour+0.150
ÃÂ+0.147
Neg logit contributions
peat-0.144
beit-0.126
wide-0.121
 primates-0.115
 elect-0.114
Token man
Token ID582
Feature activation-5.85e-01

Pos logit contributions
avorite+0.242
ucc+0.238
ury+0.232
avour+0.224
ÃÂ+0.221
Neg logit contributions
peat-0.182
beit-0.159
wide-0.151
 primates-0.142
 elect-0.140
Token4
Token ID19
Feature activation+4.87e-01

Pos logit contributions
avorite+3.165
ucc+3.144
ury+3.123
avour+2.967
uri+2.795
Neg logit contributions
peat-6.866
beit-6.307
 primates-6.194
 elect-6.045
wide-6.020
Token {
Token ID1391
Feature activation-4.81e-02

Pos logit contributions
peat+2.557
beit+2.267
wide+2.169
 elect+2.052
 primates+2.039
Neg logit contributions
avorite-3.076
ucc-2.980
ury-2.880
avour-2.805
ÃÂ-2.775
Token(
Token ID7
Feature activation-3.94e-01

Pos logit contributions
avorite+0.263
ucc+0.256
ury+0.249
avour+0.240
ÃÂ+0.235
Neg logit contributions
peat-0.291
beit-0.258
wide-0.249
 primates-0.240
 elect-0.237
Token0
Token ID15
Feature activation-1.31e-01

Pos logit contributions
avorite+1.867
ucc+1.827
ury+1.762
avour+1.689
ÃÂ+1.622
Neg logit contributions
peat-2.633
beit-2.382
wide-2.299
 primates-2.227
 elect-2.211
Tokens
Token ID82
Feature activation+1.82e-01

Pos logit contributions
avorite+0.838
ucc+0.823
ury+0.816
avour+0.776
ÃÂ+0.768
Neg logit contributions
peat-0.666
beit-0.579
wide-0.552
 primates-0.522
 elect-0.514
Token)}
Token ID38165
Feature activation+8.51e-01

Pos logit contributions
peat+0.844
beit+0.720
wide+0.686
 primates+0.647
 elect+0.636
Neg logit contributions
avorite-1.265
ucc-1.238
ury-1.209
avour-1.173
ÃÂ-1.159
Token N
Token ID399
Feature activation-1.48e-01

Pos logit contributions
peat+4.210
beit+3.855
wide+3.654
 elect+3.490
 candid+3.409
Neg logit contributions
avorite-5.491
ucc-5.255
ury-5.018
ÃÂ-4.963
avour-4.921
Tokenf
Token ID69
Feature activation-3.60e-01

Pos logit contributions
avorite+0.786
ucc+0.777
ury+0.754
avour+0.724
ÃÂ+0.694
Neg logit contributions
peat-0.911
beit-0.811
wide-0.782
 primates-0.756
 elect-0.748
Token6
Token ID21
Feature activation+7.40e-01

Pos logit contributions
avorite+1.505
ucc+1.452
ury+1.383
avour+1.318
ÃÂ+1.298
Neg logit contributions
peat-2.658
beit-2.425
wide-2.359
 primates-2.268
 elect-2.259
Token {
Token ID1391
Feature activation+2.22e-01

Pos logit contributions
peat+3.951
beit+3.557
wide+3.410
 elect+3.230
 primates+3.190
Neg logit contributions
avorite-4.512
ucc-4.356
ury-4.188
avour-4.085
ÃÂ-4.018
Token(
Token ID7
Feature activation-2.67e-01

Pos logit contributions
peat+1.312
beit+1.165
wide+1.123
 primates+1.078
 elect+1.068
Neg logit contributions
avorite-1.247
ucc-1.219
ury-1.178
avour-1.139
ÃÂ-1.117
Token selfish
Token ID20363
Feature activation+3.02e-01

Pos logit contributions
peat+0.480
beit+0.417
wide+0.399
 primates+0.376
 elect+0.373
Neg logit contributions
avorite-0.628
ucc-0.615
ury-0.597
avour-0.580
ÃÂ-0.572
Token request
Token ID2581
Feature activation+4.25e-01

Pos logit contributions
peat+1.472
beit+1.278
wide+1.234
 elect+1.161
 primates+1.155
Neg logit contributions
avorite-2.014
ucc-1.968
ury-1.882
avour-1.850
ÃÂ-1.835
Token.
Token ID13
Feature activation+5.84e-01

Pos logit contributions
peat+2.300
beit+2.034
wide+1.962
 elect+1.844
 primates+1.842
Neg logit contributions
avorite-2.588
ucc-2.523
ury-2.423
avour-2.370
ÃÂ-2.332
Token\n
Token ID198
Feature activation+5.26e-01

Pos logit contributions
peat+3.004
beit+2.696
wide+2.612
 elect+2.408
 primates+2.396
Neg logit contributions
avorite-3.726
ucc-3.541
ury-3.416
avour-3.346
ÃÂ-3.325
Token\n
Token ID198
Feature activation+4.21e-01

Pos logit contributions
peat+2.411
wide+2.074
beit+2.067
 elect+1.829
 primates+1.792
Neg logit contributions
avorite-3.838
ÃÂ-3.559
ucc-3.547
ury-3.441
avour-3.383
TokenEven
Token ID6104
Feature activation+8.58e-01

Pos logit contributions
peat+2.746
beit+2.491
wide+2.433
 primates+2.254
 elect+2.250
Neg logit contributions
avorite-2.171
ucc-2.013
ury-1.931
ÃÂ-1.927
avour-1.869
Token if
Token ID611
Feature activation+5.45e-01

Pos logit contributions
peat+3.981
beit+3.547
wide+3.365
 elect+3.182
 primates+3.180
Neg logit contributions
avorite-5.739
ucc-5.598
ury-5.350
avour-5.324
ÃÂ-5.297
Token the
Token ID262
Feature activation+4.34e-01

Pos logit contributions
peat+2.674
beit+2.343
wide+2.229
 elect+2.116
 primates+2.115
Neg logit contributions
avorite-3.592
ucc-3.495
ury-3.367
avour-3.311
ÃÂ-3.268
Token God
Token ID1793
Feature activation+4.00e-01

Pos logit contributions
peat+2.299
beit+2.013
wide+1.942
 primates+1.848
 elect+1.842
Neg logit contributions
avorite-2.694
ucc-2.631
ury-2.542
avour-2.471
ÃÂ-2.457
Token of
Token ID286
Feature activation-2.35e-01

Pos logit contributions
peat+2.072
beit+1.818
wide+1.752
 elect+1.653
 primates+1.642
Neg logit contributions
avorite-2.544
ucc-2.474
ury-2.397
avour-2.330
ÃÂ-2.305
Token this
Token ID428
Feature activation+3.88e-01

Pos logit contributions
avorite+1.322
ucc+1.287
ury+1.253
avour+1.209
ÃÂ+1.176
Neg logit contributions
peat-1.382
beit-1.223
wide-1.185
 primates-1.122
 elect-1.113
Token firm
Token ID4081
Feature activation+7.37e-02

Pos logit contributions
avorite+1.308
ucc+1.282
ury+1.242
avour+1.203
ÃÂ+1.172
Neg logit contributions
peat-1.229
beit-1.090
wide-1.035
 primates-0.985
 elect-0.978
Token that
Token ID326
Feature activation-2.21e-03

Pos logit contributions
peat+0.396
beit+0.349
wide+0.332
 primates+0.316
 elect+0.313
Neg logit contributions
avorite-0.452
ucc-0.444
ury-0.431
avour-0.418
ÃÂ-0.409
Token special
Token ID2041
Feature activation+6.82e-04

Pos logit contributions
avorite+0.013
ucc+0.013
ury+0.013
avour+0.012
ÃÂ+0.012
Neg logit contributions
peat-0.012
beit-0.011
wide-0.010
 primates-0.010
 elect-0.010
Tokenises
Token ID2696
Feature activation+2.60e-01

Pos logit contributions
peat+0.005
beit+0.004
wide+0.004
 primates+0.004
 elect+0.004
Neg logit contributions
avorite-0.003
ucc-0.003
ury-0.003
avour-0.003
ÃÂ-0.003
Token in
Token ID287
Feature activation+5.47e-02

Pos logit contributions
peat+1.205
beit+1.035
wide+0.978
 primates+0.919
 elect+0.902
Neg logit contributions
avorite-1.802
ucc-1.768
ury-1.727
avour-1.682
ÃÂ-1.651
Token proxy
Token ID15741
Feature activation+9.69e-01

Pos logit contributions
peat+0.301
beit+0.265
wide+0.255
 primates+0.242
 elect+0.240
Neg logit contributions
avorite-0.330
ucc-0.323
ury-0.313
avour-0.303
ÃÂ-0.298
Token battles
Token ID10181
Feature activation+2.11e-01

Pos logit contributions
peat+4.876
beit+4.266
wide+4.255
 elect+4.080
 primates+4.049
Neg logit contributions
avorite-6.089
ucc-5.890
ury-5.622
ÃÂ-5.600
avour-5.538
Token in
Token ID287
Feature activation+1.39e-01

Pos logit contributions
peat+1.120
beit+0.981
wide+0.948
 primates+0.894
 elect+0.889
Neg logit contributions
avorite-1.315
ucc-1.276
ury-1.239
avour-1.203
ÃÂ-1.193
Token a
Token ID257
Feature activation-2.95e-02

Pos logit contributions
peat+0.767
beit+0.677
wide+0.654
 primates+0.621
 elect+0.617
Neg logit contributions
avorite-0.840
ucc-0.817
ury-0.792
avour-0.768
ÃÂ-0.762
Token move
Token ID1445
Feature activation+3.57e-01

Pos logit contributions
avorite+0.199
ucc+0.195
ury+0.190
avour+0.185
ÃÂ+0.181
Neg logit contributions
peat-0.141
beit-0.122
wide-0.116
 primates-0.109
 elect-0.108
Token which
Token ID543
Feature activation+3.01e-01

Pos logit contributions
peat+1.814
beit+1.571
wide+1.523
 primates+1.435
 elect+1.422
Neg logit contributions
avorite-2.311
ucc-2.254
ury-2.192
avour-2.120
ÃÂ-2.119
Token originally
Token ID6198
Feature activation-3.35e-02

Pos logit contributions
avorite+1.044
ucc+1.023
ury+0.995
avour+0.965
ÃÂ+0.948
Neg logit contributions
peat-0.818
beit-0.712
wide-0.677
 primates-0.640
 elect-0.632
Token passed
Token ID3804
Feature activation-5.43e-01

Pos logit contributions
avorite+0.221
ucc+0.216
ury+0.210
avour+0.204
ÃÂ+0.201
Neg logit contributions
peat-0.166
beit-0.144
wide-0.138
 primates-0.130
 elect-0.128
Token in
Token ID287
Feature activation-1.93e-02

Pos logit contributions
avorite+3.494
ucc+3.478
ury+3.367
avour+3.245
ÃÂ+3.185
Neg logit contributions
peat-2.711
beit-2.371
wide-2.182
 primates-2.110
 elect-2.049
Token 1970
Token ID8069
Feature activation+8.08e-02

Pos logit contributions
avorite+0.104
ucc+0.101
ury+0.098
avour+0.094
ÃÂ+0.093
Neg logit contributions
peat-0.119
beit-0.106
wide-0.103
 primates-0.098
 elect-0.098
Token,[
Token ID17414
Feature activation-6.98e-01

Pos logit contributions
peat+0.421
beit+0.368
wide+0.356
 primates+0.334
 elect+0.333
Neg logit contributions
avorite-0.512
ucc-0.499
ury-0.484
avour-0.471
ÃÂ-0.465
Token2
Token ID17
Feature activation+8.89e-01

Pos logit contributions
ucc+2.034
avorite+2.018
ury+1.872
avour+1.733
uri+1.612
Neg logit contributions
peat-5.885
beit-5.395
wide-5.165
 primates-5.158
 elect-5.102
Token]
Token ID60
Feature activation-2.61e-02

Pos logit contributions
peat+4.450
beit+3.901
wide+3.781
 primates+3.511
 elect+3.500
Neg logit contributions
avorite-5.743
ucc-5.552
ury-5.425
avour-5.312
ÃÂ-5.210
Token and
Token ID290
Feature activation-1.16e-01

Pos logit contributions
avorite+0.166
ucc+0.162
ury+0.157
avour+0.153
ÃÂ+0.151
Neg logit contributions
peat-0.135
beit-0.118
wide-0.115
 primates-0.108
 elect-0.107
Token is
Token ID318
Feature activation-3.05e-03

Pos logit contributions
avorite+0.726
ucc+0.711
ury+0.691
avour+0.668
ÃÂ+0.656
Neg logit contributions
peat-0.614
beit-0.537
wide-0.512
 primates-0.485
 elect-0.481
Token enforced
Token ID20326
Feature activation-2.45e-01

Pos logit contributions
avorite+0.020
ucc+0.019
ury+0.019
avour+0.018
ÃÂ+0.018
Neg logit contributions
peat-0.016
beit-0.014
wide-0.013
 primates-0.012
 elect-0.012
Token by
Token ID416
Feature activation-1.78e-01

Pos logit contributions
avorite+1.592
ucc+1.563
ury+1.524
avour+1.475
ÃÂ+1.449
Neg logit contributions
peat-1.227
beit-1.068
wide-1.013
 primates-0.960
 elect-0.947
Token seven
Token ID3598
Feature activation+4.88e-01

Pos logit contributions
peat+2.482
beit+2.162
wide+2.072
 primates+1.953
 elect+1.939
Neg logit contributions
avorite-3.161
ucc-3.094
ury-3.005
avour-2.917
ÃÂ-2.875
Token heavens
Token ID27636
Feature activation-3.62e-01

Pos logit contributions
peat+2.026
beit+1.711
wide+1.637
 primates+1.522
 elect+1.518
Neg logit contributions
avorite-3.633
ucc-3.529
ury-3.438
avour-3.355
ÃÂ-3.342
Token.
Token ID13
Feature activation-4.27e-02

Pos logit contributions
avorite+1.905
ucc+1.882
ury+1.836
avour+1.754
ÃÂ+1.714
Neg logit contributions
peat-2.234
beit-1.972
wide-1.899
 primates-1.824
 elect-1.792
Token
Token ID447
Feature activation+4.32e-01

Pos logit contributions
avorite+0.248
ucc+0.240
ury+0.232
avour+0.225
ÃÂ+0.222
Neg logit contributions
peat-0.246
beit-0.220
wide-0.212
 primates-0.200
 elect-0.199
Token
Token ID251
Feature activation-1.85e-01

Pos logit contributions
peat+2.778
beit+2.498
wide+2.415
 primates+2.310
 elect+2.299
Neg logit contributions
avorite-2.206
ucc-2.140
ury-2.060
avour-1.986
ÃÂ-1.948
Token6
Token ID21
Feature activation+1.31e+00

Pos logit contributions
avorite+1.078
ucc+1.056
ury+1.023
avour+0.989
ÃÂ+0.971
Neg logit contributions
peat-1.057
beit-0.933
wide-0.899
 primates-0.856
 elect-0.850
Token\n
Token ID198
Feature activation+1.96e-01

Pos logit contributions
peat+6.759
beit+6.222
wide+5.948
 primates+5.588
 elect+5.490
Neg logit contributions
avorite-7.874
ucc-7.365
ury-7.111
ÃÂ-7.051
avour-6.961
Token\n
Token ID198
Feature activation-1.84e-01

Pos logit contributions
peat+0.960
beit+0.831
wide+0.811
 elect+0.743
 primates+0.741
Neg logit contributions
avorite-1.331
ucc-1.269
ury-1.232
ÃÂ-1.221
avour-1.203
TokenDaniel
Token ID19962
Feature activation+2.38e-01

Pos logit contributions
avorite+0.907
ucc+0.890
ury+0.855
avour+0.824
ÃÂ+0.796
Neg logit contributions
peat-1.209
beit-1.088
wide-1.051
 primates-1.014
 elect-1.010
Token C
Token ID327
Feature activation-3.25e-02

Pos logit contributions
peat+1.407
beit+1.257
wide+1.215
 primates+1.158
 elect+1.156
Neg logit contributions
avorite-1.342
ucc-1.293
ury-1.241
avour-1.208
ÃÂ-1.202
Token.
Token ID13
Feature activation+1.92e-02

Pos logit contributions
avorite+0.179
ucc+0.174
ury+0.169
avour+0.162
ÃÂ+0.159
Neg logit contributions
peat-0.197
beit-0.177
wide-0.171
 primates-0.163
 elect-0.161
Token in
Token ID287
Feature activation-2.67e-01

Pos logit contributions
peat+2.256
beit+1.939
wide+1.868
 elect+1.734
 primates+1.728
Neg logit contributions
avorite-3.471
ucc-3.365
ury-3.284
avour-3.207
ÃÂ-3.165
Token an
Token ID281
Feature activation-1.29e-01

Pos logit contributions
avorite+1.669
ucc+1.657
ury+1.605
avour+1.553
ÃÂ+1.507
Neg logit contributions
peat-1.402
beit-1.216
wide-1.153
 primates-1.097
 elect-1.083
Token email
Token ID3053
Feature activation+1.36e-01

Pos logit contributions
avorite+0.833
ucc+0.829
ury+0.803
avour+0.773
ÃÂ+0.760
Neg logit contributions
peat-0.649
beit-0.559
wide-0.530
 primates-0.502
 elect-0.491
Token to
Token ID284
Feature activation+5.01e-03

Pos logit contributions
peat+0.732
beit+0.643
wide+0.615
 primates+0.584
 elect+0.578
Neg logit contributions
avorite-0.836
ucc-0.820
ury-0.797
avour-0.771
ÃÂ-0.758
Token several
Token ID1811
Feature activation+1.76e-01

Pos logit contributions
peat+0.027
beit+0.024
wide+0.023
 primates+0.021
 elect+0.021
Neg logit contributions
avorite-0.031
ucc-0.030
ury-0.029
avour-0.028
ÃÂ-0.028
Token colleagues
Token ID7810
Feature activation+3.51e-01

Pos logit contributions
peat+0.881
beit+0.765
wide+0.735
 primates+0.691
 elect+0.687
Neg logit contributions
avorite-1.155
ucc-1.126
ury-1.094
avour-1.066
ÃÂ-1.052
Token,
Token ID11
Feature activation+9.12e-02

Pos logit contributions
peat+1.849
beit+1.622
wide+1.560
 primates+1.473
 elect+1.464
Neg logit contributions
avorite-2.197
ucc-2.143
ury-2.077
avour-2.019
ÃÂ-1.988
Token
Token ID564
Feature activation+3.28e-01

Pos logit contributions
peat+0.465
beit+0.405
wide+0.386
 primates+0.365
 elect+0.362
Neg logit contributions
avorite-0.588
ucc-0.576
ury-0.560
avour-0.543
ÃÂ-0.535
Token
Token ID250
Feature activation+3.69e-01

Pos logit contributions
peat+2.269
beit+2.054
wide+1.993
 primates+1.914
 elect+1.904
Neg logit contributions
avorite-1.513
ucc-1.467
ury-1.408
avour-1.350
ÃÂ-1.321
TokenSub
Token ID7004
Feature activation+5.05e-01

Pos logit contributions
peat+2.271
beit+2.030
wide+2.005
 primates+1.846
 elect+1.845
Neg logit contributions
avorite-2.003
ucc-1.907
ury-1.846
ÃÂ-1.798
avour-1.793
Tokenprime
Token ID35505
Feature activation+2.44e-01

Pos logit contributions
peat+3.522
beit+3.186
wide+3.132
 primates+2.965
 elect+2.953
Neg logit contributions
avorite-2.295
ucc-2.203
ury-2.102
ÃÂ-2.025
avour-2.015
Token markets
Token ID5939
Feature activation+3.05e-01

Pos logit contributions
peat+0.406
beit+0.358
wide+0.344
 primates+0.327
 elect+0.324
Neg logit contributions
avorite-0.433
ucc-0.424
ury-0.411
avour-0.398
ÃÂ-0.391
Token are
Token ID389
Feature activation+2.28e-01

Pos logit contributions
peat+1.637
beit+1.440
wide+1.394
 primates+1.313
 elect+1.309
Neg logit contributions
avorite-1.881
ucc-1.830
ury-1.772
avour-1.720
ÃÂ-1.696
Token in
Token ID287
Feature activation+2.37e-01

Pos logit contributions
peat+1.152
beit+1.002
wide+0.973
 primates+0.915
 elect+0.912
Neg logit contributions
avorite-1.480
ucc-1.440
ury-1.395
avour-1.358
ÃÂ-1.343
Token a
Token ID257
Feature activation+9.49e-02

Pos logit contributions
peat+1.173
beit+1.018
wide+0.983
 primates+0.923
 elect+0.918
Neg logit contributions
avorite-1.567
ucc-1.526
ury-1.481
avour-1.440
ÃÂ-1.422
Token
Token ID564
Feature activation+3.01e-01

Pos logit contributions
peat+0.485
beit+0.422
wide+0.406
 primates+0.383
 elect+0.381
Neg logit contributions
avorite-0.612
ucc-0.596
ury-0.578
avour-0.562
ÃÂ-0.554
Token
Token ID250
Feature activation+1.19e-01

Pos logit contributions
peat+2.082
beit+1.884
wide+1.828
 primates+1.755
 elect+1.746
Neg logit contributions
avorite-1.392
ucc-1.350
ury-1.295
avour-1.242
ÃÂ-1.215
Tokendeath
Token ID22595
Feature activation+5.63e-01

Pos logit contributions
peat+0.694
beit+0.614
wide+0.596
 primates+0.562
 elect+0.560
Neg logit contributions
avorite-0.683
ucc-0.662
ury-0.640
avour-0.620
ÃÂ-0.613
Token spiral
Token ID23642
Feature activation+1.26e-01

Pos logit contributions
peat+2.826
beit+2.434
wide+2.363
 elect+2.216
 primates+2.211
Neg logit contributions
avorite-3.694
ucc-3.570
ury-3.446
avour-3.366
ÃÂ-3.313
Token
Token ID447
Feature activation+1.44e-01

Pos logit contributions
peat+0.707
beit+0.624
wide+0.602
 primates+0.570
 elect+0.567
Neg logit contributions
avorite-0.749
ucc-0.729
ury-0.706
avour-0.685
ÃÂ-0.673
Token
Token ID251
Feature activation-1.15e-02

Pos logit contributions
peat+0.959
beit+0.864
wide+0.837
 primates+0.803
 elect+0.798
Neg logit contributions
avorite-0.698
ucc-0.681
ury-0.655
avour-0.628
ÃÂ-0.617
Token and
Token ID290
Feature activation-8.17e-02

Pos logit contributions
avorite+0.072
ucc+0.070
ury+0.068
avour+0.066
ÃÂ+0.065
Neg logit contributions
peat-0.061
beit-0.053
wide-0.051
 primates-0.048
 elect-0.048
TokenOrange
Token ID40141
Feature activation+2.11e-01

Pos logit contributions
avorite+0.382
ucc+0.367
ury+0.352
avour+0.338
ÃÂ+0.333
Neg logit contributions
peat-0.574
beit-0.521
wide-0.506
 primates-0.483
 elect-0.481
Tokenhead
Token ID2256
Feature activation-9.00e-02

Pos logit contributions
peat+1.178
beit+1.040
wide+1.002
 primates+0.948
 elect+0.945
Neg logit contributions
avorite-1.264
ucc-1.225
ury-1.184
avour-1.150
ÃÂ-1.136
Token,
Token ID11
Feature activation+5.62e-02

Pos logit contributions
avorite+0.530
ucc+0.519
ury+0.503
avour+0.487
ÃÂ+0.478
Neg logit contributions
peat-0.507
beit-0.447
wide-0.430
 primates-0.409
 elect-0.406
Token wasn
Token ID2492
Feature activation+5.70e-02

Pos logit contributions
peat+0.289
beit+0.253
wide+0.242
 primates+0.228
 elect+0.227
Neg logit contributions
avorite-0.359
ucc-0.351
ury-0.341
avour-0.331
ÃÂ-0.326
Token
Token ID447
Feature activation+2.40e-01

Pos logit contributions
peat+0.338
beit+0.302
wide+0.292
 elect+0.273
 primates+0.272
Neg logit contributions
avorite-0.330
ucc-0.311
ury-0.302
avour-0.293
ÃÂ-0.291
Token
Token ID247
Feature activation+5.10e-02

Pos logit contributions
peat+1.627
beit+1.469
wide+1.417
 primates+1.358
 elect+1.355
Neg logit contributions
avorite-1.156
ucc-1.111
ury-1.069
avour-1.030
ÃÂ-1.010
Tokent
Token ID83
Feature activation+1.63e-02

Pos logit contributions
peat+0.304
beit+0.270
wide+0.261
 primates+0.249
 elect+0.247
Neg logit contributions
avorite-0.285
ucc-0.277
ury-0.268
avour-0.259
ÃÂ-0.255
Token the
Token ID262
Feature activation+2.23e-01

Pos logit contributions
peat+0.079
beit+0.068
wide+0.065
 primates+0.062
 elect+0.061
Neg logit contributions
avorite-0.108
ucc-0.106
ury-0.103
avour-0.100
ÃÂ-0.099
Token Captain
Token ID8599
Feature activation+2.00e-02

Pos logit contributions
peat+1.192
beit+1.046
wide+1.005
 primates+0.952
 elect+0.946
Neg logit contributions
avorite-1.377
ucc-1.343
ury-1.303
avour-1.264
ÃÂ-1.247
Token here
Token ID994
Feature activation+2.52e-01

Pos logit contributions
peat+0.101
beit+0.088
wide+0.084
 primates+0.079
 elect+0.078
Neg logit contributions
avorite-0.130
ucc-0.127
ury-0.123
avour-0.120
ÃÂ-0.118
Token just
Token ID655
Feature activation+6.84e-01

Pos logit contributions
peat+1.243
beit+1.082
wide+1.040
 elect+0.975
 primates+0.974
Neg logit contributions
avorite-1.665
ucc-1.622
ury-1.570
avour-1.531
ÃÂ-1.512
Token I
Token ID314
Feature activation+3.26e-01

Pos logit contributions
peat+2.819
beit+2.471
wide+2.369
 elect+2.248
 primates+2.245
Neg logit contributions
avorite-3.667
ucc-3.565
ury-3.470
avour-3.375
ÃÂ-3.363
Token did
Token ID750
Feature activation+2.42e-01

Pos logit contributions
peat+1.708
beit+1.497
wide+1.451
 elect+1.370
 primates+1.367
Neg logit contributions
avorite-2.061
ucc-2.005
ury-1.937
avour-1.885
ÃÂ-1.883
Token manage
Token ID6687
Feature activation+6.35e-01

Pos logit contributions
peat+1.248
beit+1.088
wide+1.046
 primates+0.988
 elect+0.985
Neg logit contributions
avorite-1.546
ucc-1.510
ury-1.468
avour-1.425
ÃÂ-1.408
Token to
Token ID284
Feature activation+4.68e-01

Pos logit contributions
peat+3.116
beit+2.692
wide+2.589
 elect+2.441
 primates+2.440
Neg logit contributions
avorite-4.185
ucc-4.070
ury-3.973
avour-3.862
ÃÂ-3.811
Token answer
Token ID3280
Feature activation+6.32e-03

Pos logit contributions
peat+2.489
beit+2.184
wide+2.107
 elect+2.029
 primates+2.008
Neg logit contributions
avorite-2.895
ucc-2.816
ury-2.745
ÃÂ-2.662
avour-2.656
Token a
Token ID257
Feature activation+2.20e-01

Pos logit contributions
peat+0.031
beit+0.027
wide+0.026
 primates+0.024
 elect+0.024
Neg logit contributions
avorite-0.042
ucc-0.041
ury-0.040
avour-0.039
ÃÂ-0.038
Token few
Token ID1178
Feature activation+2.23e-01

Pos logit contributions
peat+1.009
beit+0.867
wide+0.830
 primates+0.774
 elect+0.770
Neg logit contributions
avorite-1.534
ucc-1.498
ury-1.459
avour-1.418
ÃÂ-1.408
Token questions
Token ID2683
Feature activation+3.51e-01

Pos logit contributions
peat+1.019
beit+0.873
wide+0.832
 primates+0.777
 elect+0.771
Neg logit contributions
avorite-1.554
ucc-1.523
ury-1.483
avour-1.443
ÃÂ-1.423
Token in
Token ID287
Feature activation+1.93e-01

Pos logit contributions
peat+1.768
beit+1.547
wide+1.489
 primates+1.400
 elect+1.398
Neg logit contributions
avorite-2.292
ucc-2.226
ury-2.150
avour-2.096
ÃÂ-2.085
Token the
Token ID262
Feature activation+2.12e-01

Pos logit contributions
peat+1.000
beit+0.874
wide+0.838
 primates+0.792
 elect+0.786
Neg logit contributions
avorite-1.224
ucc-1.197
ury-1.162
avour-1.127
ÃÂ-1.112
Token chat
Token ID8537
Feature activation+2.05e-01

Pos logit contributions
peat+1.216
beit+1.077
wide+1.041
 primates+0.992
 elect+0.986
Neg logit contributions
avorite-1.233
ucc-1.202
ury-1.162
avour-1.123
ÃÂ-1.111
Token of
Token ID286
Feature activation-1.00e-01

Pos logit contributions
peat+0.014
beit+0.012
wide+0.011
 primates+0.011
 elect+0.011
Neg logit contributions
avorite-0.018
ucc-0.018
ury-0.017
avour-0.017
ÃÂ-0.016
Token their
Token ID511
Feature activation-7.18e-02

Pos logit contributions
avorite+0.597
ucc+0.584
ury+0.569
avour+0.549
ÃÂ+0.540
Neg logit contributions
peat-0.554
beit-0.489
wide-0.470
 primates-0.447
 elect-0.440
Token own
Token ID898
Feature activation+3.95e-01

Pos logit contributions
avorite+0.423
ucc+0.412
ury+0.399
avour+0.387
ÃÂ+0.380
Neg logit contributions
peat-0.406
beit-0.359
wide-0.346
 primates-0.328
 elect-0.327
Token.
Token ID13
Feature activation+5.81e-02

Pos logit contributions
peat+2.141
beit+1.882
wide+1.814
 elect+1.726
 primates+1.720
Neg logit contributions
avorite-2.421
ucc-2.365
ury-2.271
avour-2.209
ÃÂ-2.176
Token His
Token ID2399
Feature activation+1.16e-01

Pos logit contributions
peat+0.341
beit+0.307
wide+0.298
 elect+0.279
 primates+0.278
Neg logit contributions
avorite-0.336
ucc-0.316
ury-0.308
avour-0.298
ÃÂ-0.297
Token overhaul
Token ID18708
Feature activation+3.47e-02

Pos logit contributions
peat+0.620
beit+0.543
wide+0.524
 elect+0.496
 primates+0.495
Neg logit contributions
avorite-0.723
ucc-0.704
ury-0.682
avour-0.662
ÃÂ-0.655
Token saw
Token ID2497
Feature activation+3.62e-01

Pos logit contributions
peat+0.183
beit+0.160
wide+0.153
 primates+0.145
 elect+0.144
Neg logit contributions
avorite-0.218
ucc-0.213
ury-0.207
avour-0.201
ÃÂ-0.198
Token the
Token ID262
Feature activation+8.31e-02

Pos logit contributions
peat+1.871
beit+1.632
wide+1.578
 elect+1.485
 primates+1.482
Neg logit contributions
avorite-2.299
ucc-2.231
ury-2.165
avour-2.105
ÃÂ-2.076
Token dismissal
Token ID25693
Feature activation-6.62e-03

Pos logit contributions
peat+0.472
beit+0.417
wide+0.402
 primates+0.382
 elect+0.381
Neg logit contributions
avorite-0.489
ucc-0.475
ury-0.460
avour-0.446
ÃÂ-0.440
Token of
Token ID286
Feature activation+1.04e-01

Pos logit contributions
avorite+0.043
ucc+0.043
ury+0.041
avour+0.040
ÃÂ+0.040
Neg logit contributions
peat-0.033
beit-0.029
wide-0.027
 primates-0.026
 elect-0.025
Token veterans
Token ID11255
Feature activation-4.91e-02

Pos logit contributions
peat+0.605
beit+0.537
wide+0.519
 primates+0.493
 elect+0.491
Neg logit contributions
avorite-0.600
ucc-0.584
ury-0.565
avour-0.547
ÃÂ-0.539
Token the
Token ID262
Feature activation-2.06e-01

Pos logit contributions
avorite+0.299
ucc+0.292
ury+0.284
avour+0.275
ÃÂ+0.271
Neg logit contributions
peat-0.251
beit-0.219
wide-0.210
 primates-0.199
 elect-0.197
Token Washington
Token ID2669
Feature activation-1.70e-01

Pos logit contributions
avorite+1.128
ucc+1.094
ury+1.059
avour+1.021
ÃÂ+0.996
Neg logit contributions
peat-1.253
beit-1.119
wide-1.068
 primates-1.025
 elect-1.010
Token Beer
Token ID16971
Feature activation+6.62e-02

Pos logit contributions
avorite+0.722
ucc+0.691
ury+0.668
avour+0.632
ÃÂ+0.616
Neg logit contributions
peat-1.240
beit-1.123
wide-1.075
 primates-1.060
 elect-1.035
Token Mobile
Token ID12173
Feature activation-1.66e-01

Pos logit contributions
peat+0.406
beit+0.363
wide+0.351
 primates+0.335
 elect+0.333
Neg logit contributions
avorite-0.358
ucc-0.348
ury-0.336
avour-0.324
ÃÂ-0.319
Token App
Token ID2034
Feature activation-1.02e-02

Pos logit contributions
avorite+0.755
ucc+0.737
ury+0.698
avour+0.673
ÃÂ+0.654
Neg logit contributions
peat-1.157
beit-1.048
wide-1.003
 primates-0.977
 elect-0.958
Token for
Token ID329
Feature activation-6.11e-02

Pos logit contributions
avorite+0.065
ucc+0.064
ury+0.062
avour+0.060
ÃÂ+0.059
Neg logit contributions
peat-0.053
beit-0.046
wide-0.044
 primates-0.042
 elect-0.041
Token a
Token ID257
Feature activation-3.38e-01

Pos logit contributions
avorite+0.402
ucc+0.394
ury+0.383
avour+0.372
ÃÂ+0.366
Neg logit contributions
peat-0.303
beit-0.263
wide-0.251
 primates-0.237
 elect-0.235
Token few
Token ID1178
Feature activation-2.47e-01

Pos logit contributions
avorite+2.351
ucc+2.314
ury+2.258
avour+2.187
ÃÂ+2.153
Neg logit contributions
peat-1.521
beit-1.290
wide-1.217
 primates-1.157
 elect-1.130
Token weeks
Token ID2745
Feature activation+1.44e-01

Pos logit contributions
avorite+1.335
ucc+1.315
ury+1.268
avour+1.218
ÃÂ+1.193
Neg logit contributions
peat-1.490
beit-1.335
wide-1.275
 primates-1.218
 elect-1.201
Token:
Token ID25
Feature activation+2.85e-01

Pos logit contributions
peat+0.814
beit+0.720
wide+0.693
 primates+0.657
 elect+0.654
Neg logit contributions
avorite-0.854
ucc-0.833
ury-0.806
avour-0.781
ÃÂ-0.768
Token feedback
Token ID7538
Feature activation+5.07e-01

Pos logit contributions
peat+1.505
beit+1.323
wide+1.276
 primates+1.201
 elect+1.200
Neg logit contributions
avorite-1.778
ucc-1.735
ury-1.685
avour-1.632
ÃÂ-1.611
Token company
Token ID1664
Feature activation+2.72e-01

Pos logit contributions
peat+2.684
beit+2.352
wide+2.291
 elect+2.186
 primates+2.173
Neg logit contributions
avorite-2.931
ucc-2.832
ury-2.747
avour-2.698
ÃÂ-2.681
Token,
Token ID11
Feature activation+2.35e-01

Pos logit contributions
peat+1.441
beit+1.268
wide+1.230
 primates+1.154
 elect+1.151
Neg logit contributions
avorite-1.691
ucc-1.638
ury-1.595
avour-1.553
ÃÂ-1.525
Token which
Token ID543
Feature activation+4.25e-01

Pos logit contributions
peat+1.341
beit+1.198
wide+1.156
 elect+1.098
 primates+1.097
Neg logit contributions
avorite-1.370
ucc-1.331
ury-1.283
avour-1.248
ÃÂ-1.233
Token has
Token ID468
Feature activation+2.87e-01

Pos logit contributions
peat+2.115
beit+1.854
wide+1.778
 primates+1.675
 elect+1.671
Neg logit contributions
avorite-2.787
ucc-2.719
ury-2.628
avour-2.574
ÃÂ-2.542
Token since
Token ID1201
Feature activation-3.05e-01

Pos logit contributions
peat+1.437
beit+1.253
wide+1.210
 primates+1.135
 elect+1.129
Neg logit contributions
avorite-1.878
ucc-1.836
ury-1.779
avour-1.737
ÃÂ-1.717
Token been
Token ID587
Feature activation-3.97e-01

Pos logit contributions
avorite+1.929
ucc+1.899
ury+1.857
avour+1.780
ÃÂ+1.732
Neg logit contributions
peat-1.569
beit-1.352
wide-1.273
 primates-1.221
 elect-1.194
Token sold
Token ID2702
Feature activation-8.72e-02

Pos logit contributions
avorite+2.489
ucc+2.441
ury+2.376
avour+2.299
ÃÂ+2.251
Neg logit contributions
peat-2.079
beit-1.813
wide-1.728
 primates-1.642
 elect-1.617
Token to
Token ID284
Feature activation+1.19e-01

Pos logit contributions
avorite+0.538
ucc+0.527
ury+0.512
avour+0.496
ÃÂ+0.488
Neg logit contributions
peat-0.468
beit-0.411
wide-0.392
 primates-0.372
 elect-0.369
Token the
Token ID262
Feature activation+6.53e-02

Pos logit contributions
peat+0.664
beit+0.586
wide+0.564
 primates+0.535
 elect+0.531
Neg logit contributions
avorite-0.707
ucc-0.691
ury-0.669
avour-0.648
ÃÂ-0.638
Token video
Token ID2008
Feature activation-1.10e-01

Pos logit contributions
peat+0.376
beit+0.333
wide+0.321
 primates+0.306
 elect+0.304
Neg logit contributions
avorite-0.378
ucc-0.369
ury-0.357
avour-0.346
ÃÂ-0.341
Token start
Token ID923
Feature activation-4.04e-01

Pos logit contributions
avorite+0.729
ucc+0.726
ury+0.703
avour+0.681
ÃÂ+0.661
Neg logit contributions
peat-0.535
beit-0.467
wide-0.436
 primates-0.407
 elect-0.401
Token Department
Token ID2732
Feature activation+1.40e-02

Pos logit contributions
peat+1.191
beit+1.068
wide+1.036
 primates+0.990
 elect+0.984
Neg logit contributions
avorite-0.976
ucc-0.948
ury-0.914
avour-0.882
ÃÂ-0.866
Token,
Token ID11
Feature activation-4.27e-02

Pos logit contributions
peat+0.081
beit+0.071
wide+0.068
 primates+0.065
 elect+0.064
Neg logit contributions
avorite-0.082
ucc-0.080
ury-0.077
avour-0.074
ÃÂ-0.074
Token could
Token ID714
Feature activation-3.47e-01

Pos logit contributions
avorite+0.268
ucc+0.263
ury+0.257
avour+0.248
ÃÂ+0.243
Neg logit contributions
peat-0.223
beit-0.195
wide-0.186
 primates-0.176
 elect-0.174
Token be
Token ID307
Feature activation+6.77e-02

Pos logit contributions
avorite+1.870
ucc+1.857
ury+1.817
avour+1.721
ÃÂ+1.684
Neg logit contributions
peat-2.075
beit-1.856
wide-1.782
 primates-1.699
 rats-1.638
Token seen
Token ID1775
Feature activation+1.22e-01

Pos logit contributions
peat+0.354
beit+0.309
wide+0.298
 primates+0.281
 elect+0.280
Neg logit contributions
avorite-0.428
ucc-0.418
ury-0.404
avour-0.393
ÃÂ-0.388
Token drawing
Token ID8263
Feature activation-2.83e-02

Pos logit contributions
peat+0.673
beit+0.593
wide+0.572
 primates+0.542
 elect+0.540
Neg logit contributions
avorite-0.734
ucc-0.716
ury-0.692
avour-0.672
ÃÂ-0.662
Token his
Token ID465
Feature activation-1.88e-01

Pos logit contributions
avorite+0.186
ucc+0.183
ury+0.178
avour+0.172
ÃÂ+0.169
Neg logit contributions
peat-0.141
beit-0.122
wide-0.115
 primates-0.109
 elect-0.107
Token gun
Token ID2485
Feature activation+7.53e-02

Pos logit contributions
avorite+1.125
ucc+1.103
ury+1.071
avour+1.031
ÃÂ+1.010
Neg logit contributions
peat-1.040
beit-0.914
wide-0.872
 primates-0.829
 elect-0.819
Token and
Token ID290
Feature activation+3.26e-02

Pos logit contributions
peat+0.388
beit+0.339
wide+0.327
 primates+0.306
 elect+0.306
Neg logit contributions
avorite-0.485
ucc-0.471
ury-0.455
avour-0.445
ÃÂ-0.440
Token yelling
Token ID22187
Feature activation+2.86e-01

Pos logit contributions
peat+0.185
beit+0.164
wide+0.158
 primates+0.150
 elect+0.149
Neg logit contributions
avorite-0.192
ucc-0.187
ury-0.181
avour-0.175
ÃÂ-0.172
Token for
Token ID329
Feature activation+1.23e-01

Pos logit contributions
peat+1.411
beit+1.226
wide+1.179
 primates+1.110
 elect+1.103
Neg logit contributions
avorite-1.892
ucc-1.846
ury-1.788
avour-1.746
ÃÂ-1.726
Token word
Token ID1573
Feature activation+4.00e-01

Pos logit contributions
avorite+0.493
ucc+0.482
ury+0.468
avour+0.454
ÃÂ+0.450
Neg logit contributions
peat-0.370
beit-0.322
wide-0.308
 primates-0.289
 elect-0.289
Tokened
Token ID276
Feature activation-8.12e-01

Pos logit contributions
peat+1.692
beit+1.458
wide+1.413
 primates+1.249
 elect+1.245
Neg logit contributions
avorite-2.997
ucc-2.839
ury-2.768
ÃÂ-2.768
avour-2.702
Token editorial
Token ID13684
Feature activation-2.23e-01

Pos logit contributions
ucc+4.776
avorite+4.676
ury+4.593
avour+4.475
ÃÂ+4.212
Neg logit contributions
peat-4.434
beit-3.852
wide-3.563
 primates-3.449
 elect-3.381
Token this
Token ID428
Feature activation-1.67e-01

Pos logit contributions
avorite+1.394
ucc+1.373
ury+1.334
avour+1.297
ÃÂ+1.253
Neg logit contributions
peat-1.168
beit-1.018
wide-0.966
 primates-0.915
 elect-0.904
Token week
Token ID1285
Feature activation-8.22e-02

Pos logit contributions
avorite+0.969
ucc+0.953
ury+0.927
avour+0.895
ÃÂ+0.862
Neg logit contributions
peat-0.951
beit-0.837
wide-0.801
 primates-0.759
 elect-0.751
Token that
Token ID326
Feature activation-2.36e-01

Pos logit contributions
avorite+0.528
ucc+0.519
ury+0.503
avour+0.490
ÃÂ+0.478
Neg logit contributions
peat-0.418
beit-0.364
wide-0.346
 primates-0.328
 elect-0.324
Token Britain
Token ID5491
Feature activation+3.38e-02

Pos logit contributions
avorite+1.316
ucc+1.283
ury+1.247
avour+1.203
ÃÂ+1.162
Neg logit contributions
peat-1.408
beit-1.244
wide-1.197
 primates-1.141
 elect-1.129
Token had
Token ID550
Feature activation+2.56e-01

Pos logit contributions
peat+0.195
beit+0.172
wide+0.165
 primates+0.158
 elect+0.155
Neg logit contributions
avorite-0.195
ucc-0.190
ury-0.185
avour-0.179
ÃÂ-0.175
Token "
Token ID366
Feature activation+4.90e-01

Pos logit contributions
peat+1.277
beit+1.109
wide+1.061
 primates+0.999
 elect+0.991
Neg logit contributions
avorite-1.676
ucc-1.640
ury-1.595
avour-1.549
ÃÂ-1.525
Tokenblack
Token ID13424
Feature activation+2.64e-01

Pos logit contributions
peat+2.891
beit+2.555
wide+2.521
 elect+2.318
 primates+2.313
Neg logit contributions
avorite-2.783
ucc-2.687
ury-2.578
ÃÂ-2.537
avour-2.488
Tokenmail
Token ID4529
Feature activation-1.29e-01

Pos logit contributions
peat+1.424
beit+1.253
wide+1.204
 primates+1.138
 elect+1.132
Neg logit contributions
avorite-1.620
ucc-1.579
ury-1.534
avour-1.485
ÃÂ-1.465
Token the
Token ID262
Feature activation+3.51e-01

Pos logit contributions
peat+2.297
beit+2.013
wide+1.940
 elect+1.854
 primates+1.852
Neg logit contributions
avorite-2.855
ucc-2.768
ury-2.691
avour-2.620
ÃÂ-2.595
Token Mid
Token ID7215
Feature activation+6.73e-02

Pos logit contributions
peat+1.870
beit+1.647
wide+1.586
 primates+1.518
 elect+1.516
Neg logit contributions
avorite-2.177
ucc-2.115
ury-2.047
avour-1.993
ÃÂ-1.974
Tokenas
Token ID292
Feature activation+4.28e-02

Pos logit contributions
peat+0.375
beit+0.332
wide+0.318
 primates+0.302
 elect+0.300
Neg logit contributions
avorite-0.400
ucc-0.394
ury-0.381
avour-0.368
ÃÂ-0.361
Token Touch
Token ID15957
Feature activation+5.39e-03

Pos logit contributions
peat+0.281
beit+0.252
wide+0.245
 primates+0.234
 elect+0.233
Neg logit contributions
avorite-0.213
ucc-0.207
ury-0.199
avour-0.192
ÃÂ-0.188
Token that
Token ID326
Feature activation+2.38e-01

Pos logit contributions
peat+0.028
beit+0.025
wide+0.024
 primates+0.023
 elect+0.022
Neg logit contributions
avorite-0.034
ucc-0.033
ury-0.032
avour-0.031
ÃÂ-0.031
Token turns
Token ID4962
Feature activation-1.81e-01

Pos logit contributions
peat+1.258
beit+1.108
wide+1.065
 primates+1.010
 elect+1.009
Neg logit contributions
avorite-1.490
ucc-1.458
ury-1.410
avour-1.368
ÃÂ-1.350
Token everything
Token ID2279
Feature activation-2.13e-02

Pos logit contributions
avorite+1.143
ucc+1.118
ury+1.090
avour+1.056
ÃÂ+1.031
Neg logit contributions
peat-0.940
beit-0.817
wide-0.784
 primates-0.733
 elect-0.727
Token into
Token ID656
Feature activation+4.64e-01

Pos logit contributions
avorite+0.134
ucc+0.131
ury+0.127
avour+0.123
ÃÂ+0.121
Neg logit contributions
peat-0.112
beit-0.098
wide-0.094
 primates-0.089
 elect-0.088
Token s
Token ID264
Feature activation-3.84e-01

Pos logit contributions
peat+2.205
beit+1.906
wide+1.827
 primates+1.730
 elect+1.725
Neg logit contributions
avorite-3.141
ucc-3.061
ury-2.966
avour-2.894
ÃÂ-2.870
Token***
Token ID8162
Feature activation+2.95e-01

Pos logit contributions
avorite+2.065
ucc+2.044
ury+1.972
avour+1.908
ÃÂ+1.825
Neg logit contributions
peat-2.344
beit-2.065
wide-1.989
 primates-1.924
 elect-1.916
Token.
Token ID13
Feature activation-1.83e-01

Pos logit contributions
peat+1.602
beit+1.405
wide+1.359
 primates+1.285
 elect+1.276
Neg logit contributions
avorite-1.809
ucc-1.758
ury-1.703
avour-1.654
ÃÂ-1.634
Token\n
Token ID198
Feature activation-5.46e-02

Pos logit contributions
peat+1.329
beit+1.162
wide+1.114
 primates+1.061
 elect+1.048
Neg logit contributions
avorite-1.535
ucc-1.514
ury-1.468
avour-1.417
ÃÂ-1.400
Token\n
Token ID198
Feature activation-4.83e-01

Pos logit contributions
avorite+0.346
ucc+0.340
ury+0.330
avour+0.320
ÃÂ+0.313
Neg logit contributions
peat-0.283
beit-0.247
wide-0.236
 primates-0.224
 elect-0.222
TokenMan
Token ID5124
Feature activation-3.77e-01

Pos logit contributions
avorite+2.556
ucc+2.545
ury+2.450
avour+2.362
ÃÂ+2.259
Neg logit contributions
peat-2.898
beit-2.617
wide-2.474
 primates-2.453
 elect-2.405
Tokenhattan
Token ID12904
Feature activation-7.69e-01

Pos logit contributions
avorite+2.174
ucc+2.159
ury+2.099
avour+2.013
ÃÂ+1.984
Neg logit contributions
peat-2.128
beit-1.877
wide-1.805
 primates-1.726
 elect-1.700
Token
Token ID851
Feature activation-3.11e-01

Pos logit contributions
ucc+4.283
avorite+4.243
ury+4.135
avour+3.979
ÃÂ+3.950
Neg logit contributions
peat-4.434
beit-3.863
wide-3.605
 primates-3.546
 elect-3.439
Token (
Token ID357
Feature activation-1.03e+00

Pos logit contributions
avorite+1.726
ucc+1.696
ury+1.638
avour+1.583
ÃÂ+1.553
Neg logit contributions
peat-1.850
beit-1.639
wide-1.579
 primates-1.513
 elect-1.498
Token1979
Token ID33581
Feature activation+3.15e-02

Pos logit contributions
ucc+4.705
avorite+4.602
ury+4.508
avour+4.259
uri+4.141
Neg logit contributions
peat-6.688
beit-6.006
 primates-5.759
 elect-5.718
wide-5.644
Token)
Token ID8
Feature activation-2.85e-01

Pos logit contributions
peat+0.147
beit+0.126
wide+0.117
 primates+0.113
 elect+0.110
Neg logit contributions
avorite-0.215
ucc-0.214
ury-0.208
ÃÂ-0.201
avour-0.200
Token\n
Token ID198
Feature activation-2.19e-01

Pos logit contributions
avorite+1.651
ucc+1.630
ury+1.581
avour+1.510
ÃÂ+1.508
Neg logit contributions
peat-1.601
beit-1.428
wide-1.346
 primates-1.322
 elect-1.285
Token\n
Token ID198
Feature activation-3.02e-01

Pos logit contributions
ucc+1.323
avorite+1.311
ury+1.284
avour+1.237
uri+1.185
Neg logit contributions
peat-1.173
beit-1.032
wide-0.972
 primates-0.946
 elect-0.929
TokenGrade
Token ID42233
Feature activation-2.58e-01

Pos logit contributions
ucc+1.239
avorite+1.225
ury+1.188
avour+1.121
uri+1.051
Neg logit contributions
peat-2.170
beit-1.993
 primates-1.914
wide-1.914
 elect-1.881
Token]
Token ID60
Feature activation-3.25e-01

Pos logit contributions
avorite+1.216
ucc+1.185
ury+1.150
avour+1.106
ÃÂ+1.081
Neg logit contributions
peat-1.342
beit-1.193
wide-1.145
 primates-1.098
 elect-1.084
Token::
Token ID3712
Feature activation-4.53e-01

Pos logit contributions
avorite+1.193
ucc+1.143
ury+1.098
avour+1.037
ÃÂ+1.001
Neg logit contributions
peat-2.557
beit-2.337
wide-2.266
 primates-2.198
 elect-2.175
TokenFrom
Token ID4863
Feature activation-2.90e-01

Pos logit contributions
avorite+1.911
ucc+1.867
ury+1.793
avour+1.711
ÃÂ+1.610
Neg logit contributions
peat-3.281
beit-2.987
wide-2.868
 primates-2.819
 elect-2.794
TokenImage
Token ID5159
Feature activation-2.79e-01

Pos logit contributions
avorite+1.155
ucc+1.133
ury+1.082
avour+1.029
ÃÂ+0.981
Neg logit contributions
peat-2.174
beit-1.986
wide-1.905
 primates-1.860
 elect-1.845
Token($
Token ID16763
Feature activation-5.24e-01

Pos logit contributions
avorite+1.541
ucc+1.509
ury+1.469
avour+1.411
ÃÂ+1.380
Neg logit contributions
peat-1.670
beit-1.483
wide-1.419
 primates-1.365
 elect-1.348
Tokenb
Token ID65
Feature activation-1.06e+00

Pos logit contributions
avorite+2.886
ucc+2.870
ury+2.773
avour+2.681
uri+2.606
Neg logit contributions
peat-3.061
beit-2.694
wide-2.551
 primates-2.531
 elect-2.522
Tokenmp
Token ID3149
Feature activation-7.13e-01

Pos logit contributions
ucc+2.739
avorite+2.515
ury+2.442
avour+2.252
uri+1.982
Neg logit contributions
peat-9.159
beit-8.553
wide-8.259
 primates-8.240
 rats-8.172
Token)
Token ID8
Feature activation-4.17e-01

Pos logit contributions
avorite+4.320
ucc+4.311
ury+4.216
avour+4.103
ÃÂ+3.945
Neg logit contributions
peat-3.694
beit-3.202
wide-3.052
 primates-2.952
 elect-2.872
Token $
Token ID720
Feature activation-4.60e-01

Pos logit contributions
avorite+2.446
ucc+2.407
ury+2.372
avour+2.267
ÃÂ+2.215
Neg logit contributions
peat-2.314
beit-2.028
wide-1.940
 primates-1.857
 elect-1.806
TokenSR
Token ID12562
Feature activation-4.31e-01

Pos logit contributions
avorite+2.523
ucc+2.497
ury+2.415
avour+2.327
uri+2.238
Neg logit contributions
peat-2.722
beit-2.417
wide-2.302
 primates-2.269
 elect-2.243
Token =
Token ID796
Feature activation-7.52e-02

Pos logit contributions
avorite+2.086
ucc+2.036
ury+1.962
avour+1.882
ÃÂ+1.799
Neg logit contributions
peat-2.864
beit-2.583
wide-2.491
 primates-2.418
 elect-2.393
Token away
Token ID1497
Feature activation-1.51e-01

Pos logit contributions
peat+0.726
beit+0.641
wide+0.621
 primates+0.585
 elect+0.585
Neg logit contributions
avorite-0.790
ucc-0.772
ury-0.742
avour-0.723
ÃÂ-0.715
Token from
Token ID422
Feature activation-3.66e-01

Pos logit contributions
avorite+0.783
ucc+0.753
ury+0.738
avour+0.703
ÃÂ+0.687
Neg logit contributions
peat-0.954
beit-0.863
wide-0.831
 primates-0.789
 elect-0.783
Token Kirby
Token ID23965
Feature activation-4.70e-01

Pos logit contributions
avorite+2.219
ucc+2.164
ury+2.116
avour+2.035
ÃÂ+1.962
Neg logit contributions
peat-1.977
beit-1.756
wide-1.658
 primates-1.568
 elect-1.545
Token's
Token ID338
Feature activation-3.02e-01

Pos logit contributions
avorite+2.362
ucc+2.284
ury+2.240
avour+2.155
ÃÂ+2.101
Neg logit contributions
peat-3.006
beit-2.707
wide-2.576
 primates-2.502
 elect-2.453
Token bomb
Token ID5194
Feature activation+6.34e-03

Pos logit contributions
avorite+1.838
ucc+1.805
ury+1.765
avour+1.705
ÃÂ+1.639
Neg logit contributions
peat-1.610
beit-1.429
wide-1.346
 primates-1.275
 elect-1.250
Tokenastic
Token ID3477
Feature activation-8.50e-01

Pos logit contributions
peat+0.033
beit+0.029
wide+0.027
 primates+0.026
 elect+0.026
Neg logit contributions
avorite-0.040
ucc-0.039
ury-0.038
avour-0.037
ÃÂ-0.036
Token style
Token ID3918
Feature activation+2.75e-01

Pos logit contributions
avorite+4.863
ucc+4.830
ury+4.754
avour+4.556
uri+4.336
Neg logit contributions
peat-4.655
beit-4.160
wide-3.853
 primates-3.639
 elect-3.525
Token of
Token ID286
Feature activation+2.97e-01

Pos logit contributions
peat+1.453
beit+1.271
wide+1.223
 primates+1.155
 elect+1.148
Neg logit contributions
avorite-1.723
ucc-1.684
ury-1.633
avour-1.585
ÃÂ-1.563
Token storytelling
Token ID23689
Feature activation+4.79e-02

Pos logit contributions
peat+1.612
beit+1.411
wide+1.371
 primates+1.298
 elect+1.295
Neg logit contributions
avorite-1.832
ucc-1.772
ury-1.717
avour-1.668
ÃÂ-1.661
Token.
Token ID13
Feature activation-9.10e-02

Pos logit contributions
peat+0.266
beit+0.235
wide+0.226
 primates+0.214
 elect+0.213
Neg logit contributions
avorite-0.286
ucc-0.280
ury-0.271
avour-0.262
ÃÂ-0.258
Token According
Token ID4784
Feature activation-4.35e-01

Pos logit contributions
avorite+0.535
ucc+0.521
ury+0.504
avour+0.488
ÃÂ+0.481
Neg logit contributions
peat-0.516
beit-0.457
wide-0.441
 primates-0.418
 elect-0.415
TokenDraw
Token ID25302
Feature activation-2.99e-01

Pos logit contributions
ucc+1.051
avorite+1.040
ury+0.945
avour+0.840
ÃÂ+0.717
Neg logit contributions
peat-5.227
beit-4.863
wide-4.720
 primates-4.663
 elect-4.627
Tokening
Token ID278
Feature activation-7.53e-01

Pos logit contributions
avorite+1.728
ucc+1.709
ury+1.659
avour+1.599
ÃÂ+1.545
Neg logit contributions
peat-1.693
beit-1.496
wide-1.433
 primates-1.380
 elect-1.364
Token.
Token ID13
Feature activation-8.95e-01

Pos logit contributions
ucc+3.827
ury+3.701
avorite+3.641
avour+3.500
uri+3.377
Neg logit contributions
peat-4.679
beit-4.185
 primates-3.941
wide-3.927
 elect-3.873
TokenFont
Token ID23252
Feature activation-1.31e+00

Pos logit contributions
ucc+3.258
avorite+3.082
ury+3.058
avour+2.819
uri+2.717
Neg logit contributions
peat-6.747
beit-6.196
 primates-6.019
 elect-5.925
 rats-5.831
Token($
Token ID16763
Feature activation-1.09e+00

Pos logit contributions
avorite+4.972
ucc+4.930
ury+4.740
ÃÂ+4.276
avour+4.227
Neg logit contributions
peat-9.608
beit-8.719
 primates-8.409
wide-8.082
 elect-8.078
TokenFont
Token ID23252
Feature activation-1.24e+00

Pos logit contributions
ucc+3.156
avorite+3.031
ury+2.845
avour+2.680
uri+2.516
Neg logit contributions
peat-8.876
beit-8.123
 primates-7.982
 elect-7.908
 rats-7.904
TokenName
Token ID5376
Feature activation-5.02e-01

Pos logit contributions
ucc+3.210
avorite+3.139
ury+3.007
avour+2.677
uri+2.507
Neg logit contributions
peat-10.488
beit-9.849
 primates-9.389
 rats-9.228
wide-9.216
Token,
Token ID11
Feature activation+7.57e-02

Pos logit contributions
ucc+2.826
avorite+2.821
ury+2.737
avour+2.631
ÃÂ+2.539
Neg logit contributions
peat-2.886
beit-2.551
wide-2.436
 primates-2.357
 elect-2.330
Token$
Token ID3
Feature activation-8.35e-01

Pos logit contributions
peat+0.490
beit+0.442
wide+0.431
 primates+0.407
 elect+0.406
Neg logit contributions
avorite-0.388
ucc-0.370
ury-0.357
avour-0.345
ÃÂ-0.342
TokenFS
Token ID10652
Feature activation-8.35e-01

Pos logit contributions
avorite+3.654
ucc+3.621
ury+3.454
avour+3.321
uri+3.208
Neg logit contributions
peat-5.759
beit-5.147
 primates-4.961
wide-4.954
 elect-4.895
Token2
Token ID17
Feature activation-2.58e-01

Pos logit contributions
ucc+3.212
avorite+3.069
ury+3.020
avour+2.883
uri+2.686
Neg logit contributions
peat-6.266
beit-5.690
wide-5.449
 primates-5.446
 elect-5.380
Token
Token ID564
Feature activation-2.68e-01

Pos logit contributions
avorite+1.053
ucc+1.035
ury+1.004
avour+0.975
ÃÂ+0.959
Neg logit contributions
peat-0.884
beit-0.772
wide-0.740
 primates-0.696
 elect-0.686
Token
Token ID246
Feature activation-2.38e-01

Pos logit contributions
avorite+1.214
ucc+1.181
ury+1.141
avour+1.077
ÃÂ+1.060
Neg logit contributions
peat-1.857
beit-1.682
wide-1.631
 primates-1.568
 elect-1.558
TokenFar
Token ID21428
Feature activation-3.24e-01

Pos logit contributions
avorite+1.078
ucc+1.060
ury+1.015
avour+0.972
ÃÂ+0.935
Neg logit contributions
peat-1.647
beit-1.494
wide-1.440
 primates-1.403
 elect-1.393
Token Away
Token ID21986
Feature activation-4.67e-01

Pos logit contributions
avorite+1.782
ucc+1.746
ury+1.691
avour+1.630
ÃÂ+1.600
Neg logit contributions
peat-1.950
beit-1.733
wide-1.676
 primates-1.604
 elect-1.584
Token
Token ID447
Feature activation-5.78e-01

Pos logit contributions
avorite+2.589
ucc+2.578
ury+2.494
avour+2.420
ÃÂ+2.383
Neg logit contributions
peat-2.708
beit-2.400
wide-2.290
 primates-2.181
 elect-2.167
Token
Token ID247
Feature activation-9.73e-01

Pos logit contributions
avorite+2.340
ucc+2.308
ury+2.211
avour+2.078
ÃÂ+2.046
Neg logit contributions
peat-4.257
beit-3.848
wide-3.746
 primates-3.607
 elect-3.576
Token that
Token ID326
Feature activation-6.24e-01

Pos logit contributions
avorite+5.642
ucc+5.619
ury+5.454
avour+5.303
ÃÂ+5.228
Neg logit contributions
peat-5.173
beit-4.568
wide-4.309
 primates-4.166
 elect-4.066
Token stuck
Token ID7819
Feature activation-4.05e-02

Pos logit contributions
avorite+3.900
ucc+3.816
ury+3.725
avour+3.603
ÃÂ+3.551
Neg logit contributions
peat-3.242
beit-2.822
wide-2.695
 primates-2.540
 elect-2.478
Token in
Token ID287
Feature activation+1.55e-02

Pos logit contributions
avorite+0.256
ucc+0.252
ury+0.245
avour+0.238
ÃÂ+0.233
Neg logit contributions
peat-0.209
beit-0.181
wide-0.173
 primates-0.164
 elect-0.162
Token the
Token ID262
Feature activation+5.01e-02

Pos logit contributions
peat+0.087
beit+0.077
wide+0.074
 primates+0.070
 elect+0.070
Neg logit contributions
avorite-0.091
ucc-0.089
ury-0.087
avour-0.084
ÃÂ-0.082
Token mind
Token ID2000
Feature activation-2.41e-01

Pos logit contributions
peat+0.261
beit+0.228
wide+0.221
 elect+0.210
 primates+0.209
Neg logit contributions
avorite-0.315
ucc-0.309
ury-0.299
avour-0.290
ÃÂ-0.287