TOP ACTIVATIONS
MAX = 1.720

years old and once I turn AA on things slow down
15 , 2010 Subject : Que er magnet ?
13 , 2010 Subject : boys beware This film
4 , 2010 Subject : User reviews are a
26 , 2009 Subject : THE IR ONY OF
5 , 2009 Subject : Que ers are Here
. Clement and Mill wood Junior schools both half a block
9 , 2010 Subject : R ipe for M
6 , 2010 Subject : R iff Tra x
we were hoping Winter ke wl would create ," Br ind
galaxy ( 100 , 000 Light Years ) to
2 , 000 , 000 Light Years ) to
Weight : 65 . 6 grams B earing
position lights were now c anted inward as the wings had
would much prefer the lower res without screen door effect .
Flag Reply 55 Recommend Recommend Share this
since at a 90 F OV even double or 4 x
2 ). Near those lighter v ort ons , the density
Journalism The Nation is reader supported : Chip in $ 10
COL OR _ B GR 2 G RAY ) # smooth

MIN ACTIVATIONS
MIN = -2.349

tablespoon coconut oil and fry until carrots are soft and crunch
at medium - high heat until the onion is clear or
the bell peppers . Fry until the mushrooms get a bit
other ingredients and mix well until the mixture feels like it
cover and let it simmer until most of the water has
it to a boil . Once its boiling , reduce the
imi Kan asis in Un break able Kim my Schmidt ,
soft and crunch y . Once they are ready throw them
and bring it to a boil . Once its boiling ,
the other ingredients and mix well until the mixture feels like
bit soft and nice . Once they are ready , throw
ries , and keep going until the mix is this smooth
broccoli for a few minutes until fork - t ender .
a tablespoon coconut oil and fry until carrots are soft and
the heat and cover and let it simmer until most of
other ingredients and mix well until the mixture feels like it
ai curry paste and mix well . Season with salt ,
broccoli for a few minutes until fork - t ender .
a boil . Once its boiling , reduce the heat and
tax a . Australian Journal of Zo ology 62 ( 5

INTERVAL 0.702 - 1.720
CONTAINS 1.045%

c v 2 . COL OR _ B GR 2 G
the documents some time ago by Julian Assange , the founder
read : If all that wasn 't enough ,
. For photographic evidence of Manson 's chi rog
. H ills borough County residents who find themselves

INTERVAL -0.315 - 0.702
CONTAINS 85.294%

on the street , as part of her project Hey Baby
the best integrity . The first thing we want
up loaded in ro pas aurus rex in our debugger :
pro quo , it s true that while the
the 1980 s when packet loss and congestion were

INTERVAL -1.332 - -0.315
CONTAINS 13.651%

automation and computer ization and enabling online services .
like Google , Yahoo , Duck D uck Go , B
homophobic , misogyn ist , and racist comments . ( In
... to see Shib ata . Shib ata Yu
running one s own pirate crew what it

INTERVAL -2.349 - -1.332
CONTAINS 0.010%

) Mem c py In Memory 4 66 284 4 64
bit soft and nice . Once they are ready , throw
Foundation , told the Ou ach ita Citizen that the think
and cover and let it simmer until most of the water
tax a . Australian Journal of Zo ology 62 ( 5
Token years
Token ID812
Feature activation+1.24e-01

Pos logit contributions
 Signs+2.099
ecause+1.862
ateg+1.802
 Levels+1.715
 herself+1.676
Neg logit contributions
TRY-4.604
tub-4.375
SER-4.372
idious-4.330
hop-4.323
Token old
Token ID1468
Feature activation+8.89e-01

Pos logit contributions
 Signs+0.616
ecause+0.579
ateg+0.560
avorite+0.553
 Levels+0.539
Neg logit contributions
TRY-0.702
SER-0.660
hop-0.659
tub-0.658
EST-0.646
Token and
Token ID290
Feature activation+5.04e-01

Pos logit contributions
 Signs+4.207
ecause+3.824
ateg+3.820
 Levels+3.696
 herself+3.690
Neg logit contributions
TRY-5.156
SER-4.825
tub-4.740
idious-4.719
ORN-4.716
Token once
Token ID1752
Feature activation+1.90e-01

Pos logit contributions
 Signs+2.478
ecause+2.308
ateg+2.248
avorite+2.184
 Levels+2.160
Neg logit contributions
TRY-2.878
SER-2.713
hop-2.685
tub-2.684
idious-2.635
Token I
Token ID314
Feature activation+6.92e-01

Pos logit contributions
 Signs+1.028
ecause+0.968
ateg+0.945
avorite+0.925
 Levels+0.910
Neg logit contributions
TRY-0.990
SER-0.925
hop-0.918
tub-0.916
EST-0.897
Token turn
Token ID1210
Feature activation+1.72e+00

Pos logit contributions
 Signs+2.861
ecause+2.600
ateg+2.577
 Levels+2.436
avorite+2.398
Neg logit contributions
TRY-4.457
SER-4.255
hop-4.188
tub-4.175
EST-4.138
Token AA
Token ID15923
Feature activation+1.33e+00

Pos logit contributions
 Signs+4.336
 Levels+3.547
ateg+3.517
 herself+3.373
 Gates+3.230
Neg logit contributions
TRY-13.336
SER-12.776
hop-12.586
idious-12.543
tub-12.496
Token on
Token ID319
Feature activation+1.34e+00

Pos logit contributions
 Signs+5.403
 herself+4.980
ateg+4.726
 Levels+4.682
 Gates+4.623
Neg logit contributions
TRY-8.188
idious-7.798
arious-7.698
SER-7.654
tub-7.591
Token things
Token ID1243
Feature activation+1.07e+00

Pos logit contributions
 Signs+1.553
ecause+0.653
ateg+0.626
 Levels+0.618
 herself+0.527
Neg logit contributions
TRY-12.606
SER-12.220
arious-12.088
ORN-12.081
hop-12.020
Token slow
Token ID3105
Feature activation+4.10e-01

Pos logit contributions
 Signs+4.222
ateg+3.823
ecause+3.786
 herself+3.718
 Levels+3.597
Neg logit contributions
TRY-6.885
ORN-6.614
SER-6.516
arious-6.360
EST-6.358
Token down
Token ID866
Feature activation-2.17e-01

Pos logit contributions
 Signs+1.787
ecause+1.648
ateg+1.606
 Levels+1.533
avorite+1.532
Neg logit contributions
TRY-2.570
SER-2.433
hop-2.404
tub-2.400
EST-2.364
Token 15
Token ID1315
Feature activation+3.11e-01

Pos logit contributions
TRY+1.097
SER+1.002
hop+1.002
tub+0.996
EST+0.972
Neg logit contributions
 Signs-2.121
ecause-2.120
avorite-2.110
ateg-2.038
estones-1.968
Token,
Token ID11
Feature activation+5.97e-01

Pos logit contributions
 Signs+1.552
ecause+1.458
ateg+1.417
avorite+1.390
 Levels+1.359
Neg logit contributions
TRY-1.759
SER-1.653
hop-1.642
tub-1.637
EST-1.607
Token 2010
Token ID3050
Feature activation+5.46e-01

Pos logit contributions
 Signs+2.704
ecause+2.459
ateg+2.373
 Levels+2.353
 herself+2.275
Neg logit contributions
TRY-3.653
SER-3.439
tub-3.407
hop-3.314
ORN-3.312
Token
Token ID198
Feature activation+3.88e-01

Pos logit contributions
 Signs+2.477
ecause+2.238
ateg+2.208
 Levels+2.139
estones+2.081
Neg logit contributions
TRY-3.297
SER-3.117
tub-3.100
hop-3.089
idious-3.069
Token
Token ID198
Feature activation+3.49e-01

Pos logit contributions
 Signs+1.812
ecause+1.674
ateg+1.635
 Levels+1.571
avorite+1.568
Neg logit contributions
TRY-2.312
SER-2.180
hop-2.163
tub-2.162
idious-2.128
TokenSubject
Token ID19776
Feature activation+1.59e+00

Pos logit contributions
 Signs+1.870
ecause+1.766
ateg+1.717
avorite+1.699
 Levels+1.646
Neg logit contributions
TRY-1.865
SER-1.741
hop-1.730
tub-1.722
EST-1.693
Token:
Token ID25
Feature activation+8.04e-01

Pos logit contributions
 Signs+6.498
 Levels+5.545
 herself+5.516
ateg+5.474
 Gates+5.304
Neg logit contributions
TRY-9.649
idious-9.196
SER-9.182
ORN-9.125
tub-9.045
Token Que
Token ID4670
Feature activation+3.07e-01

Pos logit contributions
 Signs+4.426
ecause+4.093
ateg+3.966
 Levels+3.917
avorite+3.851
Neg logit contributions
TRY-4.088
SER-3.822
tub-3.768
hop-3.746
Unix-3.722
Tokener
Token ID263
Feature activation+2.99e-01

Pos logit contributions
 Signs+1.634
ecause+1.540
ateg+1.500
avorite+1.456
 Levels+1.446
Neg logit contributions
TRY-1.620
SER-1.513
tub-1.503
hop-1.498
idious-1.468
Token magnet
Token ID19972
Feature activation+9.77e-02

Pos logit contributions
 Signs+1.654
ecause+1.557
ateg+1.521
avorite+1.486
 Levels+1.467
Neg logit contributions
TRY-1.521
SER-1.419
hop-1.407
tub-1.405
EST-1.374
Token?
Token ID30
Feature activation+5.97e-01

Pos logit contributions
 Signs+0.529
ecause+0.503
ateg+0.491
avorite+0.485
 Levels+0.471
Neg logit contributions
TRY-0.505
hop-0.471
SER-0.470
tub-0.468
idious-0.458
Token 13
Token ID1511
Feature activation+7.66e-02

Pos logit contributions
TRY+1.495
hop+1.374
SER+1.373
tub+1.352
EST+1.344
Neg logit contributions
avorite-2.923
ecause-2.910
 Signs-2.879
ateg-2.785
estones-2.685
Token,
Token ID11
Feature activation+4.10e-01

Pos logit contributions
 Signs+0.407
ecause+0.401
avorite+0.396
ateg+0.380
estones+0.363
Neg logit contributions
TRY-0.405
SER-0.379
hop-0.376
tub-0.372
EST-0.369
Token 2010
Token ID3050
Feature activation+3.52e-01

Pos logit contributions
 Signs+2.050
ecause+1.916
ateg+1.867
avorite+1.819
 Levels+1.794
Neg logit contributions
TRY-2.311
SER-2.171
hop-2.155
tub-2.152
EST-2.110
Token
Token ID198
Feature activation+3.71e-01

Pos logit contributions
 Signs+1.665
ecause+1.550
ateg+1.509
avorite+1.466
 Levels+1.446
Neg logit contributions
TRY-2.076
SER-1.956
hop-1.943
tub-1.940
EST-1.904
Token
Token ID198
Feature activation+4.51e-01

Pos logit contributions
 Signs+1.741
ecause+1.615
ateg+1.574
avorite+1.520
 Levels+1.511
Neg logit contributions
TRY-2.198
SER-2.072
hop-2.056
tub-2.055
idious-2.019
TokenSubject
Token ID19776
Feature activation+1.58e+00

Pos logit contributions
 Signs+2.374
ecause+2.230
ateg+2.175
avorite+2.128
 Levels+2.090
Neg logit contributions
TRY-2.434
SER-2.277
hop-2.261
tub-2.255
EST-2.214
Token:
Token ID25
Feature activation+7.39e-01

Pos logit contributions
 Signs+6.823
 Levels+5.890
ateg+5.793
 herself+5.712
ecause+5.597
Neg logit contributions
TRY-9.382
SER-8.912
idious-8.824
tub-8.820
ORN-8.791
Token boys
Token ID6510
Feature activation+6.89e-01

Pos logit contributions
 Signs+4.091
ecause+3.796
ateg+3.677
 Levels+3.623
avorite+3.575
Neg logit contributions
TRY-3.744
SER-3.505
tub-3.457
hop-3.440
Unix-3.402
Token beware
Token ID40600
Feature activation+2.00e-01

Pos logit contributions
 Signs+3.852
ecause+3.650
ateg+3.531
avorite+3.416
 Levels+3.404
Neg logit contributions
TRY-3.409
SER-3.224
tub-3.132
hop-3.096
Unix-3.096
Token This
Token ID770
Feature activation+1.73e-01

Pos logit contributions
 Signs+1.074
ecause+1.013
ateg+0.990
avorite+0.970
 Levels+0.950
Neg logit contributions
TRY-1.050
SER-0.981
hop-0.977
tub-0.973
EST-0.953
Token film
Token ID2646
Feature activation-7.92e-02

Pos logit contributions
 Signs+0.977
ecause+0.921
ateg+0.900
avorite+0.880
 Levels+0.869
Neg logit contributions
TRY-0.865
SER-0.806
hop-0.800
tub-0.798
EST-0.780
Token 4
Token ID604
Feature activation+3.37e-01

Pos logit contributions
TRY+1.242
SER+1.149
hop+1.147
tub+1.140
EST+1.115
Neg logit contributions
ecause-2.263
 Signs-2.254
avorite-2.244
ateg-2.173
estones-2.096
Token,
Token ID11
Feature activation+6.68e-01

Pos logit contributions
 Signs+1.645
ecause+1.526
ateg+1.492
 Levels+1.435
avorite+1.434
Neg logit contributions
TRY-1.935
SER-1.819
hop-1.807
tub-1.805
idious-1.774
Token 2010
Token ID3050
Feature activation+5.23e-01

Pos logit contributions
 Signs+3.293
ecause+3.042
ateg+2.977
 Levels+2.884
avorite+2.846
Neg logit contributions
TRY-3.807
SER-3.580
tub-3.552
hop-3.535
idious-3.478
Token
Token ID198
Feature activation+3.78e-01

Pos logit contributions
 Signs+2.377
ecause+2.154
ateg+2.123
 Levels+2.055
 herself+2.000
Neg logit contributions
TRY-3.159
SER-2.985
tub-2.969
hop-2.959
idious-2.933
Token
Token ID198
Feature activation+3.83e-01

Pos logit contributions
 Signs+1.767
ecause+1.636
ateg+1.596
avorite+1.535
 Levels+1.533
Neg logit contributions
TRY-2.246
SER-2.117
hop-2.101
tub-2.100
idious-2.065
TokenSubject
Token ID19776
Feature activation+1.57e+00

Pos logit contributions
 Signs+2.011
ecause+1.891
ateg+1.843
avorite+1.807
 Levels+1.771
Neg logit contributions
TRY-2.065
SER-1.933
hop-1.921
tub-1.915
EST-1.878
Token:
Token ID25
Feature activation+7.97e-01

Pos logit contributions
 Signs+6.404
 herself+5.504
 Levels+5.475
ateg+5.431
 Gates+5.234
Neg logit contributions
TRY-9.511
SER-9.060
idious-9.039
ORN-9.023
tub-8.900
Token User
Token ID11787
Feature activation+5.49e-01

Pos logit contributions
 Signs+4.388
ecause+4.059
ateg+3.929
 Levels+3.880
avorite+3.807
Neg logit contributions
TRY-4.066
SER-3.802
tub-3.748
hop-3.727
Unix-3.703
Token reviews
Token ID8088
Feature activation+8.61e-01

Pos logit contributions
 Signs+3.015
ecause+2.802
ateg+2.746
 Levels+2.671
avorite+2.653
Neg logit contributions
TRY-2.825
SER-2.638
hop-2.612
tub-2.605
Unix-2.554
Token are
Token ID389
Feature activation+5.87e-01

Pos logit contributions
 Signs+4.626
ecause+4.254
ateg+4.144
 Levels+4.094
avorite+3.992
Neg logit contributions
TRY-4.520
SER-4.190
tub-4.129
hop-4.098
EST-4.077
Token a
Token ID257
Feature activation+2.66e-01

Pos logit contributions
 Signs+3.377
ecause+3.153
ateg+3.085
avorite+3.000
 Levels+2.991
Neg logit contributions
TRY-2.873
SER-2.655
tub-2.614
hop-2.613
Unix-2.574
Token 26
Token ID2608
Feature activation+2.34e-01

Pos logit contributions
TRY+1.604
hop+1.470
SER+1.464
tub+1.457
EST+1.423
Neg logit contributions
avorite-3.149
ecause-3.146
 Signs-3.131
ateg-3.009
estones-2.904
Token,
Token ID11
Feature activation+6.08e-01

Pos logit contributions
 Signs+1.180
ecause+1.119
ateg+1.082
avorite+1.075
 Levels+1.035
Neg logit contributions
TRY-1.304
SER-1.225
hop-1.218
tub-1.212
EST-1.192
Token 2009
Token ID3717
Feature activation+5.97e-01

Pos logit contributions
 Signs+2.840
ecause+2.556
ateg+2.447
 Levels+2.433
 herself+2.397
Neg logit contributions
TRY-3.654
tub-3.432
SER-3.421
ORN-3.335
hop-3.322
Token
Token ID198
Feature activation+2.60e-01

Pos logit contributions
 Signs+2.685
ecause+2.411
ateg+2.385
 Levels+2.318
 herself+2.251
Neg logit contributions
TRY-3.625
SER-3.430
tub-3.410
hop-3.395
idious-3.380
Token
Token ID198
Feature activation+2.51e-01

Pos logit contributions
 Signs+1.240
ecause+1.162
ateg+1.128
avorite+1.106
 Levels+1.078
Neg logit contributions
TRY-1.521
SER-1.432
hop-1.424
tub-1.420
EST-1.395
TokenSubject
Token ID19776
Feature activation+1.57e+00

Pos logit contributions
 Signs+1.343
ecause+1.272
ateg+1.235
avorite+1.227
 Levels+1.182
Neg logit contributions
TRY-1.336
SER-1.248
hop-1.240
tub-1.234
EST-1.214
Token:
Token ID25
Feature activation+7.94e-01

Pos logit contributions
 Signs+6.418
 herself+5.506
 Levels+5.506
ateg+5.428
 Gates+5.263
Neg logit contributions
TRY-9.487
SER-9.005
idious-9.004
ORN-8.965
tub-8.867
Token THE
Token ID3336
Feature activation+9.42e-02

Pos logit contributions
 Signs+4.365
ecause+4.030
ateg+3.902
 Levels+3.866
avorite+3.776
Neg logit contributions
TRY-4.059
SER-3.788
tub-3.734
hop-3.729
Unix-3.684
Token IR
Token ID14826
Feature activation-5.14e-01

Pos logit contributions
 Signs+0.512
ecause+0.482
ateg+0.471
avorite+0.460
 Levels+0.453
Neg logit contributions
TRY-0.490
SER-0.457
hop-0.454
tub-0.453
EST-0.444
TokenONY
Token ID40508
Feature activation+5.11e-01

Pos logit contributions
TRY+1.635
SER+1.476
EST+1.438
ORN+1.397
tub+1.395
Neg logit contributions
 Signs-3.798
ecause-3.702
ateg-3.596
avorite-3.581
estones-3.502
Token OF
Token ID3963
Feature activation-5.38e-02

Pos logit contributions
 Signs+2.753
ecause+2.544
ateg+2.476
 Levels+2.428
avorite+2.392
Neg logit contributions
TRY-2.668
SER-2.491
tub-2.488
hop-2.480
idious-2.442
Token 5
Token ID642
Feature activation+3.22e-01

Pos logit contributions
TRY+1.901
hop+1.756
SER+1.752
tub+1.732
EST+1.710
Neg logit contributions
avorite-4.034
ecause-3.988
 Signs-3.902
ateg-3.803
VIDIA-3.722
Token,
Token ID11
Feature activation+7.02e-01

Pos logit contributions
 Signs+1.597
ecause+1.495
ateg+1.455
avorite+1.420
 Levels+1.397
Neg logit contributions
TRY-1.829
SER-1.719
hop-1.707
tub-1.703
EST-1.671
Token 2009
Token ID3717
Feature activation+5.25e-01

Pos logit contributions
 Signs+3.184
ecause+2.867
ateg+2.768
 Levels+2.757
 herself+2.649
Neg logit contributions
TRY-4.329
SER-4.077
tub-4.024
hop-3.915
ORN-3.902
Token
Token ID198
Feature activation+3.55e-01

Pos logit contributions
 Signs+2.387
ecause+2.162
ateg+2.130
 Levels+2.064
estones+2.007
Neg logit contributions
TRY-3.168
SER-2.995
tub-2.977
hop-2.966
idious-2.944
Token
Token ID198
Feature activation+3.22e-01

Pos logit contributions
 Signs+1.665
ecause+1.542
ateg+1.505
avorite+1.449
 Levels+1.445
Neg logit contributions
TRY-2.110
SER-1.989
hop-1.974
tub-1.973
idious-1.939
TokenSubject
Token ID19776
Feature activation+1.54e+00

Pos logit contributions
 Signs+1.712
ecause+1.616
ateg+1.572
avorite+1.553
 Levels+1.507
Neg logit contributions
TRY-1.727
SER-1.614
hop-1.604
tub-1.598
EST-1.570
Token:
Token ID25
Feature activation+8.08e-01

Pos logit contributions
 Signs+6.271
 herself+5.433
 Levels+5.382
ateg+5.346
 Gates+5.148
Neg logit contributions
TRY-9.313
SER-8.865
idious-8.841
ORN-8.826
tub-8.710
Token Que
Token ID4670
Feature activation+2.15e-01

Pos logit contributions
 Signs+4.430
ecause+4.094
ateg+3.962
 Levels+3.922
avorite+3.838
Neg logit contributions
TRY-4.138
SER-3.862
tub-3.811
hop-3.793
Unix-3.758
Tokeners
Token ID364
Feature activation+2.07e-01

Pos logit contributions
 Signs+1.099
ecause+1.040
ateg+1.005
avorite+0.975
 Levels+0.967
Neg logit contributions
TRY-1.176
tub-1.098
SER-1.097
hop-1.086
Unix-1.077
Token are
Token ID389
Feature activation+3.20e-01

Pos logit contributions
 Signs+1.141
ecause+1.076
ateg+1.052
avorite+1.029
 Levels+1.013
Neg logit contributions
TRY-1.056
SER-0.984
hop-0.980
tub-0.976
EST-0.954
Token Here
Token ID3423
Feature activation+4.48e-01

Pos logit contributions
 Signs+1.822
ecause+1.716
ateg+1.677
avorite+1.639
 Levels+1.621
Neg logit contributions
TRY-1.583
SER-1.474
hop-1.460
tub-1.457
EST-1.426
Token.
Token ID13
Feature activation+2.32e-01

Pos logit contributions
 Signs+1.046
ecause+0.987
ateg+0.969
avorite+0.928
 Levels+0.922
Neg logit contributions
TRY-1.099
SER-1.024
tub-1.012
hop-1.004
Unix-0.990
Token Clement
Token ID27311
Feature activation+5.81e-01

Pos logit contributions
 Signs+1.256
ecause+1.178
ateg+1.153
avorite+1.122
 Levels+1.113
Neg logit contributions
TRY-1.213
SER-1.133
tub-1.122
hop-1.121
EST-1.098
Token and
Token ID290
Feature activation+2.50e-01

Pos logit contributions
 Signs+3.128
ecause+2.918
ateg+2.853
 Levels+2.774
avorite+2.766
Neg logit contributions
TRY-3.053
SER-2.841
tub-2.835
hop-2.822
idious-2.780
Token Mill
Token ID9212
Feature activation+1.64e-01

Pos logit contributions
 Signs+1.344
ecause+1.266
ateg+1.236
avorite+1.209
 Levels+1.189
Neg logit contributions
TRY-1.310
SER-1.224
hop-1.216
tub-1.213
EST-1.188
Tokenwood
Token ID3822
Feature activation+6.77e-01

Pos logit contributions
 Signs+0.868
ecause+0.819
ateg+0.801
avorite+0.785
 Levels+0.768
Neg logit contributions
TRY-0.865
SER-0.811
hop-0.806
tub-0.804
EST-0.788
Token Junior
Token ID20000
Feature activation+1.53e+00

Pos logit contributions
 Signs+3.555
ecause+3.298
ateg+3.210
 Levels+3.113
avorite+3.100
Neg logit contributions
TRY-3.659
SER-3.412
tub-3.392
hop-3.387
idious-3.346
Token schools
Token ID4266
Feature activation+6.26e-01

Pos logit contributions
 Signs+7.631
 Levels+6.779
ecause+6.710
ateg+6.697
estones+6.356
Neg logit contributions
TRY-8.295
tub-7.851
idious-7.826
SER-7.689
Unix-7.629
Token both
Token ID1111
Feature activation+2.05e-01

Pos logit contributions
 Signs+3.281
ecause+3.053
ateg+2.984
avorite+2.882
 Levels+2.875
Neg logit contributions
TRY-3.380
SER-3.146
hop-3.096
tub-3.094
Unix-3.053
Token half
Token ID2063
Feature activation-7.22e-02

Pos logit contributions
 Signs+1.081
ecause+1.017
ateg+0.994
avorite+0.974
 Levels+0.954
Neg logit contributions
TRY-1.100
SER-1.031
hop-1.026
tub-1.021
EST-1.001
Token a
Token ID257
Feature activation+3.30e-01

Pos logit contributions
TRY+0.318
SER+0.295
hop+0.294
tub+0.290
EST+0.284
Neg logit contributions
 Signs-0.447
ecause-0.426
ateg-0.417
avorite-0.411
 Levels-0.404
Token block
Token ID2512
Feature activation-1.91e-01

Pos logit contributions
 Signs+1.782
ecause+1.673
ateg+1.631
avorite+1.591
 Levels+1.577
Neg logit contributions
TRY-1.716
SER-1.604
tub-1.592
hop-1.589
idious-1.557
Token 9
Token ID860
Feature activation+2.69e-01

Pos logit contributions
TRY+1.245
SER+1.141
hop+1.139
tub+1.128
EST+1.104
Neg logit contributions
ecause-2.350
 Signs-2.347
avorite-2.340
ateg-2.255
estones-2.177
Token,
Token ID11
Feature activation+4.83e-01

Pos logit contributions
 Signs+1.343
ecause+1.261
ateg+1.226
avorite+1.203
 Levels+1.175
Neg logit contributions
TRY-1.519
SER-1.427
hop-1.418
tub-1.414
EST-1.388
Token 2010
Token ID3050
Feature activation+5.67e-01

Pos logit contributions
 Signs+2.240
ecause+2.035
ateg+1.968
 Levels+1.943
 herself+1.892
Neg logit contributions
TRY-2.916
SER-2.744
tub-2.720
hop-2.667
idious-2.656
Token
Token ID198
Feature activation+3.23e-01

Pos logit contributions
 Signs+2.562
ecause+2.306
ateg+2.279
 Levels+2.210
 herself+2.151
Neg logit contributions
TRY-3.436
SER-3.249
tub-3.231
hop-3.219
idious-3.204
Token
Token ID198
Feature activation+3.15e-01

Pos logit contributions
 Signs+1.527
ecause+1.424
ateg+1.385
avorite+1.347
 Levels+1.326
Neg logit contributions
TRY-1.900
SER-1.790
hop-1.779
tub-1.775
EST-1.743
TokenSubject
Token ID19776
Feature activation+1.52e+00

Pos logit contributions
 Signs+1.703
ecause+1.611
ateg+1.567
avorite+1.559
 Levels+1.501
Neg logit contributions
TRY-1.669
SER-1.556
hop-1.546
tub-1.539
EST-1.513
Token:
Token ID25
Feature activation+7.92e-01

Pos logit contributions
 Signs+6.333
 Levels+5.419
 herself+5.391
ateg+5.367
 Gates+5.189
Neg logit contributions
TRY-9.174
SER-8.723
idious-8.722
ORN-8.655
tub-8.592
Token R
Token ID371
Feature activation-1.38e-01

Pos logit contributions
 Signs+4.380
ecause+4.054
ateg+3.921
 Levels+3.883
avorite+3.815
Neg logit contributions
TRY-4.010
SER-3.754
tub-3.691
hop-3.673
Unix-3.651
Tokenipe
Token ID3757
Feature activation+3.77e-01

Pos logit contributions
TRY+0.675
hop+0.628
SER+0.627
tub+0.622
EST+0.611
Neg logit contributions
 Signs-0.791
ecause-0.747
ateg-0.728
avorite-0.718
 Levels-0.706
Token for
Token ID329
Feature activation+2.25e-01

Pos logit contributions
 Signs+2.061
ecause+1.936
ateg+1.889
avorite+1.842
 Levels+1.825
Neg logit contributions
TRY-1.945
SER-1.817
hop-1.799
tub-1.795
EST-1.757
Token M
Token ID337
Feature activation+2.28e-01

Pos logit contributions
 Signs+1.230
ecause+1.160
ateg+1.133
avorite+1.108
 Levels+1.091
Neg logit contributions
TRY-1.156
SER-1.079
hop-1.071
tub-1.069
EST-1.046
Token 6
Token ID718
Feature activation+6.10e-01

Pos logit contributions
TRY+0.714
SER+0.652
hop+0.651
tub+0.646
EST+0.631
Neg logit contributions
 Signs-1.348
ecause-1.333
avorite-1.319
ateg-1.286
estones-1.240
Token,
Token ID11
Feature activation+5.32e-01

Pos logit contributions
 Signs+2.835
ecause+2.544
ateg+2.530
 Levels+2.451
 herself+2.424
Neg logit contributions
TRY-3.603
SER-3.393
tub-3.377
hop-3.369
idious-3.350
Token 2010
Token ID3050
Feature activation+5.18e-01

Pos logit contributions
 Signs+2.435
ecause+2.208
ateg+2.137
 Levels+2.107
 herself+2.062
Neg logit contributions
TRY-3.243
SER-3.037
tub-3.022
hop-2.965
ORN-2.944
Token
Token ID198
Feature activation+3.58e-01

Pos logit contributions
 Signs+2.367
ecause+2.150
ateg+2.116
 Levels+2.046
estones+1.995
Neg logit contributions
TRY-3.118
SER-2.946
tub-2.926
hop-2.918
idious-2.894
Token
Token ID198
Feature activation+3.77e-01

Pos logit contributions
 Signs+1.683
ecause+1.561
ateg+1.522
avorite+1.469
 Levels+1.461
Neg logit contributions
TRY-2.124
SER-2.003
hop-1.988
tub-1.986
idious-1.951
TokenSubject
Token ID19776
Feature activation+1.50e+00

Pos logit contributions
 Signs+2.021
ecause+1.906
ateg+1.856
avorite+1.835
 Levels+1.780
Neg logit contributions
TRY-2.012
SER-1.879
hop-1.865
tub-1.858
EST-1.827
Token:
Token ID25
Feature activation+7.72e-01

Pos logit contributions
 Signs+6.341
 Levels+5.449
ateg+5.375
 herself+5.373
 Gates+5.195
Neg logit contributions
TRY-9.005
SER-8.563
idious-8.547
ORN-8.491
tub-8.449
Token R
Token ID371
Feature activation-1.51e-01

Pos logit contributions
 Signs+4.300
ecause+3.991
ateg+3.857
 Levels+3.809
avorite+3.749
Neg logit contributions
TRY-3.885
SER-3.636
tub-3.580
hop-3.563
Unix-3.532
Tokeniff
Token ID733
Feature activation+1.30e-01

Pos logit contributions
TRY+0.739
SER+0.688
hop+0.688
tub+0.683
EST+0.677
Neg logit contributions
 Signs-0.856
ecause-0.812
ateg-0.786
avorite-0.785
 Levels-0.764
TokenTra
Token ID15721
Feature activation-3.97e-02

Pos logit contributions
 Signs+0.712
ecause+0.676
ateg+0.659
avorite+0.648
 Levels+0.631
Neg logit contributions
TRY-0.662
hop-0.619
SER-0.618
tub-0.614
EST-0.601
Tokenx
Token ID87
Feature activation-3.95e-02

Pos logit contributions
TRY+0.221
hop+0.208
SER+0.208
tub+0.205
EST+0.202
Neg logit contributions
 Signs-0.200
ecause-0.188
ateg-0.184
avorite-0.180
 Levels-0.176
Token we
Token ID356
Feature activation+4.27e-01

Pos logit contributions
 Signs+1.688
ecause+1.586
ateg+1.547
avorite+1.511
 Levels+1.489
Neg logit contributions
TRY-1.707
SER-1.597
hop-1.586
tub-1.582
EST-1.550
Token were
Token ID547
Feature activation+4.29e-01

Pos logit contributions
 Signs+2.373
ecause+2.236
ateg+2.187
avorite+2.135
 Levels+2.106
Neg logit contributions
TRY-2.161
SER-2.016
hop-1.992
tub-1.985
idious-1.950
Token hoping
Token ID7725
Feature activation+3.50e-01

Pos logit contributions
 Signs+2.265
ecause+2.128
ateg+2.076
avorite+2.028
 Levels+1.997
Neg logit contributions
TRY-2.292
SER-2.146
hop-2.131
tub-2.126
EST-2.083
Token Winter
Token ID10633
Feature activation+2.90e-01

Pos logit contributions
 Signs+1.993
ecause+1.872
ateg+1.833
avorite+1.787
 Levels+1.777
Neg logit contributions
TRY-1.733
SER-1.609
hop-1.597
tub-1.586
idious-1.557
Tokenke
Token ID365
Feature activation+3.24e-01

Pos logit contributions
 Signs+1.503
ecause+1.418
ateg+1.377
avorite+1.359
 Levels+1.330
Neg logit contributions
TRY-1.578
hop-1.475
SER-1.474
tub-1.467
EST-1.434
Tokenwl
Token ID40989
Feature activation+1.49e+00

Pos logit contributions
 Signs+1.819
ecause+1.714
ateg+1.673
avorite+1.641
 Levels+1.617
Neg logit contributions
TRY-1.633
SER-1.518
hop-1.513
tub-1.508
idious-1.472
Token would
Token ID561
Feature activation+8.59e-01

Pos logit contributions
 Signs+6.648
ecause+5.836
ateg+5.811
 Levels+5.797
 herself+5.763
Neg logit contributions
TRY-8.904
SER-8.362
idious-8.263
tub-8.107
hop-8.101
Token create
Token ID2251
Feature activation+2.99e-01

Pos logit contributions
 Signs+4.459
ecause+4.135
ateg+4.054
 herself+3.919
avorite+3.899
Neg logit contributions
TRY-4.608
SER-4.357
hop-4.196
idious-4.189
tub-4.179
Token,"
Token ID553
Feature activation-1.62e-01

Pos logit contributions
 Signs+1.640
ecause+1.541
ateg+1.506
avorite+1.469
 Levels+1.454
Neg logit contributions
TRY-1.541
SER-1.437
hop-1.422
tub-1.420
idious-1.391
Token Br
Token ID1709
Feature activation+3.41e-02

Pos logit contributions
TRY+0.830
SER+0.779
hop+0.773
tub+0.772
EST+0.758
Neg logit contributions
 Signs-0.874
ecause-0.837
ateg-0.817
avorite-0.804
estones-0.772
Tokenind
Token ID521
Feature activation-5.59e-01

Pos logit contributions
 Signs+0.186
ecause+0.175
avorite+0.171
ateg+0.169
 Levels+0.165
Neg logit contributions
TRY-0.174
hop-0.165
tub-0.164
SER-0.164
idious-0.161
Token galaxy
Token ID16161
Feature activation+3.53e-01

Pos logit contributions
 Signs+1.286
ecause+1.212
ateg+1.181
avorite+1.155
 Levels+1.132
Neg logit contributions
TRY-1.342
SER-1.255
hop-1.248
tub-1.246
EST-1.222
Token (
Token ID357
Feature activation-2.60e-01

Pos logit contributions
 Signs+1.986
ecause+1.864
ateg+1.826
avorite+1.785
 Levels+1.769
Neg logit contributions
TRY-1.766
SER-1.651
hop-1.633
tub-1.628
idious-1.593
Token100
Token ID3064
Feature activation+1.35e-01

Pos logit contributions
TRY+1.106
hop+1.016
SER+1.012
tub+1.001
EST+0.984
Neg logit contributions
 Signs-1.656
ecause-1.587
ateg-1.550
avorite-1.525
 Levels-1.493
Token,
Token ID11
Feature activation+6.37e-01

Pos logit contributions
 Signs+0.658
ecause+0.615
ateg+0.599
avorite+0.584
 Levels+0.573
Neg logit contributions
TRY-0.781
SER-0.734
hop-0.730
tub-0.728
EST-0.715
Token000
Token ID830
Feature activation+3.37e-01

Pos logit contributions
 Signs+3.334
ecause+3.089
ateg+3.015
 Levels+2.954
avorite+2.929
Neg logit contributions
TRY-3.395
SER-3.210
tub-3.175
hop-3.155
idious-3.127
Token Light
Token ID4401
Feature activation+1.48e+00

Pos logit contributions
 Signs+1.605
ecause+1.490
ateg+1.448
avorite+1.409
 Levels+1.397
Neg logit contributions
TRY-1.984
SER-1.869
hop-1.853
tub-1.853
idious-1.817
Token Years
Token ID13212
Feature activation+2.69e-01

Pos logit contributions
 Signs+4.299
 Levels+3.606
ecause+3.215
 Flags+3.194
ateg+3.103
Neg logit contributions
TRY-11.376
SER-10.875
tub-10.762
idious-10.650
hop-10.643
Token)
Token ID8
Feature activation-1.61e-01

Pos logit contributions
 Signs+1.357
ecause+1.254
ateg+1.225
avorite+1.196
 Levels+1.190
Neg logit contributions
TRY-1.500
SER-1.413
tub-1.399
hop-1.394
idious-1.373
Token
Token ID198
Feature activation-1.92e-01

Pos logit contributions
TRY+0.933
SER+0.876
hop+0.872
tub+0.868
EST+0.854
Neg logit contributions
 Signs-0.773
ecause-0.726
ateg-0.707
avorite-0.690
 Levels-0.672
Token
Token ID198
Feature activation-1.47e-01

Pos logit contributions
TRY+1.051
hop+0.986
SER+0.983
tub+0.975
EST+0.960
Neg logit contributions
 Signs-0.983
ecause-0.958
avorite-0.953
ateg-0.910
estones-0.865
Tokento
Token ID1462
Feature activation-3.32e-01

Pos logit contributions
TRY+0.656
SER+0.603
hop+0.599
tub+0.594
EST+0.582
Neg logit contributions
 Signs-0.914
ecause-0.869
ateg-0.850
avorite-0.838
 Levels-0.823
Token2
Token ID17
Feature activation+7.06e-01

Pos logit contributions
TRY+0.862
SER+0.795
hop+0.794
tub+0.783
EST+0.772
Neg logit contributions
 Signs-1.225
ecause-1.175
ateg-1.143
avorite-1.129
 Levels-1.099
Token,
Token ID11
Feature activation+3.28e-01

Pos logit contributions
 Signs+3.414
ecause+3.068
 Levels+2.992
ateg+2.990
avorite+2.887
Neg logit contributions
TRY-4.110
SER-3.845
tub-3.842
idious-3.795
hop-3.764
Token000
Token ID830
Feature activation+3.32e-01

Pos logit contributions
 Signs+1.781
ecause+1.666
ateg+1.632
 Levels+1.582
avorite+1.575
Neg logit contributions
TRY-1.694
SER-1.580
tub-1.572
hop-1.569
idious-1.541
Token,
Token ID11
Feature activation+8.58e-01

Pos logit contributions
 Signs+1.350
ecause+1.210
ateg+1.162
 Levels+1.150
avorite+1.126
Neg logit contributions
TRY-2.207
SER-2.092
tub-2.077
hop-2.062
idious-2.040
Token000
Token ID830
Feature activation+5.67e-01

Pos logit contributions
 Signs+4.453
ecause+4.048
ateg+4.016
 Levels+3.947
 herself+3.844
Neg logit contributions
TRY-4.573
SER-4.321
tub-4.277
hop-4.249
idious-4.228
Token Light
Token ID4401
Feature activation+1.48e+00

Pos logit contributions
 Signs+2.098
ecause+1.835
 Levels+1.771
ateg+1.744
avorite+1.675
Neg logit contributions
TRY-3.979
SER-3.789
tub-3.759
hop-3.715
idious-3.700
Token Years
Token ID13212
Feature activation+1.49e-01

Pos logit contributions
 Signs+3.214
 Levels+2.448
ecause+2.076
 Flags+2.059
ateg+2.005
Neg logit contributions
TRY-12.661
SER-12.104
tub-12.003
idious-11.925
hop-11.901
Token)
Token ID8
Feature activation-1.78e-01

Pos logit contributions
 Signs+0.722
ecause+0.661
ateg+0.646
avorite+0.633
 Levels+0.630
Neg logit contributions
TRY-0.860
SER-0.811
tub-0.804
hop-0.800
idious-0.793
Token
Token ID198
Feature activation-1.35e-01

Pos logit contributions
TRY+1.053
SER+0.993
hop+0.987
tub+0.985
idious+0.967
Neg logit contributions
 Signs-0.839
ecause-0.781
ateg-0.759
avorite-0.740
 Levels-0.729
Token
Token ID198
Feature activation-9.15e-02

Pos logit contributions
TRY+0.747
hop+0.701
SER+0.699
tub+0.694
EST+0.683
Neg logit contributions
 Signs-0.685
ecause-0.665
avorite-0.657
ateg-0.633
 Levels-0.601
Tokento
Token ID1462
Feature activation-1.58e-01

Pos logit contributions
TRY+0.403
SER+0.371
hop+0.368
tub+0.365
EST+0.358
Neg logit contributions
 Signs-0.573
ecause-0.545
ateg-0.533
avorite-0.524
 Levels-0.516
Token
Token ID198
Feature activation+3.84e-01

Pos logit contributions
 Signs+1.291
ecause+1.201
ateg+1.170
avorite+1.134
 Levels+1.122
Neg logit contributions
TRY-1.618
SER-1.525
hop-1.515
tub-1.512
idious-1.484
TokenWeight
Token ID25844
Feature activation+2.84e-01

Pos logit contributions
 Signs+2.101
ecause+1.973
ateg+1.940
 Levels+1.874
avorite+1.864
Neg logit contributions
TRY-1.939
SER-1.812
hop-1.805
tub-1.803
idious-1.775
Token:
Token ID25
Feature activation-3.27e-02

Pos logit contributions
 Signs+1.502
ecause+1.399
ateg+1.372
 Levels+1.327
avorite+1.323
Neg logit contributions
TRY-1.513
SER-1.417
hop-1.404
tub-1.402
idious-1.376
Token 65
Token ID6135
Feature activation+2.42e-01

Pos logit contributions
TRY+0.140
SER+0.129
hop+0.128
tub+0.128
EST+0.125
Neg logit contributions
 Signs-0.204
ecause-0.198
ateg-0.193
avorite-0.192
estones-0.185
Token.
Token ID13
Feature activation+5.10e-01

Pos logit contributions
 Signs+1.109
ecause+1.033
ateg+1.013
avorite+0.980
 Levels+0.960
Neg logit contributions
TRY-1.454
SER-1.383
hop-1.369
idious-1.361
tub-1.360
Token6
Token ID21
Feature activation+1.47e+00

Pos logit contributions
 Signs+3.224
ecause+3.024
ateg+3.001
 Levels+2.903
avorite+2.867
Neg logit contributions
TRY-2.143
SER-1.971
tub-1.959
hop-1.935
idious-1.932
Token grams
Token ID16379
Feature activation+6.13e-01

Pos logit contributions
 Signs+4.011
ateg+3.455
ecause+3.292
 herself+3.177
 Levels+3.172
Neg logit contributions
TRY-10.886
SER-10.617
idious-10.484
hop-10.462
ORN-10.367
Token
Token ID198
Feature activation+1.22e-01

Pos logit contributions
 Signs+2.496
ecause+2.263
ateg+2.255
avorite+2.199
 Levels+2.130
Neg logit contributions
TRY-3.898
SER-3.679
idious-3.655
hop-3.633
Unix-3.628
Token
Token ID198
Feature activation+2.38e-01

Pos logit contributions
 Signs+0.586
ecause+0.551
ateg+0.534
avorite+0.527
 Levels+0.509
Neg logit contributions
TRY-0.706
SER-0.665
hop-0.662
tub-0.659
EST-0.648
TokenB
Token ID33
Feature activation+3.51e-01

Pos logit contributions
 Signs+1.394
ecause+1.334
ateg+1.300
avorite+1.270
 Levels+1.250
Neg logit contributions
TRY-1.091
hop-1.011
tub-1.010
SER-1.006
idious-0.997
Tokenearing
Token ID6648
Feature activation-8.84e-02

Pos logit contributions
 Signs+1.514
ecause+1.402
ateg+1.360
avorite+1.318
 Levels+1.295
Neg logit contributions
TRY-2.219
SER-2.099
hop-2.087
tub-2.084
EST-2.045
Token position
Token ID2292
Feature activation+7.39e-01

Pos logit contributions
 Signs+2.726
ecause+2.515
ateg+2.468
 Levels+2.396
avorite+2.381
Neg logit contributions
TRY-2.913
SER-2.753
tub-2.706
hop-2.696
idious-2.654
Token lights
Token ID7588
Feature activation+5.13e-01

Pos logit contributions
 Signs+2.471
ecause+2.119
ateg+2.062
 Levels+2.011
 herself+2.005
Neg logit contributions
TRY-5.305
SER-5.089
tub-5.017
hop-5.001
Unix-4.969
Token were
Token ID547
Feature activation+4.99e-01

Pos logit contributions
 Signs+2.295
ecause+2.111
ateg+2.048
 Levels+1.977
 herself+1.965
Neg logit contributions
TRY-3.142
SER-2.973
hop-2.919
tub-2.913
ORN-2.890
Token now
Token ID783
Feature activation+2.52e-01

Pos logit contributions
 Signs+2.618
ecause+2.439
ateg+2.379
avorite+2.310
 Levels+2.307
Neg logit contributions
TRY-2.671
SER-2.504
hop-2.481
tub-2.470
idious-2.425
Token c
Token ID269
Feature activation+7.17e-01

Pos logit contributions
 Signs+1.236
ecause+1.155
ateg+1.124
avorite+1.096
 Levels+1.079
Neg logit contributions
TRY-1.443
SER-1.356
hop-1.347
tub-1.344
EST-1.319
Tokenanted
Token ID4126
Feature activation+1.46e+00

Pos logit contributions
 Signs+4.089
ecause+3.876
ateg+3.825
avorite+3.730
 Levels+3.613
Neg logit contributions
TRY-3.516
SER-3.275
tub-3.210
hop-3.199
Unix-3.169
Token inward
Token ID29879
Feature activation+2.49e-01

Pos logit contributions
 Signs+6.927
 herself+6.289
 Levels+6.174
ecause+6.166
ateg+6.130
Neg logit contributions
TRY-7.991
SER-7.489
Unix-7.356
arious-7.349
tub-7.333
Token as
Token ID355
Feature activation+1.34e-02

Pos logit contributions
 Signs+1.299
ecause+1.216
ateg+1.187
avorite+1.155
 Levels+1.144
Neg logit contributions
TRY-1.345
SER-1.263
hop-1.249
tub-1.248
EST-1.221
Token the
Token ID262
Feature activation+1.48e-02

Pos logit contributions
 Signs+0.078
ecause+0.074
ateg+0.072
avorite+0.071
 Levels+0.069
Neg logit contributions
TRY-0.064
hop-0.060
SER-0.060
tub-0.059
EST-0.058
Token wings
Token ID12098
Feature activation+1.76e-01

Pos logit contributions
 Signs+0.080
ecause+0.076
ateg+0.074
avorite+0.073
 Levels+0.071
Neg logit contributions
TRY-0.077
SER-0.072
hop-0.072
tub-0.072
EST-0.070
Token had
Token ID550
Feature activation-5.65e-02

Pos logit contributions
 Signs+0.820
ecause+0.764
ateg+0.743
avorite+0.723
 Levels+0.711
Neg logit contributions
TRY-1.046
SER-0.986
hop-0.980
tub-0.978
EST-0.960
Token would
Token ID561
Feature activation-8.88e-02

Pos logit contributions
 Signs+1.212
ecause+1.142
ateg+1.114
avorite+1.096
 Levels+1.070
Neg logit contributions
TRY-1.213
hop-1.133
SER-1.132
tub-1.127
EST-1.100
Token much
Token ID881
Feature activation-2.46e-01

Pos logit contributions
TRY+0.450
hop+0.423
tub+0.421
SER+0.417
EST+0.408
Neg logit contributions
 Signs-0.490
ecause-0.461
ateg-0.453
avorite-0.447
estones-0.435
Token prefer
Token ID4702
Feature activation+4.81e-01

Pos logit contributions
TRY+1.105
hop+1.041
tub+1.024
SER+1.021
EST+0.994
Neg logit contributions
 Signs-1.485
ecause-1.403
ateg-1.384
avorite-1.374
estones-1.334
Token the
Token ID262
Feature activation+3.38e-01

Pos logit contributions
 Signs+2.571
ecause+2.418
ateg+2.358
avorite+2.297
 Levels+2.276
Neg logit contributions
TRY-2.531
SER-2.371
hop-2.344
tub-2.339
idious-2.294
Token lower
Token ID2793
Feature activation+4.75e-01

Pos logit contributions
 Signs+1.831
ecause+1.718
ateg+1.677
avorite+1.637
 Levels+1.622
Neg logit contributions
TRY-1.759
SER-1.644
hop-1.627
tub-1.623
idious-1.589
Token res
Token ID581
Feature activation+1.46e+00

Pos logit contributions
 Signs+2.368
ecause+2.206
ateg+2.151
avorite+2.085
 Levels+2.085
Neg logit contributions
TRY-2.681
SER-2.513
hop-2.484
tub-2.479
idious-2.438
Token without
Token ID1231
Feature activation+5.90e-01

Pos logit contributions
 Signs+6.849
ecause+6.194
ateg+6.099
 Levels+5.998
 herself+5.954
Neg logit contributions
TRY-8.160
SER-7.597
Unix-7.407
ORN-7.346
arious-7.332
Token screen
Token ID3159
Feature activation+2.39e-01

Pos logit contributions
 Signs+3.021
ecause+2.812
ateg+2.745
avorite+2.677
 Levels+2.647
Neg logit contributions
TRY-3.256
SER-3.058
hop-3.013
tub-3.005
Unix-2.955
Token door
Token ID3420
Feature activation+3.51e-01

Pos logit contributions
 Signs+1.242
ecause+1.175
ateg+1.121
avorite+1.098
 Levels+1.093
Neg logit contributions
TRY-1.301
SER-1.217
Unix-1.186
idious-1.181
hop-1.180
Token effect
Token ID1245
Feature activation+6.21e-01

Pos logit contributions
 Signs+2.089
ecause+1.975
ateg+1.927
avorite+1.886
 Levels+1.869
Neg logit contributions
TRY-1.646
SER-1.527
hop-1.506
tub-1.503
EST-1.469
Token.
Token ID13
Feature activation-9.57e-02

Pos logit contributions
 Signs+3.336
ecause+3.132
ateg+3.045
avorite+2.955
 Levels+2.951
Neg logit contributions
TRY-3.269
SER-3.060
tub-3.001
hop-2.997
idious-2.948
TokenFlag
Token ID34227
Feature activation+5.80e-01

Pos logit contributions
 Signs+0.719
ecause+0.665
ateg+0.640
avorite+0.627
 Levels+0.608
Neg logit contributions
TRY-1.180
SER-1.118
hop-1.114
tub-1.106
EST-1.093
Token Reply
Token ID14883
Feature activation+6.86e-01

Pos logit contributions
 Signs+2.925
ecause+2.709
ateg+2.651
avorite+2.576
 Levels+2.572
Neg logit contributions
TRY-3.243
SER-3.052
hop-3.017
tub-3.016
EST-2.954
Token
Token ID198
Feature activation-1.18e-01

Pos logit contributions
 Signs+3.117
ecause+2.828
ateg+2.790
 Levels+2.701
avorite+2.636
Neg logit contributions
TRY-4.159
SER-3.929
tub-3.894
hop-3.887
idious-3.852
Token
Token ID198
Feature activation-5.13e-02

Pos logit contributions
TRY+0.655
hop+0.616
SER+0.614
tub+0.606
EST+0.599
Neg logit contributions
 Signs-0.598
ecause-0.581
avorite-0.570
ateg-0.554
estones-0.526
Token55
Token ID2816
Feature activation+1.87e-01

Pos logit contributions
TRY+0.214
SER+0.197
hop+0.195
tub+0.193
EST+0.189
Neg logit contributions
 Signs-0.332
ecause-0.317
ateg-0.311
avorite-0.307
 Levels-0.300
Token Recommend
Token ID30361
Feature activation+1.44e+00

Pos logit contributions
 Signs+1.042
ecause+1.006
ateg+0.963
avorite+0.955
estones+0.933
Neg logit contributions
TRY-0.936
hop-0.873
SER-0.871
tub-0.866
EST-0.856
Token
Token ID198
Feature activation+4.36e-02

Pos logit contributions
 Signs+5.401
ateg+4.678
 Levels+4.662
 herself+4.532
ecause+4.402
Neg logit contributions
TRY-9.332
idious-8.896
SER-8.874
tub-8.828
ORN-8.749
Token
Token ID198
Feature activation+2.45e-01

Pos logit contributions
 Signs+0.216
ecause+0.207
avorite+0.201
ateg+0.199
 Levels+0.189
Neg logit contributions
TRY-0.247
hop-0.232
SER-0.232
tub-0.230
EST-0.226
TokenRecommend
Token ID41248
Feature activation+9.42e-01

Pos logit contributions
 Signs+1.034
ecause+0.956
ateg+0.926
avorite+0.899
 Levels+0.882
Neg logit contributions
TRY-1.568
SER-1.484
hop-1.475
tub-1.473
EST-1.448
Token Share
Token ID8734
Feature activation+5.48e-01

Pos logit contributions
 Signs+3.727
ecause+3.325
ateg+3.248
avorite+3.170
 Levels+3.145
Neg logit contributions
TRY-6.284
tub-5.941
SER-5.934
idious-5.882
hop-5.854
Token this
Token ID428
Feature activation+1.41e-01

Pos logit contributions
 Signs+2.524
ecause+2.333
ateg+2.284
avorite+2.197
 Levels+2.187
Neg logit contributions
TRY-3.296
SER-3.104
hop-3.073
tub-3.071
idious-3.024
Token since
Token ID1201
Feature activation+8.65e-02

Pos logit contributions
 Signs+3.698
ecause+3.417
ateg+3.302
 Levels+3.167
 herself+3.128
Neg logit contributions
TRY-4.884
SER-4.625
hop-4.533
idious-4.524
tub-4.524
Token at
Token ID379
Feature activation+3.43e-01

Pos logit contributions
 Signs+0.504
ecause+0.477
ateg+0.466
avorite+0.456
 Levels+0.450
Neg logit contributions
TRY-0.415
SER-0.386
hop-0.382
tub-0.382
EST-0.373
Token a
Token ID257
Feature activation+2.08e-01

Pos logit contributions
 Signs+1.940
ecause+1.804
ateg+1.794
 Levels+1.740
 herself+1.714
Neg logit contributions
TRY-1.704
SER-1.599
tub-1.567
hop-1.560
idious-1.544
Token 90
Token ID4101
Feature activation+3.37e-01

Pos logit contributions
 Signs+1.411
ecause+1.346
ateg+1.320
avorite+1.299
 Levels+1.281
Neg logit contributions
TRY-0.795
SER-0.724
hop-0.717
tub-0.714
EST-0.694
Token F
Token ID376
Feature activation+5.20e-01

Pos logit contributions
 Signs+2.117
ecause+2.000
ateg+1.963
 Levels+1.914
avorite+1.908
Neg logit contributions
TRY-1.466
SER-1.352
hop-1.335
tub-1.335
idious-1.299
TokenOV
Token ID8874
Feature activation+1.43e+00

Pos logit contributions
 Signs+3.911
ateg+3.735
ecause+3.714
 Levels+3.622
 herself+3.570
Neg logit contributions
TRY-1.511
SER-1.351
hop-1.345
tub-1.334
idious-1.319
Token even
Token ID772
Feature activation+9.31e-01

Pos logit contributions
 Signs+6.152
ateg+5.503
 herself+5.424
ecause+5.392
 Levels+5.318
Neg logit contributions
TRY-8.645
SER-8.213
tub-8.125
idious-8.012
hop-7.951
Token double
Token ID4274
Feature activation+1.34e-01

Pos logit contributions
 Signs+4.019
ecause+3.539
 herself+3.518
ateg+3.513
 Levels+3.454
Neg logit contributions
TRY-5.751
SER-5.437
hop-5.342
ORN-5.310
tub-5.283
Token or
Token ID393
Feature activation+1.47e-01

Pos logit contributions
 Signs+0.750
ecause+0.708
ateg+0.691
avorite+0.677
 Levels+0.666
Neg logit contributions
TRY-0.673
SER-0.627
hop-0.623
tub-0.621
EST-0.607
Token 4
Token ID604
Feature activation+1.14e+00

Pos logit contributions
 Signs+0.898
ecause+0.855
ateg+0.834
avorite+0.824
 Levels+0.805
Neg logit contributions
TRY-0.664
SER-0.612
hop-0.611
tub-0.610
EST-0.593
Tokenx
Token ID87
Feature activation+3.10e-01

Pos logit contributions
 Signs+4.545
ecause+4.041
ateg+3.973
 Levels+3.925
 herself+3.845
Neg logit contributions
TRY-7.227
idious-6.859
SER-6.794
arious-6.757
tub-6.680
Token 2
Token ID362
Feature activation+3.02e-01

Pos logit contributions
 Signs+1.125
ecause+1.036
ateg+1.013
 Levels+0.993
avorite+0.981
Neg logit contributions
TRY-1.191
SER-1.112
hop-1.112
tub-1.094
idious-1.087
Token).
Token ID737
Feature activation+2.45e-01

Pos logit contributions
 Signs+1.472
ecause+1.373
ateg+1.322
avorite+1.302
 Levels+1.291
Neg logit contributions
TRY-1.723
SER-1.607
tub-1.602
hop-1.592
idious-1.589
Token Near
Token ID20173
Feature activation+2.44e-01

Pos logit contributions
 Signs+1.193
ecause+1.125
ateg+1.093
avorite+1.074
 Levels+1.038
Neg logit contributions
TRY-1.405
SER-1.323
hop-1.314
tub-1.310
EST-1.288
Token those
Token ID883
Feature activation+4.09e-01

Pos logit contributions
 Signs+0.795
ecause+0.716
ateg+0.691
avorite+0.660
 Levels+0.649
Neg logit contributions
TRY-1.793
SER-1.712
hop-1.703
tub-1.690
idious-1.669
Token lighter
Token ID14871
Feature activation+7.42e-01

Pos logit contributions
 Signs+2.057
ecause+1.916
ateg+1.880
avorite+1.836
 Levels+1.812
Neg logit contributions
TRY-2.266
SER-2.135
hop-2.109
tub-2.097
Unix-2.078
Token v
Token ID410
Feature activation+1.41e+00

Pos logit contributions
 Signs+3.957
ateg+3.660
ecause+3.640
avorite+3.556
 Levels+3.546
Neg logit contributions
TRY-3.838
SER-3.594
hop-3.520
Unix-3.504
ORN-3.456
Tokenort
Token ID419
Feature activation+2.71e-01

Pos logit contributions
 Signs+6.120
ateg+5.842
ecause+5.711
estones+5.473
 herself+5.298
Neg logit contributions
TRY-8.308
SER-7.862
Unix-7.604
ORN-7.572
idious-7.526
Tokenons
Token ID684
Feature activation+5.67e-01

Pos logit contributions
 Signs+1.756
ecause+1.681
ateg+1.637
avorite+1.628
 Levels+1.588
Neg logit contributions
TRY-1.122
hop-1.034
SER-1.031
tub-1.026
EST-0.995
Token,
Token ID11
Feature activation+4.64e-01

Pos logit contributions
 Signs+2.449
ecause+2.266
ateg+2.223
avorite+2.162
 Levels+2.134
Neg logit contributions
TRY-3.483
SER-3.314
hop-3.242
Unix-3.235
tub-3.222
Token the
Token ID262
Feature activation+4.70e-01

Pos logit contributions
 Signs+2.168
ecause+2.023
ateg+1.969
 Levels+1.933
avorite+1.903
Neg logit contributions
TRY-2.706
SER-2.564
Unix-2.523
hop-2.519
tub-2.504
Token density
Token ID12109
Feature activation+3.66e-01

Pos logit contributions
 Signs+2.012
ecause+1.862
ateg+1.824
 Levels+1.793
avorite+1.772
Neg logit contributions
TRY-2.901
SER-2.758
hop-2.712
Unix-2.670
tub-2.668
Token Journalism
Token ID37406
Feature activation-2.58e-01

Pos logit contributions
TRY+1.448
SER+1.425
hop+1.388
tub+1.375
ORN+1.352
Neg logit contributions
avorite-1.301
ecause-1.300
 Signs-1.284
ateg-1.260
estones-1.235
Token The
Token ID383
Feature activation+4.92e-01

Pos logit contributions
TRY+1.385
hop+1.302
SER+1.300
EST+1.274
ORN+1.274
Neg logit contributions
 Signs-1.297
avorite-1.281
ecause-1.267
ateg-1.212
estones-1.148
Token Nation
Token ID8741
Feature activation-3.18e-01

Pos logit contributions
 Signs+2.453
ecause+2.258
ateg+2.210
 Levels+2.150
avorite+2.133
Neg logit contributions
TRY-2.776
SER-2.612
hop-2.602
tub-2.589
idious-2.553
Token is
Token ID318
Feature activation+1.88e-01

Pos logit contributions
TRY+1.599
SER+1.509
hop+1.506
tub+1.489
EST+1.470
Neg logit contributions
 Signs-1.750
ecause-1.677
avorite-1.617
ateg-1.611
 Levels-1.558
Token reader
Token ID9173
Feature activation-2.49e-01

Pos logit contributions
 Signs+0.914
ecause+0.849
ateg+0.835
avorite+0.793
 Levels+0.790
Neg logit contributions
TRY-1.074
SER-1.005
hop-1.004
tub-0.997
idious-0.994
Token supported
Token ID4855
Feature activation+1.40e+00

Pos logit contributions
TRY+1.680
SER+1.633
hop+1.599
tub+1.582
EST+1.579
Neg logit contributions
 Signs-0.960
ecause-0.935
avorite-0.860
ateg-0.840
estones-0.806
Token:
Token ID25
Feature activation+7.76e-01

Pos logit contributions
 Signs+5.826
ecause+5.077
 Levels+5.045
ateg+5.041
 herself+4.907
Neg logit contributions
TRY-8.642
ORN-8.311
tub-8.207
SER-8.113
arious-8.055
Token Chip
Token ID17869
Feature activation+1.42e-02

Pos logit contributions
 Signs+2.840
ecause+2.466
ateg+2.369
 Levels+2.299
 Gates+2.223
Neg logit contributions
TRY-5.384
SER-5.163
hop-5.145
tub-5.141
idious-5.137
Token in
Token ID287
Feature activation+2.54e-01

Pos logit contributions
 Signs+0.088
ecause+0.085
avorite+0.083
ateg+0.083
estones+0.080
Neg logit contributions
TRY-0.061
SER-0.057
hop-0.057
EST-0.056
tub-0.056
Token $
Token ID720
Feature activation+4.83e-01

Pos logit contributions
 Signs+1.721
ecause+1.642
ateg+1.606
avorite+1.595
 Levels+1.546
Neg logit contributions
TRY-0.986
SER-0.897
tub-0.882
hop-0.879
EST-0.869
Token10
Token ID940
Feature activation+1.25e-01

Pos logit contributions
 Signs+2.696
ecause+2.545
ateg+2.511
 Levels+2.447
estones+2.390
Neg logit contributions
TRY-2.395
idious-2.231
SER-2.199
hop-2.198
Unix-2.196
Token COL
Token ID20444
Feature activation+1.01e-01

Pos logit contributions
 Signs+2.859
ecause+2.648
 Levels+2.569
ateg+2.564
avorite+2.524
Neg logit contributions
TRY-3.654
hop-3.436
idious-3.398
tub-3.385
SER-3.381
TokenOR
Token ID1581
Feature activation+1.09e+00

Pos logit contributions
 Signs+0.781
ecause+0.751
ateg+0.737
avorite+0.730
 Levels+0.718
Neg logit contributions
TRY-0.296
SER-0.262
hop-0.256
tub-0.254
EST-0.249
Token_
Token ID62
Feature activation+3.31e-02

Pos logit contributions
 Signs+5.946
ateg+5.474
 Levels+5.373
 herself+5.302
ecause+5.280
Neg logit contributions
TRY-5.115
idious-4.975
tub-4.839
arious-4.802
hop-4.772
TokenB
Token ID33
Feature activation+5.66e-01

Pos logit contributions
 Signs+0.214
ecause+0.204
ateg+0.200
avorite+0.196
 Levels+0.193
Neg logit contributions
TRY-0.138
SER-0.127
hop-0.126
tub-0.125
EST-0.123
TokenGR
Token ID10761
Feature activation+3.92e-01

Pos logit contributions
 Signs+3.253
ecause+3.054
ateg+3.009
 Levels+2.912
avorite+2.902
Neg logit contributions
TRY-2.693
hop-2.504
tub-2.498
SER-2.494
idious-2.474
Token2
Token ID17
Feature activation+1.40e+00

Pos logit contributions
 Signs+2.502
ecause+2.373
ateg+2.328
avorite+2.278
 Levels+2.259
Neg logit contributions
TRY-1.659
SER-1.523
hop-1.513
tub-1.508
idious-1.470
TokenG
Token ID38
Feature activation+4.56e-01

Pos logit contributions
 Signs+5.917
 Levels+5.181
ateg+5.151
ecause+5.145
 herself+5.142
Neg logit contributions
idious-8.115
TRY-8.019
arious-7.806
tub-7.748
hop-7.627
TokenRAY
Token ID30631
Feature activation+5.10e-01

Pos logit contributions
 Signs+2.039
ecause+1.869
ateg+1.836
 Levels+1.757
avorite+1.736
Neg logit contributions
TRY-2.764
hop-2.613
SER-2.611
tub-2.607
idious-2.590
Token )
Token ID1267
Feature activation+1.67e-01

Pos logit contributions
 Signs+2.308
ecause+2.091
ateg+2.061
 Levels+1.993
 herself+1.942
Neg logit contributions
TRY-3.087
SER-2.901
tub-2.890
hop-2.887
idious-2.884
Token #
Token ID1303
Feature activation-3.66e-01

Pos logit contributions
 Signs+0.739
ecause+0.684
ateg+0.667
 Levels+0.652
avorite+0.647
Neg logit contributions
TRY-1.026
SER-0.969
tub-0.952
hop-0.950
idious-0.947
Token smooth
Token ID7209
Feature activation+2.53e-01

Pos logit contributions
TRY+1.302
hop+1.180
SER+1.176
tub+1.173
EST+1.129
Neg logit contributions
 Signs-2.579
ecause-2.449
ateg-2.391
avorite-2.369
 Levels-2.332
Token tablespoon
Token ID33107
Feature activation-8.03e-01

Pos logit contributions
TRY+1.669
tub+1.538
SER+1.531
hop+1.523
EST+1.482
Neg logit contributions
 Signs-2.422
ecause-2.311
ateg-2.254
avorite-2.226
 Levels-2.178
Token coconut
Token ID20132
Feature activation-2.60e-01

Pos logit contributions
TRY+3.888
tub+3.644
hop+3.628
SER+3.604
EST+3.472
Neg logit contributions
 Signs-4.561
ecause-4.288
ateg-4.183
avorite-4.146
estones-4.091
Token oil
Token ID3056
Feature activation-5.89e-01

Pos logit contributions
TRY+1.169
SER+1.082
hop+1.070
tub+1.067
EST+1.041
Neg logit contributions
 Signs-1.594
ecause-1.511
ateg-1.480
avorite-1.447
 Levels-1.433
Token and
Token ID290
Feature activation-9.40e-01

Pos logit contributions
TRY+2.897
tub+2.702
hop+2.699
SER+2.690
EST+2.616
Neg logit contributions
 Signs-3.314
ecause-3.115
ateg-3.063
avorite-3.052
 Levels-2.943
Token fry
Token ID32959
Feature activation-1.58e+00

Pos logit contributions
TRY+4.181
SER+3.803
tub+3.789
hop+3.758
EST+3.633
Neg logit contributions
 Signs-5.669
ecause-5.421
avorite-5.358
ateg-5.253
estones-5.197
Token until
Token ID1566
Feature activation-2.35e+00

Pos logit contributions
TRY+7.276
hop+6.866
tub+6.838
SER+6.756
EST+6.480
Neg logit contributions
 Signs-8.742
ecause-8.184
avorite-8.144
ateg-8.108
estones-7.685
Token carrots
Token ID34397
Feature activation-8.57e-01

Pos logit contributions
TRY+9.172
tub+8.688
 attachment+8.491
SER+8.466
hop+8.428
Neg logit contributions
avorite-12.776
 Signs-12.761
ecause-12.326
ateg-11.877
estones-11.569
Token are
Token ID389
Feature activation-7.49e-01

Pos logit contributions
TRY+4.027
tub+3.836
hop+3.772
SER+3.701
EST+3.634
Neg logit contributions
 Signs-4.876
ecause-4.637
avorite-4.599
ateg-4.539
 Levels-4.384
Token soft
Token ID2705
Feature activation-9.71e-02

Pos logit contributions
TRY+3.195
tub+3.009
SER+2.938
hop+2.912
EST+2.859
Neg logit contributions
 Signs-4.623
ecause-4.420
avorite-4.377
ateg-4.322
 Levels-4.176
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
TRY+0.513
SER+0.480
hop+0.477
tub+0.476
EST+0.466
Neg logit contributions
 Signs-0.518
ecause-0.487
ateg-0.476
avorite-0.466
 Levels-0.458
Token crunch
Token ID23527
Feature activation+1.64e-01

Pos logit contributions
TRY+1.679
tub+1.559
SER+1.555
hop+1.542
EST+1.510
Neg logit contributions
 Signs-2.131
ecause-2.022
ateg-1.986
avorite-1.971
 Levels-1.916
Token at
Token ID379
Feature activation-8.38e-01

Pos logit contributions
TRY+3.297
tub+3.045
hop+3.024
SER+3.000
EST+2.901
Neg logit contributions
 Signs-4.870
ecause-4.659
avorite-4.595
ateg-4.537
 Levels-4.373
Token medium
Token ID7090
Feature activation-4.44e-01

Pos logit contributions
TRY+4.306
SER+4.023
tub+3.989
hop+3.949
EST+3.893
Neg logit contributions
 Signs-4.489
ecause-4.314
avorite-4.190
ateg-4.161
estones-3.989
Token-
Token ID12
Feature activation-3.60e-01

Pos logit contributions
TRY+2.107
SER+1.957
hop+1.937
tub+1.927
EST+1.895
Neg logit contributions
 Signs-2.608
ecause-2.480
ateg-2.415
avorite-2.394
 Levels-2.321
Tokenhigh
Token ID8929
Feature activation-5.58e-01

Pos logit contributions
TRY+1.637
SER+1.513
hop+1.502
tub+1.495
EST+1.459
Neg logit contributions
 Signs-2.200
ecause-2.083
ateg-2.035
avorite-2.009
 Levels-1.970
Token heat
Token ID4894
Feature activation-8.84e-01

Pos logit contributions
TRY+2.600
SER+2.411
hop+2.394
tub+2.369
EST+2.353
Neg logit contributions
 Signs-3.307
ecause-3.147
avorite-3.065
ateg-3.065
estones-2.974
Token until
Token ID1566
Feature activation-2.27e+00

Pos logit contributions
TRY+3.581
hop+3.232
tub+3.230
SER+3.211
EST+3.140
Neg logit contributions
 Signs-5.672
ecause-5.439
avorite-5.379
ateg-5.323
estones-5.172
Token the
Token ID262
Feature activation-7.95e-01

Pos logit contributions
TRY+8.895
tub+8.286
SER+8.140
ORN+8.094
 attachment+8.055
Neg logit contributions
avorite-12.576
 Signs-12.360
ecause-12.165
ateg-11.604
estones-11.331
Token onion
Token ID21670
Feature activation-6.59e-01

Pos logit contributions
TRY+3.326
tub+3.069
SER+3.052
hop+3.034
EST+2.986
Neg logit contributions
 Signs-4.950
ecause-4.762
ateg-4.681
avorite-4.665
estones-4.477
Token is
Token ID318
Feature activation-9.20e-01

Pos logit contributions
TRY+3.407
tub+3.204
SER+3.172
hop+3.153
EST+3.124
Neg logit contributions
 Signs-3.484
ecause-3.333
avorite-3.278
ateg-3.250
 Levels-3.092
Token clear
Token ID1598
Feature activation-9.08e-01

Pos logit contributions
TRY+3.918
tub+3.686
SER+3.617
hop+3.571
EST+3.533
Neg logit contributions
 Signs-5.640
ecause-5.379
avorite-5.377
ateg-5.263
 Levels-5.081
Token or
Token ID393
Feature activation-2.85e-01

Pos logit contributions
TRY+4.259
tub+4.039
SER+3.918
hop+3.908
EST+3.908
Neg logit contributions
 Signs-5.062
avorite-4.875
ecause-4.843
ateg-4.715
 Levels-4.630
Token the
Token ID262
Feature activation-3.31e-01

Pos logit contributions
TRY+2.092
SER+1.948
hop+1.938
tub+1.937
EST+1.890
Neg logit contributions
 Signs-2.323
ecause-2.199
ateg-2.147
avorite-2.112
 Levels-2.063
Token bell
Token ID8966
Feature activation-6.17e-01

Pos logit contributions
TRY+1.542
SER+1.430
hop+1.424
tub+1.423
EST+1.386
Neg logit contributions
 Signs-1.962
ecause-1.864
ateg-1.827
avorite-1.793
 Levels-1.752
Token peppers
Token ID31804
Feature activation-3.47e-01

Pos logit contributions
TRY+3.059
hop+2.898
tub+2.854
SER+2.853
EST+2.804
Neg logit contributions
 Signs-3.417
ecause-3.238
ateg-3.192
avorite-3.134
 Levels-3.058
Token.
Token ID13
Feature activation-7.84e-01

Pos logit contributions
TRY+1.781
SER+1.655
hop+1.654
tub+1.648
EST+1.610
Neg logit contributions
 Signs-1.903
ecause-1.789
ateg-1.756
avorite-1.724
 Levels-1.688
Token Fry
Token ID25305
Feature activation-1.47e+00

Pos logit contributions
TRY+4.177
SER+3.922
hop+3.857
tub+3.821
EST+3.790
Neg logit contributions
 Signs-3.892
avorite-3.867
ecause-3.862
ateg-3.744
estones-3.633
Token until
Token ID1566
Feature activation-2.24e+00

Pos logit contributions
TRY+6.306
hop+5.995
tub+5.992
SER+5.956
EST+5.687
Neg logit contributions
 Signs-8.563
ecause-7.983
avorite-7.967
ateg-7.930
estones-7.539
Token the
Token ID262
Feature activation-6.86e-01

Pos logit contributions
TRY+8.616
tub+8.277
hop+8.130
SER+7.969
 attachment+7.933
Neg logit contributions
 Signs-12.431
avorite-12.425
ecause-12.099
ateg-11.886
estones-11.282
Token mushrooms
Token ID23452
Feature activation-7.21e-01

Pos logit contributions
TRY+3.234
tub+3.021
hop+3.017
SER+3.015
EST+2.931
Neg logit contributions
 Signs-3.950
ecause-3.764
ateg-3.723
avorite-3.658
estones-3.523
Token get
Token ID651
Feature activation-1.05e+00

Pos logit contributions
TRY+3.455
tub+3.286
hop+3.244
SER+3.197
EST+3.128
Neg logit contributions
 Signs-4.101
ecause-3.859
ateg-3.791
avorite-3.781
 Levels-3.658
Token a
Token ID257
Feature activation-8.59e-01

Pos logit contributions
TRY+4.396
tub+4.199
SER+4.032
hop+3.993
ORN+3.966
Neg logit contributions
 Signs-6.419
ecause-6.202
avorite-6.090
ateg-6.041
 Levels-5.849
Token bit
Token ID1643
Feature activation-1.13e+00

Pos logit contributions
TRY+3.822
tub+3.595
SER+3.549
hop+3.506
EST+3.441
Neg logit contributions
 Signs-5.131
ecause-4.921
ateg-4.810
avorite-4.776
estones-4.661
Token other
Token ID584
Feature activation-2.66e-01

Pos logit contributions
TRY+1.089
SER+1.009
tub+1.001
hop+0.999
EST+0.972
Neg logit contributions
 Signs-1.493
ecause-1.421
ateg-1.390
avorite-1.361
 Levels-1.338
Token ingredients
Token ID9391
Feature activation-6.62e-01

Pos logit contributions
TRY+1.183
SER+1.098
tub+1.090
hop+1.082
EST+1.056
Neg logit contributions
 Signs-1.629
ecause-1.560
ateg-1.519
avorite-1.497
 Levels-1.464
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
TRY+3.120
tub+2.906
hop+2.903
SER+2.873
EST+2.816
Neg logit contributions
 Signs-3.848
ecause-3.672
avorite-3.564
ateg-3.554
estones-3.423
Token mix
Token ID5022
Feature activation-1.25e+00

Pos logit contributions
TRY+4.668
tub+4.350
SER+4.272
hop+4.154
EST+4.075
Neg logit contributions
 Signs-6.299
ecause-6.176
avorite-6.005
ateg-5.813
estones-5.729
Token well
Token ID880
Feature activation-1.63e+00

Pos logit contributions
TRY+4.931
tub+4.587
hop+4.548
SER+4.522
EST+4.248
Neg logit contributions
 Signs-7.926
ecause-7.786
avorite-7.433
ateg-7.375
 Levels-7.070
Token until
Token ID1566
Feature activation-2.16e+00

Pos logit contributions
TRY+6.627
tub+6.261
hop+5.946
EST+5.917
SER+5.861
Neg logit contributions
 Signs-9.605
ecause-9.449
avorite-9.353
ateg-8.883
aminer-8.726
Token the
Token ID262
Feature activation-7.84e-01

Pos logit contributions
tub+7.850
TRY+7.791
 attachment+7.552
SER+7.235
EST+7.151
Neg logit contributions
ecause-12.556
 Signs-12.381
avorite-12.313
ateg-11.623
estones-11.174
Token mixture
Token ID11710
Feature activation-6.79e-01

Pos logit contributions
TRY+3.413
tub+3.231
SER+3.182
hop+3.139
EST+3.080
Neg logit contributions
 Signs-4.711
ecause-4.562
ateg-4.427
avorite-4.374
estones-4.212
Token feels
Token ID5300
Feature activation-1.31e+00

Pos logit contributions
TRY+3.305
tub+3.146
hop+3.080
SER+3.063
EST+2.998
Neg logit contributions
 Signs-3.802
ecause-3.658
avorite-3.529
ateg-3.491
 Levels-3.361
Token like
Token ID588
Feature activation-9.20e-01

Pos logit contributions
TRY+5.113
tub+4.947
hop+4.735
SER+4.696
EST+4.606
Neg logit contributions
 Signs-8.236
ecause-8.023
avorite-7.812
ateg-7.643
estones-7.390
Token it
Token ID340
Feature activation+2.34e-01

Pos logit contributions
TRY+4.200
tub+3.987
SER+3.873
hop+3.870
EST+3.756
Neg logit contributions
 Signs-5.354
ecause-5.122
ateg-4.975
avorite-4.920
 Levels-4.741
Token cover
Token ID3002
Feature activation-1.18e+00

Pos logit contributions
TRY+3.252
tub+2.919
hop+2.908
SER+2.903
EST+2.809
Neg logit contributions
 Signs-4.746
ecause-4.516
avorite-4.496
ateg-4.358
estones-4.336
Token and
Token ID290
Feature activation-6.57e-01

Pos logit contributions
TRY+5.344
tub+5.025
SER+4.976
hop+4.897
EST+4.721
Neg logit contributions
 Signs-6.787
ecause-6.483
avorite-6.475
ateg-6.198
estones-6.169
Token let
Token ID1309
Feature activation-1.57e+00

Pos logit contributions
TRY+2.825
tub+2.523
hop+2.517
SER+2.514
EST+2.441
Neg logit contributions
 Signs-4.100
avorite-3.927
ecause-3.925
ateg-3.791
estones-3.759
Token it
Token ID340
Feature activation-9.86e-01

Pos logit contributions
TRY+6.512
tub+6.073
SER+6.068
hop+5.952
EST+5.887
Neg logit contributions
 Signs-9.133
ecause-8.844
avorite-8.638
estones-8.383
ateg-8.258
Token simmer
Token ID32857
Feature activation-1.51e+00

Pos logit contributions
TRY+3.338
tub+3.030
hop+2.998
SER+2.931
EST+2.848
Neg logit contributions
 Signs-7.036
ecause-6.690
avorite-6.590
ateg-6.461
estones-6.424
Token until
Token ID1566
Feature activation-1.94e+00

Pos logit contributions
TRY+5.554
hop+5.184
tub+5.135
SER+5.039
EST+5.018
Neg logit contributions
 Signs-9.644
avorite-9.215
ecause-9.132
ateg-8.926
estones-8.666
Token most
Token ID749
Feature activation-8.15e-01

Pos logit contributions
TRY+8.226
tub+7.710
SER+7.698
 attachment+7.561
hop+7.489
Neg logit contributions
 Signs-10.301
avorite-10.295
ecause-10.138
ateg-9.559
estones-9.448
Token of
Token ID286
Feature activation-4.53e-01

Pos logit contributions
TRY+4.029
tub+3.779
SER+3.761
hop+3.719
EST+3.643
Neg logit contributions
 Signs-4.518
ecause-4.244
ateg-4.159
avorite-4.125
 Levels-4.021
Token the
Token ID262
Feature activation-2.70e-01

Pos logit contributions
TRY+2.353
SER+2.203
hop+2.192
tub+2.191
EST+2.129
Neg logit contributions
 Signs-2.446
ecause-2.306
ateg-2.241
avorite-2.210
 Levels-2.153
Token water
Token ID1660
Feature activation-6.64e-01

Pos logit contributions
TRY+1.191
SER+1.098
tub+1.092
hop+1.090
EST+1.061
Neg logit contributions
 Signs-1.679
ecause-1.595
ateg-1.562
avorite-1.536
 Levels-1.510
Token has
Token ID468
Feature activation-4.52e-01

Pos logit contributions
TRY+3.468
tub+3.260
SER+3.201
hop+3.194
EST+3.160
Neg logit contributions
 Signs-3.537
ecause-3.338
ateg-3.268
avorite-3.250
estones-3.114
Token it
Token ID340
Feature activation-7.92e-01

Pos logit contributions
TRY+4.691
tub+4.370
SER+4.304
hop+4.269
EST+4.151
Neg logit contributions
 Signs-6.354
ecause-6.094
avorite-6.025
ateg-5.920
estones-5.666
Token to
Token ID284
Feature activation-9.21e-01

Pos logit contributions
TRY+3.699
tub+3.498
hop+3.438
SER+3.387
EST+3.292
Neg logit contributions
 Signs-4.669
ecause-4.401
avorite-4.330
ateg-4.291
 Levels-4.080
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
TRY+3.952
SER+3.674
tub+3.605
hop+3.535
EST+3.500
Neg logit contributions
 Signs-5.741
ecause-5.436
avorite-5.361
ateg-5.270
estones-5.148
Token boil
Token ID20667
Feature activation-1.66e+00

Pos logit contributions
TRY+2.849
SER+2.573
tub+2.561
hop+2.481
EST+2.345
Neg logit contributions
 Signs-8.099
avorite-7.811
ecause-7.804
ateg-7.628
estones-7.547
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
TRY+6.863
hop+6.443
tub+6.358
©+6.138
SER+6.136
Neg logit contributions
 Signs-9.665
avorite-9.497
ecause-9.162
ateg-8.964
aminer-8.738
Token Once
Token ID4874
Feature activation-1.87e+00

Pos logit contributions
TRY+6.063
SER+5.686
hop+5.542
EST+5.494
©+5.449
Neg logit contributions
avorite-5.840
ecause-5.531
 Signs-5.351
aminer-5.324
ateg-5.285
Token its
Token ID663
Feature activation-8.97e-01

Pos logit contributions
TRY+7.543
hop+7.000
tub+6.971
SER+6.870
EST+6.644
Neg logit contributions
 Signs-10.907
avorite-10.550
ecause-10.194
ateg-9.955
estones-9.899
Token boiling
Token ID24372
Feature activation-1.53e+00

Pos logit contributions
TRY+3.087
tub+2.806
SER+2.796
hop+2.796
EST+2.671
Neg logit contributions
 Signs-6.281
ecause-5.953
avorite-5.909
ateg-5.844
estones-5.782
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
TRY+6.798
tub+6.536
hop+6.427
EST+6.370
SER+6.170
Neg logit contributions
 Signs-8.636
ecause-8.204
avorite-8.058
ateg-7.848
estones-7.629
Token reduce
Token ID4646
Feature activation-5.65e-01

Pos logit contributions
TRY+3.158
hop+2.881
tub+2.878
SER+2.844
EST+2.768
Neg logit contributions
 Signs-4.420
ecause-4.228
avorite-4.172
ateg-4.066
estones-4.049
Token the
Token ID262
Feature activation-5.24e-01

Pos logit contributions
TRY+2.794
SER+2.605
tub+2.599
hop+2.581
EST+2.526
Neg logit contributions
 Signs-3.180
ecause-3.002
ateg-2.934
avorite-2.933
estones-2.830
Tokenimi
Token ID25236
Feature activation+1.75e-01

Pos logit contributions
TRY+0.832
SER+0.775
hop+0.771
tub+0.769
EST+0.753
Neg logit contributions
 Signs-0.920
ecause-0.865
ateg-0.844
avorite-0.829
 Levels-0.816
Token Kan
Token ID14248
Feature activation+8.35e-02

Pos logit contributions
 Signs+0.955
ecause+0.906
ateg+0.883
avorite+0.867
 Levels+0.847
Neg logit contributions
TRY-0.904
SER-0.846
hop-0.840
tub-0.834
EST-0.821
Tokenasis
Token ID17765
Feature activation-1.00e-01

Pos logit contributions
 Signs+0.485
ecause+0.459
ateg+0.449
avorite+0.439
 Levels+0.433
Neg logit contributions
TRY-0.403
SER-0.374
hop-0.370
tub-0.370
EST-0.362
Token in
Token ID287
Feature activation+4.28e-01

Pos logit contributions
TRY+0.505
SER+0.471
hop+0.470
tub+0.469
EST+0.457
Neg logit contributions
 Signs-0.561
ecause-0.529
ateg-0.516
avorite-0.508
 Levels-0.499
Token Un
Token ID791
Feature activation-1.08e-01

Pos logit contributions
 Signs+2.348
ecause+2.187
ateg+2.133
avorite+2.079
 Levels+2.071
Neg logit contributions
TRY-2.218
SER-2.064
hop-2.056
tub-2.052
EST-2.005
Tokenbreak
Token ID9032
Feature activation-1.87e+00

Pos logit contributions
TRY+0.456
SER+0.419
hop+0.415
tub+0.414
EST+0.404
Neg logit contributions
 Signs-0.690
ecause-0.655
ateg-0.642
avorite-0.630
 Levels-0.622
Tokenable
Token ID540
Feature activation-4.07e-01

Pos logit contributions
hop+4.687
tub+4.670
EST+4.668
TRY+4.458
ORN+4.423
Neg logit contributions
 Signs-13.098
ecause-13.019
avorite-12.975
ateg-12.446
 Levels-12.224
Token Kim
Token ID6502
Feature activation-3.83e-02

Pos logit contributions
TRY+1.979
tub+1.859
SER+1.848
hop+1.839
EST+1.819
Neg logit contributions
 Signs-2.300
ecause-2.202
ateg-2.140
avorite-2.125
 Levels-2.050
Tokenmy
Token ID1820
Feature activation-3.88e-01

Pos logit contributions
TRY+0.145
SER+0.131
hop+0.130
tub+0.130
EST+0.126
Neg logit contributions
 Signs-0.262
ecause-0.251
ateg-0.246
avorite-0.241
 Levels-0.238
Token Schmidt
Token ID24740
Feature activation-3.24e-01

Pos logit contributions
TRY+1.592
SER+1.508
tub+1.473
hop+1.469
EST+1.435
Neg logit contributions
 Signs-2.490
ecause-2.416
ateg-2.391
avorite-2.315
estones-2.270
Token,
Token ID11
Feature activation+1.56e-01

Pos logit contributions
TRY+1.578
SER+1.475
tub+1.465
hop+1.462
EST+1.434
Neg logit contributions
 Signs-1.816
ecause-1.740
ateg-1.706
avorite-1.678
estones-1.635
Token soft
Token ID2705
Feature activation-9.71e-02

Pos logit contributions
TRY+3.195
tub+3.009
SER+2.938
hop+2.912
EST+2.859
Neg logit contributions
 Signs-4.623
ecause-4.420
avorite-4.377
ateg-4.322
 Levels-4.176
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
TRY+0.513
SER+0.480
hop+0.477
tub+0.476
EST+0.466
Neg logit contributions
 Signs-0.518
ecause-0.487
ateg-0.476
avorite-0.466
 Levels-0.458
Token crunch
Token ID23527
Feature activation+1.64e-01

Pos logit contributions
TRY+1.679
tub+1.559
SER+1.555
hop+1.542
EST+1.510
Neg logit contributions
 Signs-2.131
ecause-2.022
ateg-1.986
avorite-1.971
 Levels-1.916
Tokeny
Token ID88
Feature activation-1.32e-02

Pos logit contributions
 Signs+0.979
ecause+0.927
ateg+0.904
avorite+0.880
 Levels+0.876
Neg logit contributions
TRY-0.767
SER-0.710
hop-0.702
tub-0.698
Unix-0.685
Token.
Token ID13
Feature activation-8.38e-01

Pos logit contributions
TRY+0.071
SER+0.066
hop+0.066
tub+0.065
EST+0.064
Neg logit contributions
 Signs-0.070
ecause-0.066
ateg-0.064
avorite-0.063
 Levels-0.062
Token Once
Token ID4874
Feature activation-1.68e+00

Pos logit contributions
TRY+4.494
SER+4.229
hop+4.147
tub+4.103
EST+4.075
Neg logit contributions
 Signs-4.099
avorite-4.093
ecause-4.076
ateg-3.939
estones-3.841
Token they
Token ID484
Feature activation-1.22e+00

Pos logit contributions
TRY+7.747
tub+7.275
SER+7.195
hop+7.167
 attachment+6.948
Neg logit contributions
 Signs-9.000
avorite-8.682
ecause-8.453
ateg-8.364
estones-8.143
Token are
Token ID389
Feature activation-1.11e+00

Pos logit contributions
TRY+5.285
tub+5.117
hop+4.891
SER+4.798
EST+4.771
Neg logit contributions
 Signs-7.198
ecause-6.782
avorite-6.766
ateg-6.624
 Levels-6.558
Token ready
Token ID3492
Feature activation-8.48e-01

Pos logit contributions
TRY+4.495
tub+4.194
SER+4.057
hop+4.057
EST+3.954
Neg logit contributions
 Signs-7.022
avorite-6.717
ecause-6.668
ateg-6.536
 Levels-6.427
Token throw
Token ID3714
Feature activation-7.62e-01

Pos logit contributions
TRY+4.321
tub+4.000
SER+3.986
hop+3.956
EST+3.885
Neg logit contributions
 Signs-4.618
ecause-4.377
avorite-4.271
ateg-4.231
estones-4.126
Token them
Token ID606
Feature activation-9.24e-01

Pos logit contributions
TRY+3.697
SER+3.445
tub+3.429
hop+3.406
EST+3.326
Neg logit contributions
 Signs-4.317
ecause-4.090
ateg-4.003
avorite-3.961
 Levels-3.861
Token and
Token ID290
Feature activation-1.10e+00

Pos logit contributions
TRY+6.050
hop+5.577
tub+5.568
SER+5.386
 attachment+5.305
Neg logit contributions
 Signs-8.244
avorite-8.009
ecause-7.857
ateg-7.669
estones-7.605
Token bring
Token ID2222
Feature activation-1.06e+00

Pos logit contributions
TRY+4.760
SER+4.215
tub+4.213
hop+4.127
EST+4.031
Neg logit contributions
 Signs-6.636
avorite-6.533
ecause-6.382
estones-6.182
ateg-6.154
Token it
Token ID340
Feature activation-7.92e-01

Pos logit contributions
TRY+4.691
tub+4.370
SER+4.304
hop+4.269
EST+4.151
Neg logit contributions
 Signs-6.354
ecause-6.094
avorite-6.025
ateg-5.920
estones-5.666
Token to
Token ID284
Feature activation-9.21e-01

Pos logit contributions
TRY+3.699
tub+3.498
hop+3.438
SER+3.387
EST+3.292
Neg logit contributions
 Signs-4.669
ecause-4.401
avorite-4.330
ateg-4.291
 Levels-4.080
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
TRY+3.952
SER+3.674
tub+3.605
hop+3.535
EST+3.500
Neg logit contributions
 Signs-5.741
ecause-5.436
avorite-5.361
ateg-5.270
estones-5.148
Token boil
Token ID20667
Feature activation-1.66e+00

Pos logit contributions
TRY+2.849
SER+2.573
tub+2.561
hop+2.481
EST+2.345
Neg logit contributions
 Signs-8.099
avorite-7.811
ecause-7.804
ateg-7.628
estones-7.547
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
TRY+6.863
hop+6.443
tub+6.358
©+6.138
SER+6.136
Neg logit contributions
 Signs-9.665
avorite-9.497
ecause-9.162
ateg-8.964
aminer-8.738
Token Once
Token ID4874
Feature activation-1.87e+00

Pos logit contributions
TRY+6.063
SER+5.686
hop+5.542
EST+5.494
©+5.449
Neg logit contributions
avorite-5.840
ecause-5.531
 Signs-5.351
aminer-5.324
ateg-5.285
Token its
Token ID663
Feature activation-8.97e-01

Pos logit contributions
TRY+7.543
hop+7.000
tub+6.971
SER+6.870
EST+6.644
Neg logit contributions
 Signs-10.907
avorite-10.550
ecause-10.194
ateg-9.955
estones-9.899
Token boiling
Token ID24372
Feature activation-1.53e+00

Pos logit contributions
TRY+3.087
tub+2.806
SER+2.796
hop+2.796
EST+2.671
Neg logit contributions
 Signs-6.281
ecause-5.953
avorite-5.909
ateg-5.844
estones-5.782
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
TRY+6.798
tub+6.536
hop+6.427
EST+6.370
SER+6.170
Neg logit contributions
 Signs-8.636
ecause-8.204
avorite-8.058
ateg-7.848
estones-7.629
Token the
Token ID262
Feature activation-2.43e-01

Pos logit contributions
TRY+0.745
SER+0.700
hop+0.693
tub+0.692
idious+0.676
Neg logit contributions
 Signs-0.766
ecause-0.722
ateg-0.704
avorite-0.689
 Levels-0.676
Token other
Token ID584
Feature activation-2.66e-01

Pos logit contributions
TRY+1.089
SER+1.009
tub+1.001
hop+0.999
EST+0.972
Neg logit contributions
 Signs-1.493
ecause-1.421
ateg-1.390
avorite-1.361
 Levels-1.338
Token ingredients
Token ID9391
Feature activation-6.62e-01

Pos logit contributions
TRY+1.183
SER+1.098
tub+1.090
hop+1.082
EST+1.056
Neg logit contributions
 Signs-1.629
ecause-1.560
ateg-1.519
avorite-1.497
 Levels-1.464
Token and
Token ID290
Feature activation-1.06e+00

Pos logit contributions
TRY+3.120
tub+2.906
hop+2.903
SER+2.873
EST+2.816
Neg logit contributions
 Signs-3.848
ecause-3.672
avorite-3.564
ateg-3.554
estones-3.423
Token mix
Token ID5022
Feature activation-1.25e+00

Pos logit contributions
TRY+4.668
tub+4.350
SER+4.272
hop+4.154
EST+4.075
Neg logit contributions
 Signs-6.299
ecause-6.176
avorite-6.005
ateg-5.813
estones-5.729
Token well
Token ID880
Feature activation-1.63e+00

Pos logit contributions
TRY+4.931
tub+4.587
hop+4.548
SER+4.522
EST+4.248
Neg logit contributions
 Signs-7.926
ecause-7.786
avorite-7.433
ateg-7.375
 Levels-7.070
Token until
Token ID1566
Feature activation-2.16e+00

Pos logit contributions
TRY+6.627
tub+6.261
hop+5.946
EST+5.917
SER+5.861
Neg logit contributions
 Signs-9.605
ecause-9.449
avorite-9.353
ateg-8.883
aminer-8.726
Token the
Token ID262
Feature activation-7.84e-01

Pos logit contributions
tub+7.850
TRY+7.791
 attachment+7.552
SER+7.235
EST+7.151
Neg logit contributions
ecause-12.556
 Signs-12.381
avorite-12.313
ateg-11.623
estones-11.174
Token mixture
Token ID11710
Feature activation-6.79e-01

Pos logit contributions
TRY+3.413
tub+3.231
SER+3.182
hop+3.139
EST+3.080
Neg logit contributions
 Signs-4.711
ecause-4.562
ateg-4.427
avorite-4.374
estones-4.212
Token feels
Token ID5300
Feature activation-1.31e+00

Pos logit contributions
TRY+3.305
tub+3.146
hop+3.080
SER+3.063
EST+2.998
Neg logit contributions
 Signs-3.802
ecause-3.658
avorite-3.529
ateg-3.491
 Levels-3.361
Token like
Token ID588
Feature activation-9.20e-01

Pos logit contributions
TRY+5.113
tub+4.947
hop+4.735
SER+4.696
EST+4.606
Neg logit contributions
 Signs-8.236
ecause-8.023
avorite-7.812
ateg-7.643
estones-7.390
Token bit
Token ID1643
Feature activation-1.13e+00

Pos logit contributions
TRY+3.822
tub+3.595
SER+3.549
hop+3.506
EST+3.441
Neg logit contributions
 Signs-5.131
ecause-4.921
ateg-4.810
avorite-4.776
estones-4.661
Token soft
Token ID2705
Feature activation-1.20e-01

Pos logit contributions
TRY+4.667
tub+4.537
hop+4.339
SER+4.288
EST+4.228
Neg logit contributions
 Signs-6.952
ecause-6.667
avorite-6.636
ateg-6.541
 Levels-6.336
Token and
Token ID290
Feature activation-4.10e-01

Pos logit contributions
TRY+0.639
SER+0.597
hop+0.593
tub+0.593
EST+0.580
Neg logit contributions
 Signs-0.640
ecause-0.600
ateg-0.587
avorite-0.575
 Levels-0.565
Token nice
Token ID3621
Feature activation-4.31e-01

Pos logit contributions
TRY+1.959
tub+1.821
SER+1.818
hop+1.805
EST+1.761
Neg logit contributions
 Signs-2.377
ecause-2.253
ateg-2.210
avorite-2.189
 Levels-2.130
Token.
Token ID13
Feature activation-9.04e-01

Pos logit contributions
TRY+2.138
tub+2.005
SER+1.983
hop+1.972
EST+1.949
Neg logit contributions
 Signs-2.415
ecause-2.272
ateg-2.222
avorite-2.213
 Levels-2.152
Token Once
Token ID4874
Feature activation-1.62e+00

Pos logit contributions
TRY+4.778
SER+4.488
hop+4.406
tub+4.371
EST+4.328
Neg logit contributions
avorite-4.441
 Signs-4.430
ecause-4.407
ateg-4.273
estones-4.168
Token they
Token ID484
Feature activation-1.10e+00

Pos logit contributions
TRY+6.912
tub+6.460
hop+6.402
SER+6.372
EST+6.138
Neg logit contributions
 Signs-9.370
ecause-8.937
avorite-8.920
ateg-8.873
estones-8.568
Token are
Token ID389
Feature activation-1.20e+00

Pos logit contributions
TRY+4.851
tub+4.657
hop+4.522
SER+4.430
EST+4.386
Neg logit contributions
 Signs-6.465
ecause-6.075
avorite-5.992
ateg-5.963
 Levels-5.845
Token ready
Token ID3492
Feature activation-7.69e-01

Pos logit contributions
TRY+5.028
tub+4.756
SER+4.613
hop+4.586
EST+4.471
Neg logit contributions
 Signs-7.288
ecause-6.924
avorite-6.893
ateg-6.812
estones-6.655
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
TRY+3.716
tub+3.441
SER+3.407
hop+3.389
EST+3.287
Neg logit contributions
 Signs-4.388
ecause-4.195
avorite-4.087
ateg-4.052
estones-3.964
Token throw
Token ID3714
Feature activation-5.43e-01

Pos logit contributions
TRY+3.304
tub+3.044
hop+3.029
SER+3.009
EST+2.917
Neg logit contributions
 Signs-4.263
ecause-4.092
avorite-4.020
ateg-3.968
estones-3.885
Tokenries
Token ID1678
Feature activation-1.94e-01

Pos logit contributions
 Signs+1.438
ecause+1.359
ateg+1.328
avorite+1.326
 Levels+1.295
Neg logit contributions
TRY-0.975
hop-0.898
tub-0.895
SER-0.891
idious-0.861
Token,
Token ID11
Feature activation-1.64e-01

Pos logit contributions
TRY+1.002
SER+0.934
hop+0.931
tub+0.929
EST+0.907
Neg logit contributions
 Signs-1.062
ecause-1.000
ateg-0.977
avorite-0.958
 Levels-0.941
Token and
Token ID290
Feature activation-2.30e-01

Pos logit contributions
TRY+0.842
SER+0.781
hop+0.780
tub+0.779
EST+0.760
Neg logit contributions
 Signs-0.905
ecause-0.857
ateg-0.835
avorite-0.823
 Levels-0.799
Token keep
Token ID1394
Feature activation-4.79e-01

Pos logit contributions
TRY+1.093
SER+1.007
tub+1.006
hop+1.000
EST+0.976
Neg logit contributions
 Signs-1.351
ecause-1.287
ateg-1.252
avorite-1.239
 Levels-1.202
Token going
Token ID1016
Feature activation-1.69e-01

Pos logit contributions
TRY+2.524
tub+2.370
SER+2.356
hop+2.332
idious+2.283
Neg logit contributions
 Signs-2.550
ecause-2.426
ateg-2.348
avorite-2.325
estones-2.235
Token until
Token ID1566
Feature activation-1.60e+00

Pos logit contributions
TRY+0.736
hop+0.677
tub+0.676
SER+0.673
idious+0.652
Neg logit contributions
 Signs-1.067
ecause-1.011
ateg-0.993
avorite-0.984
 Levels-0.955
Token the
Token ID262
Feature activation-4.34e-01

Pos logit contributions
TRY+7.007
tub+6.548
EST+6.314
SER+6.305
hop+6.297
Neg logit contributions
 Signs-9.009
ecause-8.831
avorite-8.652
ateg-8.297
aminer-8.010
Token mix
Token ID5022
Feature activation-6.11e-01

Pos logit contributions
TRY+1.997
tub+1.842
SER+1.836
hop+1.828
EST+1.795
Neg logit contributions
 Signs-2.604
ecause-2.479
ateg-2.415
avorite-2.388
estones-2.317
Token is
Token ID318
Feature activation-7.20e-01

Pos logit contributions
TRY+3.155
tub+2.981
hop+2.920
SER+2.905
EST+2.853
Neg logit contributions
 Signs-3.272
ecause-3.141
avorite-3.028
ateg-3.000
 Levels-2.870
Token this
Token ID428
Feature activation-6.35e-01

Pos logit contributions
TRY+3.140
tub+2.992
hop+2.867
SER+2.865
EST+2.788
Neg logit contributions
 Signs-4.399
ecause-4.232
avorite-4.104
ateg-4.078
estones-3.896
Token smooth
Token ID7209
Feature activation-5.36e-01

Pos logit contributions
TRY+2.568
tub+2.338
SER+2.327
hop+2.302
EST+2.268
Neg logit contributions
 Signs-4.148
ecause-3.966
avorite-3.832
ateg-3.818
estones-3.719
Token broccoli
Token ID44653
Feature activation-4.02e-01

Pos logit contributions
TRY+1.996
tub+1.867
hop+1.820
SER+1.785
EST+1.683
Neg logit contributions
 Signs-4.548
ecause-4.357
avorite-4.290
ateg-4.241
 Levels-4.138
Token for
Token ID329
Feature activation-5.23e-01

Pos logit contributions
TRY+1.613
hop+1.482
tub+1.472
SER+1.460
EST+1.412
Neg logit contributions
 Signs-2.668
ecause-2.543
avorite-2.503
ateg-2.464
 Levels-2.398
Token a
Token ID257
Feature activation-4.45e-01

Pos logit contributions
TRY+2.150
tub+1.978
hop+1.973
SER+1.951
EST+1.898
Neg logit contributions
 Signs-3.373
ecause-3.237
avorite-3.203
ateg-3.119
estones-3.038
Token few
Token ID1178
Feature activation-3.57e-02

Pos logit contributions
TRY+2.098
tub+1.927
SER+1.926
hop+1.924
EST+1.862
Neg logit contributions
 Signs-2.637
ecause-2.509
avorite-2.498
ateg-2.461
 Levels-2.360
Token minutes
Token ID2431
Feature activation-1.13e+00

Pos logit contributions
TRY+0.188
SER+0.177
hop+0.176
tub+0.174
EST+0.172
Neg logit contributions
 Signs-0.190
ecause-0.179
avorite-0.177
ateg-0.177
estones-0.166
Token until
Token ID1566
Feature activation-1.59e+00

Pos logit contributions
TRY+2.913
tub+2.655
SER+2.597
hop+2.523
EST+2.391
Neg logit contributions
 Signs-8.815
ateg-8.278
avorite-8.245
ecause-8.206
gger-8.059
Token fork
Token ID15563
Feature activation-3.00e-01

Pos logit contributions
TRY+3.340
tub+3.263
hop+3.186
Unix+2.919
 attachment+2.841
Neg logit contributions
 Signs-12.355
ecause-12.120
avorite-11.922
 Levels-11.759
estones-11.711
Token-
Token ID12
Feature activation-4.07e-02

Pos logit contributions
TRY+1.154
hop+1.054
tub+1.049
SER+1.047
idious+1.006
Neg logit contributions
 Signs-2.037
ecause-1.959
avorite-1.912
ateg-1.901
 Levels-1.853
Tokent
Token ID83
Feature activation+2.37e-01

Pos logit contributions
TRY+0.165
SER+0.151
hop+0.150
tub+0.150
EST+0.145
Neg logit contributions
 Signs-0.268
ecause-0.255
ateg-0.249
avorite-0.246
 Levels-0.243
Tokenender
Token ID2194
Feature activation-4.94e-01

Pos logit contributions
 Signs+1.094
ecause+1.018
ateg+0.989
avorite+0.964
 Levels+0.946
Neg logit contributions
TRY-1.423
SER-1.342
hop-1.334
tub-1.331
EST-1.307
Token.
Token ID13
Feature activation-6.81e-01

Pos logit contributions
TRY+2.321
hop+2.185
tub+2.162
SER+2.159
EST+2.097
Neg logit contributions
 Signs-2.889
avorite-2.816
ecause-2.809
ateg-2.692
estones-2.581
Token a
Token ID257
Feature activation-3.86e-01

Pos logit contributions
TRY+3.248
hop+3.033
SER+3.024
tub+3.022
EST+2.940
Neg logit contributions
 Signs-3.748
ecause-3.539
ateg-3.448
avorite-3.382
 Levels-3.344
Token tablespoon
Token ID33107
Feature activation-8.03e-01

Pos logit contributions
TRY+1.669
tub+1.538
SER+1.531
hop+1.523
EST+1.482
Neg logit contributions
 Signs-2.422
ecause-2.311
ateg-2.254
avorite-2.226
 Levels-2.178
Token coconut
Token ID20132
Feature activation-2.60e-01

Pos logit contributions
TRY+3.888
tub+3.644
hop+3.628
SER+3.604
EST+3.472
Neg logit contributions
 Signs-4.561
ecause-4.288
ateg-4.183
avorite-4.146
estones-4.091
Token oil
Token ID3056
Feature activation-5.89e-01

Pos logit contributions
TRY+1.169
SER+1.082
hop+1.070
tub+1.067
EST+1.041
Neg logit contributions
 Signs-1.594
ecause-1.511
ateg-1.480
avorite-1.447
 Levels-1.433
Token and
Token ID290
Feature activation-9.40e-01

Pos logit contributions
TRY+2.897
tub+2.702
hop+2.699
SER+2.690
EST+2.616
Neg logit contributions
 Signs-3.314
ecause-3.115
ateg-3.063
avorite-3.052
 Levels-2.943
Token fry
Token ID32959
Feature activation-1.58e+00

Pos logit contributions
TRY+4.181
SER+3.803
tub+3.789
hop+3.758
EST+3.633
Neg logit contributions
 Signs-5.669
ecause-5.421
avorite-5.358
ateg-5.253
estones-5.197
Token until
Token ID1566
Feature activation-2.35e+00

Pos logit contributions
TRY+7.276
hop+6.866
tub+6.838
SER+6.756
EST+6.480
Neg logit contributions
 Signs-8.742
ecause-8.184
avorite-8.144
ateg-8.108
estones-7.685
Token carrots
Token ID34397
Feature activation-8.57e-01

Pos logit contributions
TRY+9.172
tub+8.688
 attachment+8.491
SER+8.466
hop+8.428
Neg logit contributions
avorite-12.776
 Signs-12.761
ecause-12.326
ateg-11.877
estones-11.569
Token are
Token ID389
Feature activation-7.49e-01

Pos logit contributions
TRY+4.027
tub+3.836
hop+3.772
SER+3.701
EST+3.634
Neg logit contributions
 Signs-4.876
ecause-4.637
avorite-4.599
ateg-4.539
 Levels-4.384
Token soft
Token ID2705
Feature activation-9.71e-02

Pos logit contributions
TRY+3.195
tub+3.009
SER+2.938
hop+2.912
EST+2.859
Neg logit contributions
 Signs-4.623
ecause-4.420
avorite-4.377
ateg-4.322
 Levels-4.176
Token and
Token ID290
Feature activation-3.60e-01

Pos logit contributions
TRY+0.513
SER+0.480
hop+0.477
tub+0.476
EST+0.466
Neg logit contributions
 Signs-0.518
ecause-0.487
ateg-0.476
avorite-0.466
 Levels-0.458
Token the
Token ID262
Feature activation-5.24e-01

Pos logit contributions
TRY+2.794
SER+2.605
tub+2.599
hop+2.581
EST+2.526
Neg logit contributions
 Signs-3.180
ecause-3.002
ateg-2.934
avorite-2.933
estones-2.830
Token heat
Token ID4894
Feature activation-7.02e-01

Pos logit contributions
TRY+2.436
tub+2.265
SER+2.261
hop+2.238
EST+2.181
Neg logit contributions
 Signs-3.114
ecause-2.938
ateg-2.890
avorite-2.873
estones-2.805
Token and
Token ID290
Feature activation-7.58e-01

Pos logit contributions
TRY+3.172
tub+2.963
SER+2.925
hop+2.908
EST+2.877
Neg logit contributions
 Signs-4.166
ecause-4.007
avorite-3.912
ateg-3.906
estones-3.746
Token cover
Token ID3002
Feature activation-1.18e+00

Pos logit contributions
TRY+3.252
tub+2.919
hop+2.908
SER+2.903
EST+2.809
Neg logit contributions
 Signs-4.746
ecause-4.516
avorite-4.496
ateg-4.358
estones-4.336
Token and
Token ID290
Feature activation-6.57e-01

Pos logit contributions
TRY+5.344
tub+5.025
SER+4.976
hop+4.897
EST+4.721
Neg logit contributions
 Signs-6.787
ecause-6.483
avorite-6.475
ateg-6.198
estones-6.169
Token let
Token ID1309
Feature activation-1.57e+00

Pos logit contributions
TRY+2.825
tub+2.523
hop+2.517
SER+2.514
EST+2.441
Neg logit contributions
 Signs-4.100
avorite-3.927
ecause-3.925
ateg-3.791
estones-3.759
Token it
Token ID340
Feature activation-9.86e-01

Pos logit contributions
TRY+6.512
tub+6.073
SER+6.068
hop+5.952
EST+5.887
Neg logit contributions
 Signs-9.133
ecause-8.844
avorite-8.638
estones-8.383
ateg-8.258
Token simmer
Token ID32857
Feature activation-1.51e+00

Pos logit contributions
TRY+3.338
tub+3.030
hop+2.998
SER+2.931
EST+2.848
Neg logit contributions
 Signs-7.036
ecause-6.690
avorite-6.590
ateg-6.461
estones-6.424
Token until
Token ID1566
Feature activation-1.94e+00

Pos logit contributions
TRY+5.554
hop+5.184
tub+5.135
SER+5.039
EST+5.018
Neg logit contributions
 Signs-9.644
avorite-9.215
ecause-9.132
ateg-8.926
estones-8.666
Token most
Token ID749
Feature activation-8.15e-01

Pos logit contributions
TRY+8.226
tub+7.710
SER+7.698
 attachment+7.561
hop+7.489
Neg logit contributions
 Signs-10.301
avorite-10.295
ecause-10.138
ateg-9.559
estones-9.448
Token of
Token ID286
Feature activation-4.53e-01

Pos logit contributions
TRY+4.029
tub+3.779
SER+3.761
hop+3.719
EST+3.643
Neg logit contributions
 Signs-4.518
ecause-4.244
ateg-4.159
avorite-4.125
 Levels-4.021
Token other
Token ID584
Feature activation+2.24e-01

Pos logit contributions
 Signs+1.176
ecause+1.092
ateg+1.063
avorite+1.031
 Levels+1.017
Neg logit contributions
TRY-1.534
SER-1.446
hop-1.437
tub-1.434
EST-1.408
Token ingredients
Token ID9391
Feature activation-5.69e-01

Pos logit contributions
 Signs+1.390
ecause+1.324
ateg+1.295
avorite+1.273
 Levels+1.249
Neg logit contributions
TRY-0.990
SER-0.915
tub-0.904
hop-0.904
EST-0.882
Token and
Token ID290
Feature activation-7.54e-01

Pos logit contributions
TRY+2.548
hop+2.418
tub+2.381
SER+2.364
EST+2.324
Neg logit contributions
 Signs-3.412
avorite-3.354
ecause-3.338
ateg-3.175
VIDIA-3.054
Token mix
Token ID5022
Feature activation-1.10e+00

Pos logit contributions
TRY+2.990
tub+2.822
hop+2.800
SER+2.712
EST+2.670
Neg logit contributions
 Signs-4.955
ecause-4.714
avorite-4.673
ateg-4.503
estones-4.449
Token well
Token ID880
Feature activation-1.51e+00

Pos logit contributions
TRY+3.606
hop+3.542
EST+3.421
tub+3.365
SER+3.335
Neg logit contributions
 Signs-7.710
avorite-7.552
ecause-7.468
ateg-7.129
 Levels-6.930
Token until
Token ID1566
Feature activation-1.57e+00

Pos logit contributions
TRY+3.338
tub+3.319
SER+3.144
 attachment+3.004
EST+2.873
Neg logit contributions
 Signs-11.760
avorite-11.277
ecause-11.178
ateg-11.069
aminer-10.763
Token the
Token ID262
Feature activation-6.40e-01

Pos logit contributions
TRY+5.589
tub+5.511
hop+5.320
 attachment+5.225
EST+5.220
Neg logit contributions
avorite-9.957
ecause-9.903
 Signs-9.861
VIDIA-9.294
aminer-9.052
Token mixture
Token ID11710
Feature activation-3.62e-01

Pos logit contributions
TRY+2.661
tub+2.487
hop+2.437
SER+2.422
EST+2.387
Neg logit contributions
 Signs-4.123
ecause-3.885
avorite-3.799
ateg-3.760
estones-3.688
Token feels
Token ID5300
Feature activation-1.37e+00

Pos logit contributions
TRY+1.978
hop+1.876
tub+1.867
SER+1.844
idious+1.820
Neg logit contributions
 Signs-1.848
ecause-1.742
ateg-1.688
avorite-1.680
 Levels-1.604
Token like
Token ID588
Feature activation-1.08e+00

Pos logit contributions
TRY+5.271
hop+5.043
tub+5.023
SER+4.782
 attachment+4.742
Neg logit contributions
 Signs-8.889
avorite-8.523
ecause-8.499
ateg-8.108
estones-7.968
Token it
Token ID340
Feature activation-4.72e-02

Pos logit contributions
TRY+4.818
hop+4.530
tub+4.437
SER+4.355
 attachment+4.338
Neg logit contributions
 Signs-6.392
avorite-6.324
ecause-6.257
ateg-5.893
estones-5.758
Tokenai
Token ID1872
Feature activation+2.18e-01

Pos logit contributions
TRY+1.107
hop+1.044
SER+1.041
tub+1.040
EST+1.009
Neg logit contributions
 Signs-1.162
ecause-1.087
ateg-1.061
avorite-1.043
 Levels-1.030
Token curry
Token ID34611
Feature activation-1.69e-01

Pos logit contributions
 Signs+1.380
ecause+1.307
ateg+1.284
avorite+1.260
 Levels+1.243
Neg logit contributions
TRY-0.934
SER-0.860
hop-0.842
tub-0.840
idious-0.828
Token paste
Token ID17008
Feature activation-7.62e-01

Pos logit contributions
TRY+0.726
SER+0.668
hop+0.663
tub+0.662
EST+0.644
Neg logit contributions
 Signs-1.070
ecause-1.017
ateg-0.995
avorite-0.977
 Levels-0.964
Token and
Token ID290
Feature activation-6.38e-01

Pos logit contributions
TRY+3.681
hop+3.461
tub+3.443
SER+3.401
EST+3.349
Neg logit contributions
 Signs-4.333
ecause-4.099
ateg-3.970
avorite-3.950
 Levels-3.838
Token mix
Token ID5022
Feature activation-1.10e+00

Pos logit contributions
TRY+2.801
SER+2.554
tub+2.540
hop+2.533
EST+2.451
Neg logit contributions
 Signs-3.950
ecause-3.780
avorite-3.675
ateg-3.655
estones-3.578
Token well
Token ID880
Feature activation-1.56e+00

Pos logit contributions
TRY+4.545
hop+4.268
tub+4.265
SER+4.207
EST+4.006
Neg logit contributions
 Signs-6.807
ecause-6.604
avorite-6.404
ateg-6.340
 Levels-6.055
Token.
Token ID13
Feature activation-1.20e+00

Pos logit contributions
TRY+6.510
tub+6.119
hop+5.922
SER+5.877
EST+5.837
Neg logit contributions
 Signs-9.043
avorite-8.773
ecause-8.746
ateg-8.393
aminer-8.166
Token Season
Token ID7369
Feature activation-2.68e-01

Pos logit contributions
TRY+6.055
SER+5.686
hop+5.526
tub+5.478
EST+5.444
Neg logit contributions
avorite-6.146
ecause-5.970
 Signs-5.857
estones-5.699
ateg-5.686
Token with
Token ID351
Feature activation-4.32e-01

Pos logit contributions
TRY+1.318
SER+1.228
hop+1.222
tub+1.218
EST+1.185
Neg logit contributions
 Signs-1.528
ecause-1.447
ateg-1.408
avorite-1.385
 Levels-1.360
Token salt
Token ID8268
Feature activation-5.31e-01

Pos logit contributions
TRY+2.070
SER+1.931
hop+1.919
tub+1.915
EST+1.870
Neg logit contributions
 Signs-2.506
ecause-2.385
ateg-2.319
avorite-2.265
 Levels-2.230
Token,
Token ID11
Feature activation-7.71e-01

Pos logit contributions
TRY+2.625
hop+2.470
tub+2.454
SER+2.447
EST+2.391
Neg logit contributions
 Signs-2.964
ecause-2.825
ateg-2.717
avorite-2.696
 Levels-2.619
Token broccoli
Token ID44653
Feature activation-1.22e-01

Pos logit contributions
TRY+1.608
tub+1.510
SER+1.510
hop+1.500
EST+1.455
Neg logit contributions
 Signs-2.036
ecause-1.931
ateg-1.883
avorite-1.852
estones-1.808
Token for
Token ID329
Feature activation-4.33e-01

Pos logit contributions
TRY+0.619
SER+0.577
hop+0.573
tub+0.572
EST+0.559
Neg logit contributions
 Signs-0.678
ecause-0.639
ateg-0.624
avorite-0.611
 Levels-0.602
Token a
Token ID257
Feature activation-3.14e-01

Pos logit contributions
TRY+2.054
SER+1.902
tub+1.879
hop+1.878
EST+1.846
Neg logit contributions
 Signs-2.521
ecause-2.411
ateg-2.355
avorite-2.345
estones-2.259
Token few
Token ID1178
Feature activation-3.21e-01

Pos logit contributions
TRY+1.463
SER+1.363
hop+1.355
tub+1.348
EST+1.313
Neg logit contributions
 Signs-1.857
ecause-1.761
ateg-1.723
avorite-1.712
estones-1.676
Token minutes
Token ID2431
Feature activation-8.27e-01

Pos logit contributions
TRY+1.744
SER+1.635
hop+1.622
tub+1.618
EST+1.589
Neg logit contributions
 Signs-1.641
ecause-1.551
ateg-1.513
avorite-1.503
estones-1.436
Token until
Token ID1566
Feature activation-1.54e+00

Pos logit contributions
TRY+3.945
tub+3.678
hop+3.658
SER+3.632
EST+3.555
Neg logit contributions
 Signs-4.680
avorite-4.410
ecause-4.407
ateg-4.373
estones-4.117
Token fork
Token ID15563
Feature activation-2.86e-01

Pos logit contributions
TRY+6.550
tub+6.189
SER+6.122
hop+5.960
EST+5.940
Neg logit contributions
 Signs-8.788
avorite-8.583
ecause-8.447
ateg-8.221
estones-7.964
Token-
Token ID12
Feature activation-2.94e-01

Pos logit contributions
TRY+1.197
SER+1.097
tub+1.092
hop+1.090
EST+1.057
Neg logit contributions
 Signs-1.842
ecause-1.753
ateg-1.714
avorite-1.690
 Levels-1.664
Tokent
Token ID83
Feature activation+2.09e-01

Pos logit contributions
TRY+1.236
SER+1.135
hop+1.133
tub+1.131
EST+1.094
Neg logit contributions
 Signs-1.891
ecause-1.795
ateg-1.759
avorite-1.733
 Levels-1.707
Tokenender
Token ID2194
Feature activation-6.13e-02

Pos logit contributions
 Signs+0.953
ecause+0.892
ateg+0.874
avorite+0.835
 Levels+0.820
Neg logit contributions
TRY-1.267
SER-1.196
hop-1.181
tub-1.180
EST-1.158
Token.
Token ID13
Feature activation-9.08e-01

Pos logit contributions
TRY+0.324
SER+0.302
hop+0.301
tub+0.300
EST+0.294
Neg logit contributions
 Signs-0.328
ecause-0.309
ateg-0.301
avorite-0.295
 Levels-0.290
Token a
Token ID257
Feature activation-1.05e+00

Pos logit contributions
TRY+3.952
SER+3.674
tub+3.605
hop+3.535
EST+3.500
Neg logit contributions
 Signs-5.741
ecause-5.436
avorite-5.361
ateg-5.270
estones-5.148
Token boil
Token ID20667
Feature activation-1.66e+00

Pos logit contributions
TRY+2.849
SER+2.573
tub+2.561
hop+2.481
EST+2.345
Neg logit contributions
 Signs-8.099
avorite-7.811
ecause-7.804
ateg-7.628
estones-7.547
Token.
Token ID13
Feature activation-1.15e+00

Pos logit contributions
TRY+6.863
hop+6.443
tub+6.358
©+6.138
SER+6.136
Neg logit contributions
 Signs-9.665
avorite-9.497
ecause-9.162
ateg-8.964
aminer-8.738
Token Once
Token ID4874
Feature activation-1.87e+00

Pos logit contributions
TRY+6.063
SER+5.686
hop+5.542
EST+5.494
©+5.449
Neg logit contributions
avorite-5.840
ecause-5.531
 Signs-5.351
aminer-5.324
ateg-5.285
Token its
Token ID663
Feature activation-8.97e-01

Pos logit contributions
TRY+7.543
hop+7.000
tub+6.971
SER+6.870
EST+6.644
Neg logit contributions
 Signs-10.907
avorite-10.550
ecause-10.194
ateg-9.955
estones-9.899
Token boiling
Token ID24372
Feature activation-1.53e+00

Pos logit contributions
TRY+3.087
tub+2.806
SER+2.796
hop+2.796
EST+2.671
Neg logit contributions
 Signs-6.281
ecause-5.953
avorite-5.909
ateg-5.844
estones-5.782
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
TRY+6.798
tub+6.536
hop+6.427
EST+6.370
SER+6.170
Neg logit contributions
 Signs-8.636
ecause-8.204
avorite-8.058
ateg-7.848
estones-7.629
Token reduce
Token ID4646
Feature activation-5.65e-01

Pos logit contributions
TRY+3.158
hop+2.881
tub+2.878
SER+2.844
EST+2.768
Neg logit contributions
 Signs-4.420
ecause-4.228
avorite-4.172
ateg-4.066
estones-4.049
Token the
Token ID262
Feature activation-5.24e-01

Pos logit contributions
TRY+2.794
SER+2.605
tub+2.599
hop+2.581
EST+2.526
Neg logit contributions
 Signs-3.180
ecause-3.002
ateg-2.934
avorite-2.933
estones-2.830
Token heat
Token ID4894
Feature activation-7.02e-01

Pos logit contributions
TRY+2.436
tub+2.265
SER+2.261
hop+2.238
EST+2.181
Neg logit contributions
 Signs-3.114
ecause-2.938
ateg-2.890
avorite-2.873
estones-2.805
Token and
Token ID290
Feature activation-7.58e-01

Pos logit contributions
TRY+3.172
tub+2.963
SER+2.925
hop+2.908
EST+2.877
Neg logit contributions
 Signs-4.166
ecause-4.007
avorite-3.912
ateg-3.906
estones-3.746
Token tax
Token ID1687
Feature activation-7.57e-01

Pos logit contributions
TRY+0.456
SER+0.425
hop+0.424
tub+0.421
EST+0.412
Neg logit contributions
 Signs-0.534
ecause-0.508
ateg-0.494
avorite-0.485
 Levels-0.476
Tokena
Token ID64
Feature activation-3.43e-01

Pos logit contributions
TRY+2.888
hop+2.757
tub+2.696
arious+2.678
SER+2.674
Neg logit contributions
 Signs-4.886
ecause-4.719
avorite-4.550
ateg-4.523
 Levels-4.428
Token.
Token ID13
Feature activation-3.12e-01

Pos logit contributions
TRY+1.734
hop+1.631
SER+1.623
tub+1.612
idious+1.575
Neg logit contributions
 Signs-1.890
ecause-1.804
ateg-1.737
avorite-1.728
 Levels-1.676
Token Australian
Token ID6638
Feature activation-5.72e-01

Pos logit contributions
TRY+1.647
SER+1.544
hop+1.535
tub+1.518
EST+1.505
Neg logit contributions
 Signs-1.639
ecause-1.583
ateg-1.533
avorite-1.525
estones-1.458
Token Journal
Token ID4913
Feature activation-1.07e+00

Pos logit contributions
TRY+3.019
hop+2.866
SER+2.859
tub+2.830
EST+2.782
Neg logit contributions
 Signs-2.906
ecause-2.804
ateg-2.692
avorite-2.679
estones-2.586
Token of
Token ID286
Feature activation-1.52e+00

Pos logit contributions
TRY+4.550
hop+4.464
tub+4.242
SER+4.241
©+4.151
Neg logit contributions
ecause-6.323
avorite-6.293
 Signs-6.061
ateg-5.842
estones-5.697
Token Zo
Token ID31645
Feature activation-8.98e-01

Pos logit contributions
TRY+4.307
hop+4.210
SER+3.977
tub+3.927
EST+3.882
Neg logit contributions
ecause-10.415
 Signs-10.332
avorite-10.317
ateg-10.003
aminer-9.931
Tokenology
Token ID1435
Feature activation-1.03e+00

Pos logit contributions
TRY+3.190
hop+3.138
tub+2.908
idious+2.865
SER+2.856
Neg logit contributions
 Signs-6.010
avorite-5.915
ecause-5.776
 Levels-5.518
ateg-5.495
Token 62
Token ID8190
Feature activation-1.11e+00

Pos logit contributions
TRY+4.756
hop+4.574
SER+4.401
tub+4.359
EST+4.353
Neg logit contributions
ecause-5.775
 Signs-5.724
avorite-5.498
ateg-5.416
estones-5.313
Token (
Token ID357
Feature activation-6.55e-01

Pos logit contributions
TRY+4.711
hop+4.430
SER+4.374
EST+4.365
tub+4.267
Neg logit contributions
ecause-6.579
 Signs-6.516
avorite-6.202
ateg-6.131
estones-5.966
Token5
Token ID20
Feature activation-4.23e-01

Pos logit contributions
TRY+2.612
SER+2.412
hop+2.402
EST+2.344
tub+2.336
Neg logit contributions
 Signs-4.308
ecause-4.168
ateg-4.016
avorite-4.009
 Levels-3.901
Token c
Token ID269
Feature activation+6.40e-01

Pos logit contributions
 Signs+0.869
ecause+0.828
ateg+0.806
avorite+0.799
 Levels+0.774
Neg logit contributions
TRY-0.713
SER-0.665
hop-0.661
tub-0.659
EST-0.644
Tokenv
Token ID85
Feature activation+2.69e-01

Pos logit contributions
 Signs+3.763
ateg+3.565
ecause+3.560
 Levels+3.381
avorite+3.353
Neg logit contributions
TRY-2.973
SER-2.744
idious-2.671
ORN-2.669
hop-2.668
Token2
Token ID17
Feature activation+1.34e-01

Pos logit contributions
 Signs+1.713
ecause+1.624
ateg+1.595
avorite+1.557
 Levels+1.547
Neg logit contributions
TRY-1.142
SER-1.049
hop-1.040
tub-1.036
idious-1.012
Token.
Token ID764
Feature activation+6.21e-01

Pos logit contributions
 Signs+0.742
ecause+0.702
ateg+0.683
avorite+0.674
 Levels+0.657
Neg logit contributions
TRY-0.688
SER-0.643
hop-0.638
tub-0.636
EST-0.623
Token COL
Token ID20444
Feature activation+1.01e-01

Pos logit contributions
 Signs+2.859
ecause+2.648
 Levels+2.569
ateg+2.564
avorite+2.524
Neg logit contributions
TRY-3.654
hop-3.436
idious-3.398
tub-3.385
SER-3.381
TokenOR
Token ID1581
Feature activation+1.09e+00

Pos logit contributions
 Signs+0.781
ecause+0.751
ateg+0.737
avorite+0.730
 Levels+0.718
Neg logit contributions
TRY-0.296
SER-0.262
hop-0.256
tub-0.254
EST-0.249
Token_
Token ID62
Feature activation+3.31e-02

Pos logit contributions
 Signs+5.946
ateg+5.474
 Levels+5.373
 herself+5.302
ecause+5.280
Neg logit contributions
TRY-5.115
idious-4.975
tub-4.839
arious-4.802
hop-4.772
TokenB
Token ID33
Feature activation+5.66e-01

Pos logit contributions
 Signs+0.214
ecause+0.204
ateg+0.200
avorite+0.196
 Levels+0.193
Neg logit contributions
TRY-0.138
SER-0.127
hop-0.126
tub-0.125
EST-0.123
TokenGR
Token ID10761
Feature activation+3.92e-01

Pos logit contributions
 Signs+3.253
ecause+3.054
ateg+3.009
 Levels+2.912
avorite+2.902
Neg logit contributions
TRY-2.693
hop-2.504
tub-2.498
SER-2.494
idious-2.474
Token2
Token ID17
Feature activation+1.40e+00

Pos logit contributions
 Signs+2.502
ecause+2.373
ateg+2.328
avorite+2.278
 Levels+2.259
Neg logit contributions
TRY-1.659
SER-1.523
hop-1.513
tub-1.508
idious-1.470
TokenG
Token ID38
Feature activation+4.56e-01

Pos logit contributions
 Signs+5.917
 Levels+5.181
ateg+5.151
ecause+5.145
 herself+5.142
Neg logit contributions
idious-8.115
TRY-8.019
arious-7.806
tub-7.748
hop-7.627
Token the
Token ID262
Feature activation-2.77e-01

Pos logit contributions
TRY+2.059
SER+1.947
hop+1.934
tub+1.927
EST+1.887
Neg logit contributions
 Signs-2.199
ecause-2.094
ateg-2.025
avorite-1.985
estones-1.933
Token documents
Token ID4963
Feature activation+3.49e-01

Pos logit contributions
TRY+1.468
SER+1.383
hop+1.377
tub+1.370
EST+1.345
Neg logit contributions
 Signs-1.457
ecause-1.397
ateg-1.341
avorite-1.320
estones-1.273
Token some
Token ID617
Feature activation+1.73e-01

Pos logit contributions
 Signs+1.798
ecause+1.687
ateg+1.642
avorite+1.601
 Levels+1.588
Neg logit contributions
TRY-1.913
SER-1.787
hop-1.763
tub-1.761
ORN-1.732
Token time
Token ID640
Feature activation-4.37e-01

Pos logit contributions
 Signs+1.033
ecause+0.977
ateg+0.954
avorite+0.933
 Levels+0.926
Neg logit contributions
TRY-0.804
SER-0.743
hop-0.736
tub-0.735
EST-0.717
Token ago
Token ID2084
Feature activation+1.43e-01

Pos logit contributions
TRY+2.167
hop+2.065
tub+2.058
SER+2.042
idious+2.001
Neg logit contributions
 Signs-2.423
ecause-2.276
ateg-2.234
avorite-2.211
 Levels-2.145
Token by
Token ID416
Feature activation+7.28e-01

Pos logit contributions
 Signs+0.743
ecause+0.698
ateg+0.679
avorite+0.663
 Levels+0.655
Neg logit contributions
TRY-0.782
SER-0.733
hop-0.726
tub-0.724
EST-0.710
Token Julian
Token ID18322
Feature activation+4.39e-01

Pos logit contributions
 Signs+4.170
ecause+3.924
ateg+3.809
 Levels+3.715
avorite+3.696
Neg logit contributions
TRY-3.557
SER-3.263
tub-3.253
hop-3.230
EST-3.172
Token Assange
Token ID21882
Feature activation+4.78e-01

Pos logit contributions
 Signs+2.759
ecause+2.591
ateg+2.546
 Levels+2.488
avorite+2.460
Neg logit contributions
TRY-1.935
SER-1.747
tub-1.740
hop-1.738
idious-1.705
Token,
Token ID11
Feature activation+1.36e-01

Pos logit contributions
 Signs+2.343
ecause+2.177
ateg+2.129
avorite+2.046
 herself+2.044
Neg logit contributions
TRY-2.754
SER-2.576
tub-2.536
hop-2.534
ORN-2.492
Token the
Token ID262
Feature activation-5.73e-02

Pos logit contributions
 Signs+0.738
ecause+0.694
ateg+0.678
avorite+0.662
 Levels+0.653
Neg logit contributions
TRY-0.710
SER-0.663
hop-0.658
tub-0.656
EST-0.642
Token founder
Token ID9119
Feature activation-1.20e-01

Pos logit contributions
TRY+0.262
SER+0.243
hop+0.241
tub+0.240
EST+0.234
Neg logit contributions
 Signs-0.347
ecause-0.329
ateg-0.322
avorite-0.316
 Levels-0.311
Tokenread
Token ID961
Feature activation-1.06e-01

Pos logit contributions
TRY+0.612
hop+0.568
SER+0.564
tub+0.561
EST+0.546
Neg logit contributions
 Signs-0.847
ecause-0.799
ateg-0.785
avorite-0.775
 Levels-0.760
Token:
Token ID25
Feature activation+2.43e-01

Pos logit contributions
TRY+0.584
hop+0.546
SER+0.544
tub+0.541
EST+0.537
Neg logit contributions
 Signs-0.541
ecause-0.512
ateg-0.497
avorite-0.496
 Levels-0.476
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
 Signs+1.139
ecause+1.058
ateg+1.024
avorite+0.994
 Levels+0.987
Neg logit contributions
TRY-1.446
SER-1.367
tub-1.358
hop-1.347
idious-1.333
Token
Token ID198
Feature activation-2.41e-02

Pos logit contributions
 Signs+0.658
ecause+0.620
ateg+0.600
avorite+0.594
 Levels+0.573
Neg logit contributions
TRY-0.790
SER-0.743
hop-0.740
tub-0.737
EST-0.724
TokenIf
Token ID1532
Feature activation-7.75e-02

Pos logit contributions
TRY+0.113
SER+0.105
hop+0.104
tub+0.103
EST+0.101
Neg logit contributions
 Signs-0.143
ecause-0.136
ateg-0.133
avorite-0.131
 Levels-0.128
Token all
Token ID477
Feature activation+8.41e-01

Pos logit contributions
TRY+0.391
SER+0.365
hop+0.364
tub+0.361
idious+0.354
Neg logit contributions
 Signs-0.430
ecause-0.408
ateg-0.398
avorite-0.391
 Levels-0.382
Token that
Token ID326
Feature activation+1.57e-01

Pos logit contributions
 Signs+3.886
ecause+3.514
ateg+3.433
 Levels+3.368
avorite+3.287
Neg logit contributions
TRY-5.017
SER-4.701
tub-4.688
hop-4.625
EST-4.577
Token wasn
Token ID2492
Feature activation+1.34e-01

Pos logit contributions
 Signs+0.829
ecause+0.777
ateg+0.759
avorite+0.740
 Levels+0.731
Neg logit contributions
TRY-0.845
SER-0.791
hop-0.785
tub-0.783
EST-0.767
Token't
Token ID470
Feature activation-3.62e-01

Pos logit contributions
 Signs+0.883
ecause+0.847
ateg+0.825
avorite+0.823
 Levels+0.800
Neg logit contributions
TRY-0.540
SER-0.493
hop-0.491
tub-0.488
EST-0.476
Token enough
Token ID1576
Feature activation+1.13e-01

Pos logit contributions
TRY+1.718
hop+1.596
SER+1.595
tub+1.589
idious+1.561
Neg logit contributions
 Signs-2.087
ecause-2.000
ateg-1.931
avorite-1.916
 Levels-1.886
Token,
Token ID11
Feature activation+2.30e-01

Pos logit contributions
 Signs+0.576
ecause+0.537
ateg+0.524
avorite+0.510
 Levels+0.505
Neg logit contributions
TRY-0.624
SER-0.587
hop-0.582
tub-0.580
EST-0.569
Token.
Token ID13
Feature activation-5.05e-02

Pos logit contributions
 Signs+0.808
ecause+0.759
ateg+0.741
avorite+0.724
 Levels+0.713
Neg logit contributions
TRY-0.793
SER-0.741
hop-0.736
tub-0.734
EST-0.719
Token
Token ID198
Feature activation-1.49e-02

Pos logit contributions
TRY+0.287
SER+0.271
hop+0.269
tub+0.268
EST+0.263
Neg logit contributions
 Signs-0.246
ecause-0.234
ateg-0.228
avorite-0.224
 Levels-0.215
Token
Token ID198
Feature activation+3.20e-02

Pos logit contributions
TRY+0.084
hop+0.079
SER+0.079
tub+0.078
EST+0.077
Neg logit contributions
 Signs-0.074
ecause-0.071
avorite-0.069
ateg-0.068
 Levels-0.065
TokenFor
Token ID1890
Feature activation+2.73e-01

Pos logit contributions
 Signs+0.182
ecause+0.172
ateg+0.168
avorite+0.165
 Levels+0.162
Neg logit contributions
TRY-0.159
SER-0.149
hop-0.147
tub-0.146
EST-0.144
Token photographic
Token ID35788
Feature activation+5.86e-02

Pos logit contributions
 Signs+1.393
ecause+1.304
ateg+1.271
avorite+1.239
 Levels+1.222
Neg logit contributions
TRY-1.514
SER-1.421
hop-1.411
tub-1.407
EST-1.378
Token evidence
Token ID2370
Feature activation+7.44e-01

Pos logit contributions
 Signs+0.267
ecause+0.249
ateg+0.241
avorite+0.235
 Levels+0.231
Neg logit contributions
TRY-0.356
SER-0.336
hop-0.334
tub-0.333
EST-0.327
Token of
Token ID286
Feature activation-8.85e-02

Pos logit contributions
 Signs+3.550
ecause+3.218
ateg+3.168
 herself+3.094
 Levels+3.085
Neg logit contributions
TRY-4.254
SER-4.024
hop-4.001
tub-3.943
ORN-3.921
Token Manson
Token ID41645
Feature activation+1.46e-01

Pos logit contributions
TRY+0.466
SER+0.438
hop+0.435
tub+0.433
idious+0.425
Neg logit contributions
 Signs-0.470
ecause-0.447
ateg-0.435
avorite-0.425
 Levels-0.416
Token's
Token ID338
Feature activation-2.96e-01

Pos logit contributions
 Signs+0.759
ecause+0.709
ateg+0.692
avorite+0.675
 Levels+0.667
Neg logit contributions
TRY-0.790
SER-0.740
hop-0.734
tub-0.733
EST-0.718
Token chi
Token ID33166
Feature activation-2.69e-01

Pos logit contributions
TRY+1.401
SER+1.296
hop+1.287
tub+1.277
idious+1.268
Neg logit contributions
 Signs-1.733
ecause-1.670
ateg-1.620
avorite-1.599
 Levels-1.555
Tokenrog
Token ID3828
Feature activation-1.27e-01

Pos logit contributions
TRY+1.236
hop+1.151
SER+1.146
tub+1.138
EST+1.125
Neg logit contributions
 Signs-1.618
ecause-1.537
ateg-1.486
avorite-1.465
 Levels-1.442
Token.
Token ID13
Feature activation-1.37e-01

Pos logit contributions
TRY+1.646
hop+1.553
SER+1.546
tub+1.542
EST+1.511
Neg logit contributions
 Signs-1.720
ecause-1.625
ateg-1.600
avorite-1.569
 Levels-1.522
Token
Token ID198
Feature activation-6.78e-02

Pos logit contributions
TRY+0.769
SER+0.723
hop+0.718
tub+0.713
EST+0.704
Neg logit contributions
 Signs-0.671
ecause-0.646
ateg-0.628
avorite-0.626
estones-0.595
Token
Token ID198
Feature activation-3.74e-02

Pos logit contributions
TRY+0.379
hop+0.355
SER+0.355
tub+0.352
EST+0.346
Neg logit contributions
 Signs-0.341
ecause-0.330
avorite-0.324
ateg-0.315
 Levels-0.299
TokenH
Token ID39
Feature activation+3.25e-02

Pos logit contributions
TRY+0.174
SER+0.162
hop+0.160
tub+0.159
EST+0.156
Neg logit contributions
 Signs-0.224
ecause-0.213
ateg-0.208
avorite-0.206
 Levels-0.202
Tokenills
Token ID2171
Feature activation-1.43e-01

Pos logit contributions
 Signs+0.167
ecause+0.157
ateg+0.153
avorite+0.149
 Levels+0.147
Neg logit contributions
TRY-0.178
SER-0.167
hop-0.166
tub-0.165
idious-0.162
Tokenborough
Token ID17913
Feature activation+9.11e-01

Pos logit contributions
TRY+0.632
hop+0.601
tub+0.589
SER+0.589
idious+0.573
Neg logit contributions
 Signs-0.870
ecause-0.841
ateg-0.812
avorite-0.810
 Levels-0.785
Token County
Token ID3418
Feature activation+1.05e+00

Pos logit contributions
 Signs+5.210
ecause+4.688
ateg+4.646
 Levels+4.641
 herself+4.544
Neg logit contributions
TRY-4.450
SER-4.083
hop-4.071
tub-4.068
Unix-4.014
Token residents
Token ID5085
Feature activation+3.59e-01

Pos logit contributions
 Signs+4.947
ateg+4.354
ecause+4.331
 Levels+4.274
 herself+4.142
Neg logit contributions
TRY-6.192
SER-5.772
hop-5.741
tub-5.726
idious-5.696
Token who
Token ID508
Feature activation+9.37e-03

Pos logit contributions
 Signs+1.959
ecause+1.820
ateg+1.791
 Levels+1.726
avorite+1.722
Neg logit contributions
TRY-1.862
SER-1.746
tub-1.708
hop-1.707
idious-1.686
Token find
Token ID1064
Feature activation+1.51e-01

Pos logit contributions
 Signs+0.054
ecause+0.052
ateg+0.050
avorite+0.050
 Levels+0.049
Neg logit contributions
TRY-0.045
tub-0.042
hop-0.042
SER-0.041
EST-0.040
Token themselves
Token ID2405
Feature activation+8.63e-02

Pos logit contributions
 Signs+0.724
ecause+0.684
ateg+0.664
avorite+0.649
 Levels+0.633
Neg logit contributions
TRY-0.878
SER-0.827
hop-0.822
tub-0.820
EST-0.806
Token on
Token ID319
Feature activation-8.70e-02

Pos logit contributions
TRY+0.736
SER+0.687
hop+0.684
tub+0.682
EST+0.668
Neg logit contributions
 Signs-0.808
ecause-0.765
ateg-0.747
avorite-0.733
 Levels-0.719
Token the
Token ID262
Feature activation-4.11e-02

Pos logit contributions
TRY+0.422
SER+0.392
hop+0.388
tub+0.386
EST+0.381
Neg logit contributions
 Signs-0.498
ecause-0.479
ateg-0.466
avorite-0.461
 Levels-0.446
Token street
Token ID4675
Feature activation-4.29e-01

Pos logit contributions
TRY+0.188
SER+0.173
hop+0.173
tub+0.172
EST+0.169
Neg logit contributions
 Signs-0.247
ecause-0.237
ateg-0.232
avorite-0.229
 Levels-0.223
Token,
Token ID11
Feature activation-1.09e-01

Pos logit contributions
TRY+2.141
hop+2.001
tub+1.996
SER+1.989
EST+1.944
Neg logit contributions
 Signs-2.359
ecause-2.270
ateg-2.213
avorite-2.194
 Levels-2.119
Token as
Token ID355
Feature activation-3.39e-01

Pos logit contributions
TRY+0.570
hop+0.534
SER+0.532
tub+0.530
EST+0.520
Neg logit contributions
 Signs-0.584
ecause-0.556
ateg-0.543
avorite-0.534
 Levels-0.520
Token part
Token ID636
Feature activation-1.83e-01

Pos logit contributions
TRY+1.568
hop+1.468
tub+1.468
SER+1.464
idious+1.432
Neg logit contributions
 Signs-1.970
ecause-1.905
ateg-1.864
avorite-1.834
estones-1.783
Token of
Token ID286
Feature activation+2.03e-01

Pos logit contributions
TRY+0.927
hop+0.870
SER+0.867
tub+0.860
EST+0.840
Neg logit contributions
 Signs-1.014
ecause-0.978
avorite-0.964
ateg-0.942
estones-0.904
Token her
Token ID607
Feature activation-2.51e-01

Pos logit contributions
 Signs+1.126
ecause+1.059
ateg+1.034
avorite+1.010
 Levels+0.999
Neg logit contributions
TRY-1.030
SER-0.960
hop-0.954
tub-0.952
EST-0.931
Token project
Token ID1628
Feature activation-1.64e-01

Pos logit contributions
TRY+1.208
SER+1.119
hop+1.109
tub+1.104
idious+1.083
Neg logit contributions
 Signs-1.445
ecause-1.388
ateg-1.355
avorite-1.337
 Levels-1.296
Token Hey
Token ID14690
Feature activation-2.12e-01

Pos logit contributions
TRY+0.876
SER+0.816
hop+0.813
tub+0.811
EST+0.796
Neg logit contributions
 Signs-0.863
ecause-0.822
ateg-0.800
avorite-0.790
 Levels-0.764
Token Baby
Token ID14801
Feature activation+1.35e-03

Pos logit contributions
TRY+1.140
SER+1.067
hop+1.066
tub+1.061
EST+1.043
Neg logit contributions
 Signs-1.092
ecause-1.046
ateg-1.018
avorite-1.005
 Levels-0.969
Token the
Token ID262
Feature activation+3.32e-01

Pos logit contributions
 Signs+3.415
ecause+3.173
ateg+3.140
 Levels+3.034
 herself+3.000
Neg logit contributions
TRY-3.067
SER-2.839
tub-2.823
hop-2.787
EST-2.734
Token best
Token ID1266
Feature activation-1.75e-01

Pos logit contributions
 Signs+1.947
ecause+1.837
ateg+1.796
avorite+1.755
 Levels+1.737
Neg logit contributions
TRY-1.584
SER-1.469
hop-1.455
tub-1.452
idious-1.417
Token integrity
Token ID11540
Feature activation+1.20e-01

Pos logit contributions
TRY+0.841
SER+0.781
hop+0.778
tub+0.772
EST+0.763
Neg logit contributions
 Signs-1.015
ecause-0.962
ateg-0.939
avorite-0.926
 Levels-0.907
Token.
Token ID13
Feature activation-2.52e-01

Pos logit contributions
 Signs+0.651
ecause+0.612
ateg+0.598
avorite+0.584
 Levels+0.576
Neg logit contributions
TRY-0.620
SER-0.579
hop-0.574
tub-0.573
EST-0.561
Token
Token ID198
Feature activation+1.36e-01

Pos logit contributions
TRY+1.402
SER+1.319
hop+1.311
tub+1.287
EST+1.285
Neg logit contributions
 Signs-1.241
ecause-1.198
avorite-1.173
ateg-1.165
estones-1.102
Token
Token ID198
Feature activation+9.37e-02

Pos logit contributions
 Signs+0.657
ecause+0.619
ateg+0.599
avorite+0.592
 Levels+0.572
Neg logit contributions
TRY-0.793
SER-0.747
hop-0.743
tub-0.740
EST-0.727
TokenThe
Token ID464
Feature activation-4.05e-02

Pos logit contributions
 Signs+0.558
ecause+0.528
ateg+0.517
avorite+0.507
 Levels+0.500
Neg logit contributions
TRY-0.439
SER-0.407
hop-0.404
tub-0.402
EST-0.393
Token first
Token ID717
Feature activation+7.49e-02

Pos logit contributions
TRY+0.211
SER+0.198
hop+0.196
tub+0.195
EST+0.192
Neg logit contributions
 Signs-0.217
ecause-0.207
ateg-0.202
avorite-0.198
estones-0.192
Token thing
Token ID1517
Feature activation-1.03e-01

Pos logit contributions
 Signs+0.429
ecause+0.408
ateg+0.398
avorite+0.391
 Levels+0.382
Neg logit contributions
TRY-0.367
SER-0.341
hop-0.340
tub-0.337
EST-0.331
Token we
Token ID356
Feature activation-4.73e-01

Pos logit contributions
TRY+0.522
SER+0.487
hop+0.487
tub+0.481
EST+0.476
Neg logit contributions
 Signs-0.567
ecause-0.540
ateg-0.526
avorite-0.518
 Levels-0.505
Token want
Token ID765
Feature activation-4.58e-01

Pos logit contributions
TRY+1.955
hop+1.851
tub+1.840
SER+1.783
EST+1.769
Neg logit contributions
 Signs-2.989
ecause-2.879
avorite-2.771
ateg-2.764
 Levels-2.718
Token up
Token ID510
Feature activation-2.16e-01

Pos logit contributions
 Signs+1.179
ecause+1.108
ateg+1.077
avorite+1.050
 Levels+1.039
Neg logit contributions
TRY-1.203
SER-1.127
tub-1.116
hop-1.115
idious-1.093
Token loaded
Token ID9639
Feature activation-2.11e-01

Pos logit contributions
TRY+1.098
hop+1.031
SER+1.019
tub+1.012
EST+0.996
Neg logit contributions
 Signs-1.188
ecause-1.114
ateg-1.102
avorite-1.097
 Levels-1.055
Token in
Token ID287
Feature activation+2.88e-01

Pos logit contributions
TRY+1.068
hop+1.001
SER+0.993
tub+0.992
EST+0.969
Neg logit contributions
 Signs-1.171
ecause-1.098
ateg-1.081
avorite-1.069
 Levels-1.038
Token ro
Token ID686
Feature activation-4.69e-02

Pos logit contributions
 Signs+1.576
ecause+1.483
ateg+1.447
avorite+1.411
 Levels+1.398
Neg logit contributions
TRY-1.482
SER-1.383
hop-1.371
tub-1.371
EST-1.340
Tokenpas
Token ID44429
Feature activation-3.27e-01

Pos logit contributions
TRY+0.209
SER+0.193
hop+0.191
tub+0.191
EST+0.186
Neg logit contributions
 Signs-0.290
ecause-0.275
ateg-0.269
avorite-0.264
 Levels-0.260
Tokenaurus
Token ID22302
Feature activation+1.76e-01

Pos logit contributions
TRY+1.598
SER+1.489
hop+1.485
tub+1.457
EST+1.433
Neg logit contributions
 Signs-1.887
ecause-1.772
ateg-1.736
avorite-1.709
 Levels-1.681
Tokenrex
Token ID21510
Feature activation+2.30e-01

Pos logit contributions
 Signs+0.969
ecause+0.914
ateg+0.892
avorite+0.870
 Levels+0.859
Neg logit contributions
TRY-0.893
SER-0.832
tub-0.827
hop-0.826
EST-0.808
Token in
Token ID287
Feature activation+7.43e-02

Pos logit contributions
 Signs+1.256
ecause+1.186
ateg+1.156
avorite+1.127
 Levels+1.113
Neg logit contributions
TRY-1.189
SER-1.106
tub-1.100
hop-1.098
EST-1.076
Token our
Token ID674
Feature activation-2.29e-02

Pos logit contributions
 Signs+0.430
ecause+0.407
ateg+0.398
avorite+0.391
 Levels+0.384
Neg logit contributions
TRY-0.359
SER-0.334
hop-0.332
tub-0.330
EST-0.323
Token debugger
Token ID49518
Feature activation-6.88e-01

Pos logit contributions
TRY+0.109
SER+0.102
hop+0.101
tub+0.101
EST+0.098
Neg logit contributions
 Signs-0.135
ecause-0.128
ateg-0.125
avorite-0.122
 Levels-0.120
Token:
Token ID25
Feature activation+1.72e-01

Pos logit contributions
TRY+3.331
hop+3.100
tub+3.095
SER+3.088
EST+3.015
Neg logit contributions
 Signs-3.881
ecause-3.664
avorite-3.640
ateg-3.615
estones-3.451
Token pro
Token ID386
Feature activation-4.49e-03

Pos logit contributions
TRY+4.296
SER+4.108
tub+4.065
hop+4.034
idious+4.018
Neg logit contributions
 Signs-5.457
ecause-5.051
avorite-4.949
ateg-4.905
 Levels-4.889
Token quo
Token ID18658
Feature activation-1.21e-01

Pos logit contributions
TRY+0.016
hop+0.014
SER+0.014
tub+0.014
EST+0.014
Neg logit contributions
 Signs-0.032
ecause-0.031
avorite-0.030
ateg-0.030
 Levels-0.029
Token,
Token ID11
Feature activation+1.63e-01

Pos logit contributions
TRY+0.615
SER+0.575
hop+0.569
tub+0.568
idious+0.558
Neg logit contributions
 Signs-0.667
ecause-0.628
ateg-0.610
avorite-0.602
 Levels-0.591
Token it
Token ID340
Feature activation+7.25e-02

Pos logit contributions
 Signs+0.831
ecause+0.779
ateg+0.759
avorite+0.742
 Levels+0.730
Neg logit contributions
TRY-0.904
SER-0.849
hop-0.843
tub-0.840
EST-0.824
Token
Token ID447
Feature activation-2.02e-01

Pos logit contributions
 Signs+0.382
ecause+0.359
ateg+0.350
avorite+0.343
 Levels+0.337
Neg logit contributions
TRY-0.388
SER-0.363
hop-0.362
tub-0.361
EST-0.353
Token
Token ID247
Feature activation-1.63e-01

Pos logit contributions
TRY+1.370
SER+1.294
hop+1.288
tub+1.268
©+1.252
Neg logit contributions
 Signs-0.790
ecause-0.747
avorite-0.723
ateg-0.695
estones-0.683
Tokens
Token ID82
Feature activation-1.33e-01

Pos logit contributions
TRY+0.623
hop+0.582
SER+0.578
tub+0.572
idious+0.556
Neg logit contributions
 Signs-1.098
ecause-1.047
avorite-1.036
ateg-1.017
 Levels-0.997
Token true
Token ID2081
Feature activation-6.81e-01

Pos logit contributions
TRY+0.615
SER+0.575
hop+0.572
tub+0.567
idious+0.557
Neg logit contributions
 Signs-0.793
ecause-0.753
ateg-0.732
avorite-0.723
 Levels-0.712
Token that
Token ID326
Feature activation-3.24e-01

Pos logit contributions
TRY+3.200
SER+3.043
hop+2.995
tub+2.987
idious+2.960
Neg logit contributions
 Signs-3.898
ecause-3.707
ateg-3.601
avorite-3.569
estones-3.504
Token while
Token ID981
Feature activation-5.05e-01

Pos logit contributions
TRY+1.750
SER+1.673
hop+1.647
tub+1.630
idious+1.620
Neg logit contributions
 Signs-1.637
ecause-1.537
ateg-1.495
avorite-1.471
estones-1.454
Token the
Token ID262
Feature activation-2.05e-02

Pos logit contributions
TRY+2.710
SER+2.600
hop+2.555
tub+2.527
idious+2.513
Neg logit contributions
 Signs-2.542
ecause-2.407
ateg-2.333
avorite-2.298
estones-2.256
Token the
Token ID262
Feature activation-2.48e-01

Pos logit contributions
TRY+1.231
SER+1.160
hop+1.150
tub+1.137
idious+1.118
Neg logit contributions
 Signs-1.306
ecause-1.241
ateg-1.209
avorite-1.194
estones-1.155
Token 1980
Token ID7169
Feature activation+2.03e-01

Pos logit contributions
TRY+1.142
SER+1.075
hop+1.064
tub+1.052
EST+1.036
Neg logit contributions
 Signs-1.467
ecause-1.408
ateg-1.377
avorite-1.363
estones-1.317
Token
Token ID447
Feature activation+1.53e-01

Pos logit contributions
 Signs+1.209
ecause+1.135
ateg+1.116
 Levels+1.086
avorite+1.069
Neg logit contributions
TRY-0.959
SER-0.882
hop-0.876
tub-0.869
idious-0.856
Token
Token ID247
Feature activation+1.06e-01

Pos logit contributions
 Signs+0.537
ecause+0.499
ateg+0.476
avorite+0.475
estones+0.446
Neg logit contributions
TRY-1.098
SER-1.036
hop-1.035
tub-1.027
EST-1.014
Tokens
Token ID82
Feature activation-7.02e-02

Pos logit contributions
 Signs+0.664
ecause+0.630
ateg+0.617
avorite+0.603
 Levels+0.598
Neg logit contributions
TRY-0.463
SER-0.425
hop-0.421
tub-0.421
idious-0.409
Token when
Token ID618
Feature activation+1.81e-02

Pos logit contributions
TRY+0.367
SER+0.344
hop+0.342
tub+0.341
EST+0.335
Neg logit contributions
 Signs-0.378
ecause-0.355
ateg-0.347
avorite-0.341
 Levels-0.333
Token packet
Token ID19638
Feature activation+3.68e-01

Pos logit contributions
 Signs+0.100
ecause+0.095
ateg+0.092
avorite+0.090
 Levels+0.089
Neg logit contributions
TRY-0.093
SER-0.086
hop-0.086
tub-0.086
EST-0.084
Token loss
Token ID2994
Feature activation+5.27e-01

Pos logit contributions
 Signs+1.880
ecause+1.753
ateg+1.712
avorite+1.659
 Levels+1.651
Neg logit contributions
TRY-2.033
SER-1.903
tub-1.888
hop-1.883
EST-1.848
Token and
Token ID290
Feature activation-5.64e-02

Pos logit contributions
 Signs+2.687
ecause+2.509
ateg+2.456
 Levels+2.379
avorite+2.378
Neg logit contributions
TRY-2.900
SER-2.709
tub-2.698
hop-2.686
ORN-2.632
Token congestion
Token ID28014
Feature activation+3.79e-01

Pos logit contributions
TRY+0.270
SER+0.252
hop+0.250
tub+0.249
EST+0.243
Neg logit contributions
 Signs-0.328
ecause-0.311
ateg-0.304
avorite-0.299
 Levels-0.293
Token were
Token ID547
Feature activation+2.65e-01

Pos logit contributions
 Signs+2.101
ecause+1.974
ateg+1.932
avorite+1.882
 Levels+1.872
Neg logit contributions
TRY-1.919
SER-1.784
tub-1.773
hop-1.772
EST-1.729
Token automation
Token ID22771
Feature activation-8.33e-03

Pos logit contributions
TRY+0.559
SER+0.523
hop+0.521
tub+0.518
EST+0.507
Neg logit contributions
 Signs-0.584
ecause-0.553
ateg-0.539
avorite-0.530
 Levels-0.515
Token and
Token ID290
Feature activation-1.30e-01

Pos logit contributions
TRY+0.044
SER+0.041
hop+0.041
tub+0.041
EST+0.040
Neg logit contributions
 Signs-0.044
ecause-0.042
ateg-0.040
avorite-0.040
 Levels-0.039
Token computer
Token ID3644
Feature activation-4.49e-01

Pos logit contributions
TRY+0.653
SER+0.610
hop+0.607
tub+0.603
EST+0.590
Neg logit contributions
 Signs-0.723
ecause-0.687
ateg-0.668
avorite-0.663
 Levels-0.641
Tokenization
Token ID1634
Feature activation-2.84e-01

Pos logit contributions
TRY+1.338
hop+1.199
SER+1.194
tub+1.148
EST+1.124
Neg logit contributions
 Signs-3.393
ecause-3.291
avorite-3.211
ateg-3.198
estones-3.108
Token and
Token ID290
Feature activation-3.31e-02

Pos logit contributions
TRY+1.448
hop+1.368
SER+1.366
tub+1.359
EST+1.323
Neg logit contributions
 Signs-1.531
ecause-1.457
avorite-1.406
ateg-1.405
estones-1.366
Token enabling
Token ID15882
Feature activation-3.17e-01

Pos logit contributions
TRY+0.173
SER+0.162
hop+0.161
tub+0.160
EST+0.157
Neg logit contributions
 Signs-0.178
ecause-0.168
ateg-0.163
avorite-0.161
 Levels-0.157
Token online
Token ID2691
Feature activation-4.40e-01

Pos logit contributions
TRY+1.615
hop+1.507
SER+1.507
tub+1.500
EST+1.466
Neg logit contributions
 Signs-1.735
ecause-1.647
ateg-1.599
avorite-1.593
estones-1.539
Token services
Token ID2594
Feature activation-1.44e-01

Pos logit contributions
TRY+2.248
hop+2.092
SER+2.086
tub+2.051
EST+2.023
Neg logit contributions
 Signs-2.384
ecause-2.305
avorite-2.217
ateg-2.209
estones-2.137
Token.
Token ID13
Feature activation-3.09e-01

Pos logit contributions
TRY+0.730
hop+0.682
SER+0.681
tub+0.677
EST+0.661
Neg logit contributions
 Signs-0.792
ecause-0.751
ateg-0.728
avorite-0.720
estones-0.699
Token
Token ID198
Feature activation-1.91e-01

Pos logit contributions
TRY+1.753
SER+1.654
hop+1.645
tub+1.631
EST+1.609
Neg logit contributions
 Signs-1.483
ecause-1.425
avorite-1.380
ateg-1.374
estones-1.304
Token
Token ID198
Feature activation-8.39e-02

Pos logit contributions
TRY+1.048
SER+0.981
hop+0.981
tub+0.970
EST+0.958
Neg logit contributions
 Signs-0.982
ecause-0.959
avorite-0.954
ateg-0.910
estones-0.864
Token like
Token ID588
Feature activation+2.73e-01

Pos logit contributions
 Signs+1.581
ecause+1.480
ateg+1.443
avorite+1.406
 Levels+1.393
Neg logit contributions
TRY-1.635
SER-1.529
hop-1.514
tub-1.514
idious-1.486
Token Google
Token ID3012
Feature activation+3.58e-01

Pos logit contributions
 Signs+1.149
ecause+1.074
ateg+1.038
avorite+1.012
 Levels+0.979
Neg logit contributions
TRY-1.748
SER-1.659
hop-1.653
tub-1.646
EST-1.615
Token,
Token ID11
Feature activation+1.02e-01

Pos logit contributions
 Signs+1.864
ecause+1.735
ateg+1.696
avorite+1.649
 Levels+1.641
Neg logit contributions
TRY-1.952
SER-1.821
hop-1.807
tub-1.802
idious-1.776
Token Yahoo
Token ID16551
Feature activation-1.76e-01

Pos logit contributions
 Signs+0.503
ecause+0.473
ateg+0.462
avorite+0.451
 Levels+0.440
Neg logit contributions
TRY-0.572
SER-0.539
hop-0.537
tub-0.535
EST-0.524
Token,
Token ID11
Feature activation+2.20e-01

Pos logit contributions
TRY+0.959
SER+0.900
hop+0.894
tub+0.892
EST+0.874
Neg logit contributions
 Signs-0.914
ecause-0.859
ateg-0.837
avorite-0.818
 Levels-0.804
Token Duck
Token ID21867
Feature activation-3.81e-01

Pos logit contributions
 Signs+1.125
ecause+1.055
ateg+1.028
avorite+1.003
 Levels+0.988
Neg logit contributions
TRY-1.213
SER-1.138
hop-1.130
tub-1.128
EST-1.106
TokenD
Token ID35
Feature activation-4.94e-01

Pos logit contributions
TRY+1.365
hop+1.241
SER+1.240
tub+1.207
EST+1.181
Neg logit contributions
 Signs-2.641
ecause-2.561
avorite-2.504
ateg-2.498
estones-2.413
Tokenuck
Token ID1347
Feature activation+2.20e-03

Pos logit contributions
hop+1.847
TRY+1.846
EST+1.715
tub+1.698
SER+1.678
Neg logit contributions
 Signs-3.297
ecause-3.146
avorite-3.080
ateg-3.029
 Levels-3.007
TokenGo
Token ID5247
Feature activation+4.02e-01

Pos logit contributions
 Signs+0.012
ecause+0.011
ateg+0.011
avorite+0.011
 Levels+0.010
Neg logit contributions
TRY-0.012
hop-0.011
SER-0.011
tub-0.011
Unix-0.011
Token,
Token ID11
Feature activation+1.76e-01

Pos logit contributions
 Signs+1.981
ecause+1.834
ateg+1.798
avorite+1.736
 Levels+1.736
Neg logit contributions
TRY-2.292
SER-2.135
tub-2.121
hop-2.100
idious-2.095
Token B
Token ID347
Feature activation+1.65e-01

Pos logit contributions
 Signs+0.929
ecause+0.874
ateg+0.853
avorite+0.833
 Levels+0.819
Neg logit contributions
TRY-0.943
SER-0.883
hop-0.877
tub-0.875
EST-0.857
Token homophobic
Token ID36034
Feature activation-1.80e-01

Pos logit contributions
TRY+1.472
SER+1.381
tub+1.368
hop+1.366
EST+1.334
Neg logit contributions
 Signs-1.719
ecause-1.644
ateg-1.590
avorite-1.575
estones-1.545
Token,
Token ID11
Feature activation-5.25e-01

Pos logit contributions
TRY+1.003
hop+0.949
SER+0.946
tub+0.942
EST+0.927
Neg logit contributions
 Signs-0.898
ecause-0.857
ateg-0.818
avorite-0.816
 Levels-0.790
Token misogyn
Token ID22795
Feature activation-2.59e-01

Pos logit contributions
TRY+2.267
tub+2.092
hop+2.078
SER+2.065
EST+2.046
Neg logit contributions
 Signs-3.252
ecause-3.161
ateg-3.046
avorite-3.013
estones-2.960
Tokenist
Token ID396
Feature activation-2.91e-01

Pos logit contributions
TRY+1.357
hop+1.273
SER+1.270
tub+1.267
idious+1.244
Neg logit contributions
 Signs-1.386
ecause-1.302
ateg-1.262
avorite-1.247
 Levels-1.232
Token,
Token ID11
Feature activation-4.80e-01

Pos logit contributions
TRY+1.543
hop+1.453
SER+1.447
tub+1.444
idious+1.420
Neg logit contributions
 Signs-1.523
ecause-1.463
ateg-1.389
avorite-1.384
 Levels-1.348
Token and
Token ID290
Feature activation-4.54e-01

Pos logit contributions
TRY+2.303
tub+2.141
hop+2.129
SER+2.126
EST+2.087
Neg logit contributions
 Signs-2.765
ecause-2.660
ateg-2.564
avorite-2.530
 Levels-2.478
Token racist
Token ID9811
Feature activation-2.20e-01

Pos logit contributions
TRY+2.076
tub+1.912
SER+1.901
hop+1.899
idious+1.861
Neg logit contributions
 Signs-2.717
ecause-2.626
ateg-2.531
avorite-2.497
estones-2.463
Token comments
Token ID3651
Feature activation+6.90e-02

Pos logit contributions
TRY+1.198
hop+1.132
SER+1.132
tub+1.121
idious+1.105
Neg logit contributions
 Signs-1.112
ecause-1.074
ateg-1.019
avorite-1.019
estones-0.982
Token.
Token ID13
Feature activation-6.20e-02

Pos logit contributions
 Signs+0.374
ecause+0.353
ateg+0.344
avorite+0.338
 Levels+0.331
Neg logit contributions
TRY-0.359
SER-0.336
hop-0.334
tub-0.333
EST-0.326
Token (
Token ID357
Feature activation-3.97e-02

Pos logit contributions
TRY+0.335
SER+0.317
hop+0.313
tub+0.313
EST+0.310
Neg logit contributions
 Signs-0.313
ecause-0.305
avorite-0.300
ateg-0.295
estones-0.282
TokenIn
Token ID818
Feature activation+2.35e-01

Pos logit contributions
TRY+0.184
SER+0.172
hop+0.169
tub+0.168
EST+0.166
Neg logit contributions
 Signs-0.238
ecause-0.227
ateg-0.222
avorite-0.221
 Levels-0.214
Token...
Token ID986
Feature activation+3.91e-01

Pos logit contributions
 Signs+1.127
ecause+1.062
ateg+1.039
avorite+1.014
 Levels+1.003
Neg logit contributions
TRY-1.009
SER-0.942
hop-0.932
tub-0.931
idious-0.911
Token to
Token ID284
Feature activation-2.19e-01

Pos logit contributions
 Signs+2.302
ecause+2.171
ateg+2.118
 Levels+2.057
avorite+2.050
Neg logit contributions
TRY-1.848
SER-1.716
tub-1.693
hop-1.688
idious-1.672
Token see
Token ID766
Feature activation-1.80e-01

Pos logit contributions
TRY+1.101
SER+1.020
hop+1.017
tub+1.012
EST+0.997
Neg logit contributions
 Signs-1.226
ecause-1.163
ateg-1.131
avorite-1.117
 Levels-1.089
Token Shib
Token ID44254
Feature activation+2.05e-01

Pos logit contributions
TRY+0.926
SER+0.865
hop+0.865
tub+0.859
EST+0.846
Neg logit contributions
 Signs-0.970
ecause-0.922
ateg-0.899
avorite-0.885
 Levels-0.865
Tokenata
Token ID1045
Feature activation-4.83e-02

Pos logit contributions
 Signs+1.106
ecause+1.042
ateg+1.017
avorite+0.993
 Levels+0.979
Neg logit contributions
TRY-1.074
SER-1.004
hop-0.996
tub-0.994
EST-0.973
Token.
Token ID13
Feature activation-3.34e-01

Pos logit contributions
TRY+0.219
SER+0.203
hop+0.202
tub+0.201
EST+0.196
Neg logit contributions
 Signs-0.294
ecause-0.279
ateg-0.273
avorite-0.269
 Levels-0.263
Token
Token ID447
Feature activation+2.84e-01

Pos logit contributions
TRY+1.448
SER+1.334
hop+1.331
tub+1.321
EST+1.293
Neg logit contributions
 Signs-2.073
ecause-1.994
avorite-1.969
ateg-1.951
estones-1.882
Token
Token ID251
Feature activation+6.75e-02

Pos logit contributions
 Signs+1.088
ecause+0.995
ateg+0.962
avorite+0.922
 Levels+0.910
Neg logit contributions
TRY-1.928
SER-1.836
hop-1.821
tub-1.817
idious-1.793
Token Shib
Token ID44254
Feature activation+3.22e-01

Pos logit contributions
 Signs+0.368
ecause+0.347
ateg+0.339
avorite+0.332
 Levels+0.326
Neg logit contributions
TRY-0.349
SER-0.326
hop-0.324
tub-0.323
EST-0.316
Tokenata
Token ID1045
Feature activation+1.07e-02

Pos logit contributions
 Signs+1.398
ecause+1.285
ateg+1.228
avorite+1.219
 Levels+1.191
Neg logit contributions
TRY-2.034
tub-1.927
hop-1.926
SER-1.923
EST-1.886
Token Yu
Token ID10605
Feature activation+8.66e-02

Pos logit contributions
 Signs+0.052
ecause+0.049
ateg+0.048
avorite+0.047
 Levels+0.046
Neg logit contributions
TRY-0.062
hop-0.058
SER-0.058
tub-0.057
EST-0.056
Token running
Token ID2491
Feature activation-1.48e-01

Pos logit contributions
 Signs+0.093
ecause+0.088
ateg+0.085
avorite+0.083
 Levels+0.082
Neg logit contributions
TRY-0.098
SER-0.092
hop-0.092
tub-0.091
EST-0.090
Token one
Token ID530
Feature activation+9.08e-02

Pos logit contributions
TRY+0.726
SER+0.675
hop+0.670
tub+0.669
EST+0.655
Neg logit contributions
 Signs-0.844
ecause-0.801
ateg-0.781
avorite-0.769
 Levels-0.754
Token
Token ID447
Feature activation+5.97e-02

Pos logit contributions
 Signs+0.465
ecause+0.437
ateg+0.431
avorite+0.412
 herself+0.411
Neg logit contributions
TRY-0.494
SER-0.467
hop-0.463
tub-0.457
Unix-0.452
Token
Token ID247
Feature activation-9.05e-02

Pos logit contributions
 Signs+0.210
ecause+0.194
avorite+0.186
ateg+0.183
estones+0.175
Neg logit contributions
TRY-0.427
hop-0.404
tub-0.403
SER-0.403
EST-0.395
Tokens
Token ID82
Feature activation-1.02e-01

Pos logit contributions
TRY+0.393
SER+0.363
hop+0.360
tub+0.359
idious+0.349
Neg logit contributions
 Signs-0.570
ecause-0.540
ateg-0.529
avorite-0.521
 Levels-0.514
Token own
Token ID898
Feature activation-4.65e-01

Pos logit contributions
TRY+0.544
SER+0.508
hop+0.506
tub+0.504
idious+0.494
Neg logit contributions
 Signs-0.544
ecause-0.514
ateg-0.499
avorite-0.489
 Levels-0.481
Token pirate
Token ID25868
Feature activation-5.78e-01

Pos logit contributions
TRY+2.052
hop+1.880
SER+1.870
tub+1.858
idious+1.841
Neg logit contributions
 Signs-2.860
ecause-2.769
ateg-2.670
avorite-2.653
 Levels-2.597
Token crew
Token ID5462
Feature activation+5.47e-02

Pos logit contributions
TRY+2.633
hop+2.435
SER+2.405
tub+2.383
EST+2.354
Neg logit contributions
 Signs-3.489
ecause-3.344
ateg-3.222
avorite-3.182
 Levels-3.134
Token
Token ID960
Feature activation+1.20e-01

Pos logit contributions
 Signs+0.294
ecause+0.277
ateg+0.270
avorite+0.264
 Levels+0.260
Neg logit contributions
TRY-0.286
SER-0.268
hop-0.265
tub-0.264
EST-0.260
Tokenwhat
Token ID10919
Feature activation+8.69e-02

Pos logit contributions
 Signs+0.736
ecause+0.698
ateg+0.684
avorite+0.669
 Levels+0.661
Neg logit contributions
TRY-0.532
SER-0.491
tub-0.485
hop-0.484
idious-0.474
Token it
Token ID340
Feature activation+5.30e-01

Pos logit contributions
 Signs+0.449
ecause+0.420
ateg+0.410
avorite+0.399
 Levels+0.395
Neg logit contributions
TRY-0.475
SER-0.445
hop-0.442
tub-0.441
EST-0.432
Token)
Token ID8
Feature activation-6.04e-01

Pos logit contributions
 Signs+0.639
ecause+0.607
ateg+0.587
avorite+0.585
 Levels+0.563
Neg logit contributions
TRY-0.655
SER-0.615
hop-0.611
tub-0.608
EST-0.597
Token Mem
Token ID4942
Feature activation-2.85e-01

Pos logit contributions
TRY+3.056
SER+2.887
hop+2.857
tub+2.833
EST+2.796
Neg logit contributions
 Signs-3.271
ecause-3.127
ateg-3.054
avorite-3.041
estones-2.910
Tokenc
Token ID66
Feature activation-3.39e-01

Pos logit contributions
TRY+0.876
hop+0.809
tub+0.798
SER+0.791
EST+0.758
Neg logit contributions
 Signs-2.158
ecause-2.065
avorite-2.053
ateg-1.999
 Levels-1.975
Tokenpy
Token ID9078
Feature activation-3.38e-01

Pos logit contributions
TRY+0.111
hop+0.018
tub+0.005
py+0.001
SER-0.023
Neg logit contributions
 Signs-3.527
ecause-3.417
avorite-3.387
 Levels-3.299
ateg-3.286
TokenIn
Token ID818
Feature activation+3.79e-01

Pos logit contributions
TRY+1.750
SER+1.632
hop+1.624
tub+1.620
idious+1.588
Neg logit contributions
 Signs-1.839
ecause-1.734
ateg-1.691
avorite-1.651
 Levels-1.631
TokenMemory
Token ID30871
Feature activation-1.35e+00

Pos logit contributions
 Signs+1.839
ecause+1.720
ateg+1.671
avorite+1.636
 Levels+1.601
Neg logit contributions
TRY-2.195
SER-2.065
hop-2.052
tub-2.045
EST-2.009
Token 4
Token ID604
Feature activation-3.00e-01

Pos logit contributions
TRY+5.228
tub+4.731
SER+4.718
hop+4.712
EST+4.683
Neg logit contributions
 Signs-8.426
ecause-8.289
avorite-8.008
ateg-7.978
estones-7.675
Token66
Token ID2791
Feature activation-7.08e-01

Pos logit contributions
TRY+1.478
SER+1.377
hop+1.365
tub+1.361
EST+1.334
Neg logit contributions
 Signs-1.713
ecause-1.614
ateg-1.577
avorite-1.542
 Levels-1.523
Token284
Token ID30336
Feature activation-4.89e-01

Pos logit contributions
TRY+3.074
SER+2.856
hop+2.822
tub+2.815
EST+2.731
Neg logit contributions
 Signs-4.418
ecause-4.169
ateg-4.074
avorite-3.989
 Levels-3.972
Token 4
Token ID604
Feature activation-3.71e-01

Pos logit contributions
TRY+1.929
hop+1.753
tub+1.749
SER+1.744
Unix+1.704
Neg logit contributions
ecause-3.158
 Signs-3.156
avorite-3.108
ateg-3.008
estones-2.903
Token64
Token ID2414
Feature activation-3.41e-01

Pos logit contributions
TRY+1.473
SER+1.342
hop+1.335
tub+1.333
EST+1.332
Neg logit contributions
 Signs-2.445
ecause-2.387
avorite-2.379
ateg-2.296
 Levels-2.216
Token bit
Token ID1643
Feature activation-1.13e+00

Pos logit contributions
TRY+3.822
tub+3.595
SER+3.549
hop+3.506
EST+3.441
Neg logit contributions
 Signs-5.131
ecause-4.921
ateg-4.810
avorite-4.776
estones-4.661
Token soft
Token ID2705
Feature activation-1.20e-01

Pos logit contributions
TRY+4.667
tub+4.537
hop+4.339
SER+4.288
EST+4.228
Neg logit contributions
 Signs-6.952
ecause-6.667
avorite-6.636
ateg-6.541
 Levels-6.336
Token and
Token ID290
Feature activation-4.10e-01

Pos logit contributions
TRY+0.639
SER+0.597
hop+0.593
tub+0.593
EST+0.580
Neg logit contributions
 Signs-0.640
ecause-0.600
ateg-0.587
avorite-0.575
 Levels-0.565
Token nice
Token ID3621
Feature activation-4.31e-01

Pos logit contributions
TRY+1.959
tub+1.821
SER+1.818
hop+1.805
EST+1.761
Neg logit contributions
 Signs-2.377
ecause-2.253
ateg-2.210
avorite-2.189
 Levels-2.130
Token.
Token ID13
Feature activation-9.04e-01

Pos logit contributions
TRY+2.138
tub+2.005
SER+1.983
hop+1.972
EST+1.949
Neg logit contributions
 Signs-2.415
ecause-2.272
ateg-2.222
avorite-2.213
 Levels-2.152
Token Once
Token ID4874
Feature activation-1.62e+00

Pos logit contributions
TRY+4.778
SER+4.488
hop+4.406
tub+4.371
EST+4.328
Neg logit contributions
avorite-4.441
 Signs-4.430
ecause-4.407
ateg-4.273
estones-4.168
Token they
Token ID484
Feature activation-1.10e+00

Pos logit contributions
TRY+6.912
tub+6.460
hop+6.402
SER+6.372
EST+6.138
Neg logit contributions
 Signs-9.370
ecause-8.937
avorite-8.920
ateg-8.873
estones-8.568
Token are
Token ID389
Feature activation-1.20e+00

Pos logit contributions
TRY+4.851
tub+4.657
hop+4.522
SER+4.430
EST+4.386
Neg logit contributions
 Signs-6.465
ecause-6.075
avorite-5.992
ateg-5.963
 Levels-5.845
Token ready
Token ID3492
Feature activation-7.69e-01

Pos logit contributions
TRY+5.028
tub+4.756
SER+4.613
hop+4.586
EST+4.471
Neg logit contributions
 Signs-7.288
ecause-6.924
avorite-6.893
ateg-6.812
estones-6.655
Token,
Token ID11
Feature activation-7.22e-01

Pos logit contributions
TRY+3.716
tub+3.441
SER+3.407
hop+3.389
EST+3.287
Neg logit contributions
 Signs-4.388
ecause-4.195
avorite-4.087
ateg-4.052
estones-3.964
Token throw
Token ID3714
Feature activation-5.43e-01

Pos logit contributions
TRY+3.304
tub+3.044
hop+3.029
SER+3.009
EST+2.917
Neg logit contributions
 Signs-4.263
ecause-4.092
avorite-4.020
ateg-3.968
estones-3.885
Token Foundation
Token ID5693
Feature activation-2.21e-01

Pos logit contributions
 Signs+0.265
ecause+0.249
ateg+0.243
avorite+0.237
 Levels+0.234
Neg logit contributions
TRY-0.262
SER-0.245
hop-0.243
tub-0.243
EST-0.237
Token,
Token ID11
Feature activation-1.30e-01

Pos logit contributions
TRY+1.192
SER+1.119
hop+1.116
tub+1.110
EST+1.088
Neg logit contributions
 Signs-1.141
ecause-1.084
ateg-1.045
avorite-1.032
 Levels-1.005
Token told
Token ID1297
Feature activation-2.63e-01

Pos logit contributions
TRY+0.662
hop+0.622
SER+0.620
tub+0.618
EST+0.602
Neg logit contributions
 Signs-0.708
ecause-0.671
ateg-0.651
avorite-0.644
 Levels-0.627
Token the
Token ID262
Feature activation-4.83e-02

Pos logit contributions
TRY+1.378
SER+1.292
hop+1.288
tub+1.275
EST+1.266
Neg logit contributions
 Signs-1.371
ecause-1.324
avorite-1.297
ateg-1.281
 Levels-1.228
Token Ou
Token ID42201
Feature activation+8.75e-02

Pos logit contributions
TRY+0.249
SER+0.233
hop+0.232
tub+0.230
EST+0.226
Neg logit contributions
 Signs-0.264
ecause-0.250
ateg-0.243
avorite-0.239
 Levels-0.234
Tokenach
Token ID620
Feature activation-1.36e+00

Pos logit contributions
 Signs+0.620
ecause+0.592
ateg+0.582
avorite+0.571
 Levels+0.565
Neg logit contributions
TRY-0.310
SER-0.279
tub-0.276
hop-0.275
idious-0.267
Tokenita
Token ID5350
Feature activation+3.20e-02

Pos logit contributions
TRY+5.818
hop+5.667
tub+5.359
EST+5.148
©+4.932
Neg logit contributions
ecause-8.063
 Signs-7.899
avorite-7.392
��-7.192
ateg-7.040
Token Citizen
Token ID22307
Feature activation+9.66e-02

Pos logit contributions
 Signs+0.154
ecause+0.144
ateg+0.140
avorite+0.138
 Levels+0.134
Neg logit contributions
TRY-0.185
SER-0.175
hop-0.174
tub-0.173
EST-0.171
Token that
Token ID326
Feature activation+3.12e-02

Pos logit contributions
 Signs+0.527
ecause+0.494
ateg+0.484
avorite+0.472
 Levels+0.466
Neg logit contributions
TRY-0.501
SER-0.467
hop-0.464
tub-0.463
idious-0.453
Token the
Token ID262
Feature activation-1.61e-01

Pos logit contributions
 Signs+0.165
ecause+0.156
ateg+0.152
avorite+0.150
 Levels+0.146
Neg logit contributions
TRY-0.165
SER-0.155
hop-0.154
tub-0.153
EST-0.151
Token think
Token ID892
Feature activation+1.66e-01

Pos logit contributions
TRY+0.785
SER+0.729
hop+0.728
tub+0.721
EST+0.708
Neg logit contributions
 Signs-0.919
ecause-0.874
ateg-0.854
avorite-0.840
 Levels-0.818
Token and
Token ID290
Feature activation-7.58e-01

Pos logit contributions
TRY+3.172
tub+2.963
SER+2.925
hop+2.908
EST+2.877
Neg logit contributions
 Signs-4.166
ecause-4.007
avorite-3.912
ateg-3.906
estones-3.746
Token cover
Token ID3002
Feature activation-1.18e+00

Pos logit contributions
TRY+3.252
tub+2.919
hop+2.908
SER+2.903
EST+2.809
Neg logit contributions
 Signs-4.746
ecause-4.516
avorite-4.496
ateg-4.358
estones-4.336
Token and
Token ID290
Feature activation-6.57e-01

Pos logit contributions
TRY+5.344
tub+5.025
SER+4.976
hop+4.897
EST+4.721
Neg logit contributions
 Signs-6.787
ecause-6.483
avorite-6.475
ateg-6.198
estones-6.169
Token let
Token ID1309
Feature activation-1.57e+00

Pos logit contributions
TRY+2.825
tub+2.523
hop+2.517
SER+2.514
EST+2.441
Neg logit contributions
 Signs-4.100
avorite-3.927
ecause-3.925
ateg-3.791
estones-3.759
Token it
Token ID340
Feature activation-9.86e-01

Pos logit contributions
TRY+6.512
tub+6.073
SER+6.068
hop+5.952
EST+5.887
Neg logit contributions
 Signs-9.133
ecause-8.844
avorite-8.638
estones-8.383
ateg-8.258
Token simmer
Token ID32857
Feature activation-1.51e+00

Pos logit contributions
TRY+3.338
tub+3.030
hop+2.998
SER+2.931
EST+2.848
Neg logit contributions
 Signs-7.036
ecause-6.690
avorite-6.590
ateg-6.461
estones-6.424
Token until
Token ID1566
Feature activation-1.94e+00

Pos logit contributions
TRY+5.554
hop+5.184
tub+5.135
SER+5.039
EST+5.018
Neg logit contributions
 Signs-9.644
avorite-9.215
ecause-9.132
ateg-8.926
estones-8.666
Token most
Token ID749
Feature activation-8.15e-01

Pos logit contributions
TRY+8.226
tub+7.710
SER+7.698
 attachment+7.561
hop+7.489
Neg logit contributions
 Signs-10.301
avorite-10.295
ecause-10.138
ateg-9.559
estones-9.448
Token of
Token ID286
Feature activation-4.53e-01

Pos logit contributions
TRY+4.029
tub+3.779
SER+3.761
hop+3.719
EST+3.643
Neg logit contributions
 Signs-4.518
ecause-4.244
ateg-4.159
avorite-4.125
 Levels-4.021
Token the
Token ID262
Feature activation-2.70e-01

Pos logit contributions
TRY+2.353
SER+2.203
hop+2.192
tub+2.191
EST+2.129
Neg logit contributions
 Signs-2.446
ecause-2.306
ateg-2.241
avorite-2.210
 Levels-2.153
Token water
Token ID1660
Feature activation-6.64e-01

Pos logit contributions
TRY+1.191
SER+1.098
tub+1.092
hop+1.090
EST+1.061
Neg logit contributions
 Signs-1.679
ecause-1.595
ateg-1.562
avorite-1.536
 Levels-1.510
Token tax
Token ID1687
Feature activation-7.57e-01

Pos logit contributions
TRY+0.456
SER+0.425
hop+0.424
tub+0.421
EST+0.412
Neg logit contributions
 Signs-0.534
ecause-0.508
ateg-0.494
avorite-0.485
 Levels-0.476
Tokena
Token ID64
Feature activation-3.43e-01

Pos logit contributions
TRY+2.888
hop+2.757
tub+2.696
arious+2.678
SER+2.674
Neg logit contributions
 Signs-4.886
ecause-4.719
avorite-4.550
ateg-4.523
 Levels-4.428
Token.
Token ID13
Feature activation-3.12e-01

Pos logit contributions
TRY+1.734
hop+1.631
SER+1.623
tub+1.612
idious+1.575
Neg logit contributions
 Signs-1.890
ecause-1.804
ateg-1.737
avorite-1.728
 Levels-1.676
Token Australian
Token ID6638
Feature activation-5.72e-01

Pos logit contributions
TRY+1.647
SER+1.544
hop+1.535
tub+1.518
EST+1.505
Neg logit contributions
 Signs-1.639
ecause-1.583
ateg-1.533
avorite-1.525
estones-1.458
Token Journal
Token ID4913
Feature activation-1.07e+00

Pos logit contributions
TRY+3.019
hop+2.866
SER+2.859
tub+2.830
EST+2.782
Neg logit contributions
 Signs-2.906
ecause-2.804
ateg-2.692
avorite-2.679
estones-2.586
Token of
Token ID286
Feature activation-1.52e+00

Pos logit contributions
TRY+4.550
hop+4.464
tub+4.242
SER+4.241
©+4.151
Neg logit contributions
ecause-6.323
avorite-6.293
 Signs-6.061
ateg-5.842
estones-5.697
Token Zo
Token ID31645
Feature activation-8.98e-01

Pos logit contributions
TRY+4.307
hop+4.210
SER+3.977
tub+3.927
EST+3.882
Neg logit contributions
ecause-10.415
 Signs-10.332
avorite-10.317
ateg-10.003
aminer-9.931
Tokenology
Token ID1435
Feature activation-1.03e+00

Pos logit contributions
TRY+3.190
hop+3.138
tub+2.908
idious+2.865
SER+2.856
Neg logit contributions
 Signs-6.010
avorite-5.915
ecause-5.776
 Levels-5.518
ateg-5.495
Token 62
Token ID8190
Feature activation-1.11e+00

Pos logit contributions
TRY+4.756
hop+4.574
SER+4.401
tub+4.359
EST+4.353
Neg logit contributions
ecause-5.775
 Signs-5.724
avorite-5.498
ateg-5.416
estones-5.313
Token (
Token ID357
Feature activation-6.55e-01

Pos logit contributions
TRY+4.711
hop+4.430
SER+4.374
EST+4.365
tub+4.267
Neg logit contributions
ecause-6.579
 Signs-6.516
avorite-6.202
ateg-6.131
estones-5.966
Token5
Token ID20
Feature activation-4.23e-01

Pos logit contributions
TRY+2.612
SER+2.412
hop+2.402
EST+2.344
tub+2.336
Neg logit contributions
 Signs-4.308
ecause-4.168
ateg-4.016
avorite-4.009
 Levels-3.901