From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from mail-qt0-f172.google.com (mail-qt0-f172.google.com [209.85.216.172]) by fantadrom.bsd.lv (OpenSMTPD) with ESMTP id 100f5fa4 for ; Fri, 25 Aug 2017 04:53:01 -0500 (EST) Received: by mail-qt0-f172.google.com with SMTP id v29so8791371qtv.3 for ; Fri, 25 Aug 2017 02:53:01 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=mime-version:from:date:message-id:subject:to; bh=4iMRGZYQVpCQiCzA98ZMjCBM5/szSQ+E+4McL0VdiMc=; b=vRl75DfmVsMx4BjpYOIHT46OUvRfqGIttzOBmq7yVrIMwKpkf1unGfFROsr8kNno7t 9R2Iz4UUqg7nTKWterDgxqsCqwRl5+4/KOHMcaB0ZKJnQNdL9Qy9cEPmZ7Ex6wATd/Gy 4YsUP9H8XBBnPXXpH7hyKTcluVa3dy1aYUhQqPZQmPMMxSdgIfjfYOv/ejrEaTyEflNI ypZrbClXYvEicUx7eE/AWk0owqaCpmsymMTmcXY8TuPc9V+OyOMdHcJ3GSw508q6kQhO xfGy04BP5zv82ZGzbBA3EPQl7ouYz0EtsNBRzZyGkgkrgjBsqhEyfqit4NrJQoQwfEfy kYxw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:mime-version:from:date:message-id:subject:to; bh=4iMRGZYQVpCQiCzA98ZMjCBM5/szSQ+E+4McL0VdiMc=; b=m3pEWFG8hPuipo6p+5abE9Vu3LhCMXvu6IilJypFZwPiqGSxVPB4rh+ZLS9h+I15q1 0x2hw2o19p0mOH3HztfSBgYkyWks+bQSdOk2sbGJNhdBTKTcX2SIbDXXrR6migCokNTy veCN3Fh9ciW0p4eFRTfbwl1yi+sI5qyWzJfLIquSe13MF1SLtMzABvZvV6eEKj08TVDJ bYu8xHa8g9/9KPbAGp4lUXccEtQ7PKLnMhENfyaBfwdpxmoICMwZn40e+L21OdvxL/p6 I/He7b6v8u2KeR2+oyiody6epsWFNhcsD3a3XB56NqLu1ovflDgJa2xZP3j+BcGAV5Lu l9Tg== X-Gm-Message-State: AHYfb5gbtSUCAyXqrNy+HiTUOa94op36jScTxpqCIjKf1D95zkkkZ2F1 hXYDEu4B7+L74FRkg5VekG+fFHHCNNJf X-Received: by 10.200.47.207 with SMTP id m15mr9700458qta.15.1503654778827; Fri, 25 Aug 2017 02:52:58 -0700 (PDT) X-Mailinglist: mandoc-discuss Reply-To: discuss@mandoc.bsd.lv MIME-Version: 1.0 Received: by 10.140.81.199 with HTTP; Fri, 25 Aug 2017 02:52:58 -0700 (PDT) From: Jackson Pauls Date: Fri, 25 Aug 2017 10:52:58 +0100 Message-ID: Subject: mandoc HTML output: minor issues To: discuss@mandoc.bsd.lv Content-Type: text/plain; charset="UTF-8" Hi, I use mandoc 1.14.3 to generate HTML from mdoc sources. I've noticed a few issues with the HTML output, in no particular order: 1. Duplicate IDs (invalid HTML). E.g. dash.1 gets two elements with id="HISTSIZE": ... 1138
HISTSIZE
... 1921
HISTSIZE
... (1138 and 1921 are the line numbers as per my version.) 2. Broken in-page links. E.g. in dash.1, there is an

with id="White_Space_Splitting_(Field_Splitting)", but the link to that heading is missing the bit in parentheses (href="#White_Space_Splitting"): ... 867

White 868 Space Splitting (Field Splitting)

... 1915 See the White Space 1916 Splitting section for more details. ... Perhaps this is an issue with the mdoc source, but maybe links shouldn't be created in this case: ... 1014 .Ss White Space Splitting (Field Splitting) ... 2300 .Sx White Space Splitting ... 3. In babel.1, for the -: flag, the colon appears outside the .Fl element: ... 63 -: ... I expected: -: mdoc source line: ... 21 .Op Fl i Ar input-type | Fl : Qo Ar SMILES-string Qc ... 4. Running v.Nu (https://validator.github.io/validator/) on a collection of HTML files generated by mandoc picks up duplicate IDs, and a bunch of other issues: * unescaped characters in href attributes (%, "), * unescaped characters in URL fragments (\, {, }, #, ^, [, ], <, |), *
s appearing inside
s, and
* mismatches between column count in  and table rows.
I can gather a bunch of examples if of interest.


5. Finally, I see mandoc adds today's date to the footer if it can't
parse one from the source file. I think this can be misleading, making
it appear a man page has been updated more recently than it actually
has. It would be nice to have an option to disable this behaviour, and
output an empty string or "UNDATED" instead.


mandoc is a pleasure to use btw, I hope the above is useful.


Cheers, Jackson

www.mankier.com
--
 To unsubscribe send an email to discuss+unsubscribe@mandoc.bsd.lv