From mboxrd@z Thu Jan 1 00:00:00 1970
Received: from mailout.scc.kit.edu (mailout.scc.kit.edu [129.13.185.202])
by krisdoz.my.domain (8.14.5/8.14.5) with ESMTP id s7ENosSl023226
for ; Thu, 14 Aug 2014 19:50:55 -0400 (EDT)
Received: from hekate.usta.de (asta-nat.asta.uni-karlsruhe.de [172.22.63.82])
by scc-mailout-02.scc.kit.edu with esmtp (Exim 4.72 #1)
id 1XI4nC-0000jl-5J; Fri, 15 Aug 2014 01:50:50 +0200
Received: from donnerwolke.usta.de ([172.24.96.3])
by hekate.usta.de with esmtp (Exim 4.77)
(envelope-from )
id 1XI4nC-00082Z-2T; Fri, 15 Aug 2014 01:50:50 +0200
Received: from iris.usta.de ([172.24.96.5] helo=usta.de)
by donnerwolke.usta.de with esmtp (Exim 4.72)
(envelope-from )
id 1XI4nC-0005J5-0g; Fri, 15 Aug 2014 01:50:50 +0200
Received: from schwarze by usta.de with local (Exim 4.77)
(envelope-from )
id 1XI4mR-0000Yz-Bn; Fri, 15 Aug 2014 01:50:03 +0200
Date: Fri, 15 Aug 2014 01:50:02 +0200
From: Ingo Schwarze
To: Kristaps Dzonsons
Cc: tech@mdocml.bsd.lv
Subject: Re: HTML5 redux
Message-ID: <20140814235002.GM29858@iris.usta.de>
References: <53EBE549.2000100@bsd.lv>
<20140814012540.GI26534@iris.usta.de>
<53ED1B85.6070202@bsd.lv>
X-Mailinglist: mdocml-tech
Reply-To: tech@mdocml.bsd.lv
MIME-Version: 1.0
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
In-Reply-To: <53ED1B85.6070202@bsd.lv>
User-Agent: Mutt/1.5.21 (2010-09-15)
Hi Kristaps,
Kristaps Dzonsons wrote on Thu, Aug 14, 2014 at 10:26:45PM +0200:
> Ingo Schwarze wrote:
>> Kristaps Dzonsons wrote:
>>> With this patch, both -Thtml and -Txhtml will produce HTML5. This
>>> can be disabled in html.c by replacing the HTML_HTML5 allocation
>>> with the respective type. The resulting HTML5 validates just fine,
>>> as do the existing modes (if replaced in html.c).
>> Not for me, there is one regression.
>> But if you remove the chunk cited below, the regression goes
>> away, so i'd say, go ahead - in particular if you plan to do the
>> cleanup right afterwards.
> This is a bit trickier than appears.
I fear i don't follow where the problem is.
What i'm saying is that
- output is valid when you leave the HTML_NOSTACK in place
-
output is (often, not always) invalid when you delete it
> Basically,
doesn't exist in HTML5 because isn't a void
> element.
Sure. However, i don't see how that is relevant. For HTML5,
html.c never generates
no matter what we do with HTML_NOSTACK.
Two remarks regarding HTML_AUTOCLOSE:
1) It is obsolete. It is set if and only if HTML_NOSTACK is set,
so one flag would suffice.
2) After the full switch to HTML5, it is obsolete for a second
reason: it is never used for anything in HTML5 mode.
Already now, the only place where it is used for syntactically
significant output, in print_otag(), is only executed when
HTML_XHTML_1_0_STRICT is set.
> However, we can't just remove the auto-close parts of
> because then we might have
>
>
> Hello
>
world
>
>
> Which is invalid.
Exactly what is said.
But wy don't we just leave the HTML_NOSTACK as it is now?
> http://www.w3.org/TR/html-markup/p.html#p
>
> In short, mdoc(7) and man(7) can have all sorts of nested block
> scopes, and so can HTML, but can't contain flow elements.
Sure. It just ends implicitly when a flow element begins.
> I think it's best, and cleanest, to just avoid
and use
,
> say as
.
I don't understand which problem you think that might solve.
But it would make producing well-formed HTML5 output harder,
not easier. Because, if you make .Pp a
, how do you know
when to close it out? It don't say you cannot come up with a
working algorithm, you certainly can. But you just make it
harder than if you just open it and never ever close it again.
> I understand we lose whatever semantic-ness may be acquired from
>
, but I don't think it's much of a loss
Actually, it is no loss at all, since .Pp is *not* implemented
as a block but as an in_line element. It doesn't mark up any
content in any way, it just says "put a paragraph break here",
exactly like .br says "put a line break here".
> given that, with a classed
, we'll have *real* semantics
> (the flow elements will be within the paragraph).
I have no idea what the advantage would be...
> Naturally, this would work happily with HTML4 just as it
> would with HTML5.
So does what we have now as far as i can see.
> I'll work on a patch and will submit it soon...
Actually, there is one problem with
, or rather, with
.
The .Bd block is marked up as , which can only contain
phrasing, but .Bd can contain all sorts of blocks and .Pp.
But that's unrelated to the HTML5 switch.
Yours,
Ingo
--
To unsubscribe send an email to tech+unsubscribe@mdocml.bsd.lv