From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from smtp1.rz.uni-karlsruhe.de (Debian-exim@smtp1.rz.uni-karlsruhe.de [129.13.185.217]) by krisdoz.my.domain (8.14.3/8.14.3) with ESMTP id o67KDGN6000826 for ; Wed, 7 Jul 2010 16:13:17 -0400 (EDT) Received: from hekate.usta.de (asta-nat.asta.uni-karlsruhe.de [172.22.63.82]) by smtp1.rz.uni-karlsruhe.de with esmtp (Exim 4.63 #1) id 1OWazb-0003cp-3Q; Wed, 07 Jul 2010 22:13:15 +0200 Received: from donnerwolke.usta.de ([172.24.96.3]) by hekate.usta.de with esmtp (Exim 4.71) (envelope-from ) id 1OWazb-0005Yn-25 for discuss@mdocml.bsd.lv; Wed, 07 Jul 2010 22:13:15 +0200 Received: from iris.usta.de ([172.24.96.5] helo=usta.de) by donnerwolke.usta.de with esmtp (Exim 4.69) (envelope-from ) id 1OWazb-0004EH-16 for discuss@mdocml.bsd.lv; Wed, 07 Jul 2010 22:13:15 +0200 Received: from schwarze by usta.de with local (Exim 4.71) (envelope-from ) id 1OWaza-0002Xq-O4 for discuss@mdocml.bsd.lv; Wed, 07 Jul 2010 22:13:14 +0200 Date: Wed, 7 Jul 2010 22:13:14 +0200 From: Ingo Schwarze To: discuss@mdocml.bsd.lv Subject: Re: Raw UTF-8? Message-ID: <20100707201314.GB19725@iris.usta.de> References: <4c33f0f0.0c87970a.3458.fffff43f@mx.google.com> <4C3449D5.7020106@bsd.lv> <4c349192.071f960a.7247.2ccd@mx.google.com> X-Mailinglist: mdocml-discuss Reply-To: discuss@mdocml.bsd.lv MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <4c349192.071f960a.7247.2ccd@mx.google.com> User-Agent: Mutt/1.5.20 (2009-06-14) Hi Anthony, Anthony J. Bentley wrote on Wed, Jul 07, 2010 at 08:39:09AM -0600: > mandoc_char(7) isn't much help either, as it's several pages long and > doesn't display the actual character in the terminal, See the problem? In the manual you write, \(/o won't be displayed either, no matter how you try to input it. The output is the problem here, not the input. I think we should reorganize mandoc_char(7) in the following way. 1) First the sane escape sequences that render well everywhere and serve a real purpose. Examples: \~ \ \& \(ba \(em \(en \(hy \e \- The number of sane sequences is relatively small, which will also solve the problem you are pointing out above: mandoc_char(7) is unreasonably long. A typesetting system like TeX needs long character tables, and it needs the ability to print obscure characters. A manual page does not. 2) Then a sentence explaining that what follows is rarely needed, because mandoc(1) is not really intended for general purpose typesetting, and much less typesetting of mathematical formulas, but just for writing manuals, encouraging people to express their intention using text, not symbols. Still, all escape sequences in this section are guaranteed to render well and may be useful in uncommon situations. Examples: \(co \(rg \(-> \(rA \(+- \(<= \*(Pi 3) Then a sentence explaining that what follows is rarely needed, because mdoc(7) has alternative concepts handling the typical use case better. Note that writing new man(7) code is discouraged anyway. Still, all escape sequences in this section are guaranteed to render well and may be useful in very uncommon situations. Examples: \0 -- use .Dl or .Bd -literal \bu -- use .Bl -bullet \lq -- use .Qq or .Qo \lB -- use .Bq or .Bc 4) Then a sentence mildly discouraging the use of what follows, listing obsolete sequences that render well, but are not needed at all because equivalent recommended escape sequences exist. Examples: \^ \% \| \(at \(mi \(eq \*(Ba 5) Then a sentence strongly discouraging any use of the rest, listing those escape sequences that render badly. Examples: \(r! \(sr \(ss \('e \(`u \(~n \(:a \(^o \(,c \(oa \(eu Obviously, this is by far the largest group. Yours, Ingo -- To unsubscribe send an email to discuss+unsubscribe@mdocml.bsd.lv