From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from smtp-2.sys.kth.se (smtp-2.sys.kth.se [130.237.32.160]) by krisdoz.my.domain (8.14.3/8.14.3) with ESMTP id p4I9s4rZ026760 for ; Wed, 18 May 2011 05:54:04 -0400 (EDT) Received: from mailscan-1.sys.kth.se (mailscan-1.sys.kth.se [130.237.32.91]) by smtp-2.sys.kth.se (Postfix) with ESMTP id AE79314DEEE for ; Wed, 18 May 2011 11:53:58 +0200 (CEST) X-Virus-Scanned: by amavisd-new at kth.se Received: from smtp-2.sys.kth.se ([130.237.32.160]) by mailscan-1.sys.kth.se (mailscan-1.sys.kth.se [130.237.32.91]) (amavisd-new, port 10024) with LMTP id gzCJP0GWTlom for ; Wed, 18 May 2011 11:53:56 +0200 (CEST) X-KTH-Auth: kristaps [213.103.216.43] X-KTH-mail-from: kristaps@bsd.lv X-KTH-rcpt-to: discuss@mdocml.bsd.lv Received: from macky.local (s213-103-216-43.cust.tele2.se [213.103.216.43]) by smtp-2.sys.kth.se (Postfix) with ESMTP id A862D14DCB1 for ; Wed, 18 May 2011 11:53:56 +0200 (CEST) Message-ID: <4DD39734.70004@bsd.lv> Date: Wed, 18 May 2011 11:53:56 +0200 From: Kristaps Dzonsons User-Agent: Mozilla/5.0 (Macintosh; U; Intel Mac OS X 10.6; en-US; rv:1.9.2.17) Gecko/20110414 Thunderbird/3.1.10 X-Mailinglist: mdocml-discuss Reply-To: discuss@mdocml.bsd.lv MIME-Version: 1.0 To: discuss@mdocml.bsd.lv Subject: Re: Full mandoc locale support committed. References: <4DD2FFB0.2070303@bsd.lv> <20110518062137.GD1321@procyon.xvoid.org> In-Reply-To: <20110518062137.GD1321@procyon.xvoid.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit >> With this last commit, initial [full] locale support has been fitted >> into mandoc! Attached is eye-candy: a manual full of random Unicode >> input (\[uNNNN]) first with -Tascii, then with -Tlocale. > > Unicode escapes work for me, but does that mean that existing localized > manpages can't be used? Tried several ja and ru from debian 6, all of > them make mandoc die with "FATAL: line scope broken, syntax violated" > (-Tlint talks a lot about "ERROR: skipping bad character: ignoring > byte"). Yuri, Groff doesn't actually accept multi-byte input: it translates with preconv from the localised form to the Unicode escapes. This is hidden away with the `-k' groff option. See: http://manpages.ubuntu.com/manpages/lucid/man1/groff.1.html http://manpages.ubuntu.com/manpages/lucid/man1/preconv.1.html (Why doesn't groff have its own manuals online instead of just the texinfo? I bet it's cause grohtml doesn't look so good... ;)) We'll worry about doing preconv-style translation later. For this release, the push is for the Unicode support itself. >> The check-ins: >> >> (1) http://mdocml.bsd.lv/archives/source/0920.html > > A small typo here in Makefile's comment - USE_CHAR. Fixed---thanks. > Checked on Solaris 11 and Illumos - both do not export the symbol, > though commenting out #undef USE_WCHAR in term_ascii.c makes -Tlocale > work. Great news! (The Internet agrees, as there are bug reports regarding Solaris and 10646.) I'm still trying to figure out a good way to let the lack of this symbol be ignored when it's supported anyway (as in Solaris' case). For the release itself, I'll probably let USE_WCHAR be authoritative and set or unset by downstream. Thanks again, Kristaps -- To unsubscribe send an email to discuss+unsubscribe@mdocml.bsd.lv