From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.org/gmane.linux.lib.musl.general/3836 Path: news.gmane.org!not-for-mail From: Rich Felker Newsgroups: gmane.linux.lib.musl.general Subject: Re: Re: iconv Korean and Traditional Chinese research so far Date: Mon, 5 Aug 2013 15:12:47 -0400 Message-ID: <20130805191246.GM221@brightrain.aerifal.cx> References: <20130804165152.GA32076@brightrain.aerifal.cx> Reply-To: musl@lists.openwall.com NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1375729977 21164 80.91.229.3 (5 Aug 2013 19:12:57 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 5 Aug 2013 19:12:57 +0000 (UTC) To: musl@lists.openwall.com Original-X-From: musl-return-3840-gllmg-musl=m.gmane.org@lists.openwall.com Mon Aug 05 21:13:01 2013 Return-path: Envelope-to: gllmg-musl@plane.gmane.org Original-Received: from mother.openwall.net ([195.42.179.200]) by plane.gmane.org with smtp (Exim 4.69) (envelope-from ) id 1V6QDD-0007k8-VY for gllmg-musl@plane.gmane.org; Mon, 05 Aug 2013 21:13:00 +0200 Original-Received: (qmail 24031 invoked by uid 550); 5 Aug 2013 19:12:59 -0000 Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: Original-Received: (qmail 24023 invoked from network); 5 Aug 2013 19:12:59 -0000 Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Xref: news.gmane.org gmane.linux.lib.musl.general:3836 Archived-At: On Mon, Aug 05, 2013 at 04:28:32PM +0800, Roy wrote: > Since I'm a Traditional Chinese and Japanese legacy encoding user, I > think I can say something here. > [...] > There is another Big5 extension called Big5-UAO, which is being used > in world's largest telnet-based BBS called "ptt.cc". > > It has two tables, one for Big5-UAO to Unicode, another one is > Unicode to Big5-UAO. > http://moztw.org/docs/big5/table/uao250-b2u.txt > http://moztw.org/docs/big5/table/uao250-u2b.txt > > Which extends DBCS lead byte to 0x81. OK, I've been trying to do some research on this and I turned up: http://lists.w3.org/Archives/Public/public-html-ig-zh/2012Apr/0061.html http://lists.gnu.org/archive/html/bug-gnu-libiconv/2010-11/msg00007.html My impression (please correct me if I'm wrong) is that you can't use Big5-UAO as the system encoding on modern versions of Windows (just ancient ones where you install unmaintained third-party software that hacks the system charset tables) and that it's not supported in GNU libiconv. If this is the case, and especially if Big5-UAO's main use is on a telnet-based BBS where everybody is using special telnet clients that have their own Big5-UAO converters, I'd find it really hard to justify trying to support this. But I'm open to hearing arguments on why we should, if you believe it's important. Rich