From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.org/gmane.linux.lib.musl.general/3855 Path: news.gmane.org!not-for-mail From: Rich Felker Newsgroups: gmane.linux.lib.musl.general Subject: Re: Re: Status of Big5 and extensions Date: Thu, 8 Aug 2013 01:31:07 -0400 Message-ID: <20130808053107.GL221@brightrain.aerifal.cx> References: <20130807165044.GA14867@brightrain.aerifal.cx> <20130808021118.GI221@brightrain.aerifal.cx> <20130808035321.GN25714@port70.net> <20130808043035.GK221@brightrain.aerifal.cx> <20130808045057.GO25714@port70.net> Reply-To: musl@lists.openwall.com NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: ger.gmane.org 1375939879 22347 80.91.229.3 (8 Aug 2013 05:31:19 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Thu, 8 Aug 2013 05:31:19 +0000 (UTC) To: musl@lists.openwall.com Original-X-From: musl-return-3859-gllmg-musl=m.gmane.org@lists.openwall.com Thu Aug 08 07:31:21 2013 Return-path: Envelope-to: gllmg-musl@plane.gmane.org Original-Received: from mother.openwall.net ([195.42.179.200]) by plane.gmane.org with smtp (Exim 4.69) (envelope-from ) id 1V7Ioi-0001th-T2 for gllmg-musl@plane.gmane.org; Thu, 08 Aug 2013 07:31:21 +0200 Original-Received: (qmail 32497 invoked by uid 550); 8 Aug 2013 05:31:20 -0000 Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: Original-Received: (qmail 32489 invoked from network); 8 Aug 2013 05:31:20 -0000 Content-Disposition: inline In-Reply-To: <20130808045057.GO25714@port70.net> User-Agent: Mutt/1.5.21 (2010-09-15) Xref: news.gmane.org gmane.linux.lib.musl.general:3855 Archived-At: On Thu, Aug 08, 2013 at 06:50:57AM +0200, Szabolcs Nagy wrote: > * Rich Felker [2013-08-08 00:30:35 -0400]: > > On Thu, Aug 08, 2013 at 05:53:21AM +0200, Szabolcs Nagy wrote: > > > note that > > > C87A, C87C, C8A4 are mapped to 2xxxx in hkscs > > > (old hkscs pua codes agree with uao) > > > > OK, so is this non-conflicting? > > > > i just wanted to note that these are the only codes > in the 89x157 table that map to non-16bit unicode > codepoints > > there is no conflict OK, great. I think rather than using any fancy tables for the 89x157 grid and HKSCS, we can just special-case these two in the code. It also looks to me like HKSCS and UAO are essentially non-conflicting in the 89x157 grid once you remove PUA junk; the only real conflict is the half-width kana UAO mapped over part of HKSCS. So it may make sense to just add all the extended mappings in this range except the kana to the base Big5 table. This would definitely simplify HKSCS support. If we later want UAO support, this range could just be special-cased algorithmically since it seems to be direct range mappings to Unicode. Rich