From: Rich Felker
Newsgroups: gmane.linux.lib.musl.general
Subject: Re: Re: Status of Big5 and extensions
Date: Thu, 8 Aug 2013 03:19:45 -0400
Message-ID: <20130808071945.GO221@brightrain.aerifal.cx>
References: <20130807165044.GA14867@brightrain.aerifal.cx>
 <20130808021118.GI221@brightrain.aerifal.cx>
 <20130808035321.GN25714@port70.net>
 <20130808043035.GK221@brightrain.aerifal.cx>
 <20130808045057.GO25714@port70.net>
 <20130808053107.GL221@brightrain.aerifal.cx>
In-Reply-To: <20130808053107.GL221@brightrain.aerifal.cx>
Reply-To: musl@lists.openwall.com
To: musl@lists.openwall.com

On Thu, Aug 08, 2013 at 01:31:07AM -0400, Rich Felker wrote:
> On Thu, Aug 08, 2013 at 06:50:57AM +0200, Szabolcs Nagy wrote:
> > * Rich Felker [2013-08-08 00:30:35 -0400]:
> > > On Thu, Aug 08, 2013 at 05:53:21AM +0200, Szabolcs Nagy wrote:
> > > > note that
> > > > C87A, C87C, C8A4 are mapped to 2xxxx in hkscs
> > > > (old hkscs pua codes agree with uao)
> > >
> > > OK, so is this non-conflicting?
> > >
> >
> > i just wanted to note that these are the only codes
> > in the 89x157 table that map to non-16bit unicode
> > codepoints
> >
> > there is no conflict
>
> OK, great. I think rather than using any fancy tables for the 89x157
> grid and HKSCS, we can just special-case these two in the code.
>
> It also looks to me like HKSCS and UAO are essentially non-conflicting
> in the 89x157 grid once you remove PUA junk; the only real conflict is
> the half-width kana UAO mapped over part of HKSCS. So it may make
> sense to just add all the extended mappings in this range except the
> kana to the base Big5 table. This would definitely simplify HKSCS
> support. If we later want UAO support, this range could just be
> special-cased algorithmically since it seems to be direct range
> mappings to Unicode.

It seems this sort of unification has been attempted already, and it
looks like it largely succeeded. See:

http://lists.whatwg.org/htdig.cgi/whatwg-whatwg.org/2012-April/035330.html

Rich
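
For readers following the thread, here is a minimal sketch of how a two-byte Big5 code could be turned into a cell index in the 89x157 grid discussed above. It assumes the usual Big5 layout (lead bytes 0xA1..0xF9, trail bytes 0x40..0x7E and 0xA1..0xFE); the function name big5_grid_index and the macro names are hypothetical and are not taken from musl. The actual Unicode values for C87A, C87C and C8A4 are not given in the thread, so they are left out.

```c
#include <stddef.h>

/* Grid dimensions named in the thread; the byte ranges are an assumption
 * based on the usual Big5 layout. */
#define BIG5_ROWS 89   /* lead bytes 0xA1..0xF9 */
#define BIG5_COLS 157  /* trail bytes 0x40..0x7E (63) + 0xA1..0xFE (94) */

/* Hypothetical helper: returns the cell index in [0, 89*157) for a
 * two-byte code, or -1 if the code lies outside the grid. */
static int big5_grid_index(unsigned lead, unsigned trail)
{
    unsigned col;

    /* Reject lead bytes outside the 89 rows. */
    if (lead < 0xA1 || lead - 0xA1 >= BIG5_ROWS)
        return -1;

    /* The trail byte has a gap between 0x7E and 0xA1; close it up. */
    if (trail >= 0x40 && trail <= 0x7E)
        col = trail - 0x40;
    else if (trail >= 0xA1 && trail <= 0xFE)
        col = trail - 0xA1 + 63;
    else
        return -1;

    return (int)((lead - 0xA1) * BIG5_COLS + col);
}
```

With this layout, big5_grid_index(0xC8, 0x7A) gives 6181, so a decoder could compare the index (or the raw code) against the handful of cells that map outside 16-bit Unicode before falling back to a 16-bit table lookup.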