From: Rich Felker
To: musl@lists.openwall.com
Subject: Re: Revisiting byte-based C locale
Date: Fri, 22 May 2015 00:09:59 -0400
Message-ID: <20150522040959.GJ17573@brightrain.aerifal.cx>
References: <20150522022203.GA26651@brightrain.aerifal.cx>

On Thu, May 21, 2015 at 11:04:47PM -0500, Josiah Worcester wrote:
> Given the POSIX rules on LC_CTYPE character classes affecting
> [[:alpha:]], it seems to me now that the clear intent (if not
> statement) is in fact for a byte-based C locale. Though maybe
> unfortunate, it does seem as though that is in fact the most
> conformant way of doing it, and conforming looks to have little cost
> now.

Not necessarily.
There's no rule against the existence of additional characters in the
C locale -- in fact, the proposal to make the C locale "8-bit-clean"
requires an additional 128 characters -- but the additional ones can't
be in classes like alpha.

Rich