From: Rich Felker
Newsgroups: gmane.linux.lib.musl.general
Subject: Re: Revisiting byte-based C locale
Date: Thu, 4 Jun 2015 21:39:11 -0400
Message-ID: <20150605013911.GT17573@brightrain.aerifal.cx>
References: <20150522022203.GA26651@brightrain.aerifal.cx> <20150604205332.GS17573@brightrain.aerifal.cx> <87eglrchph.fsf@gmail.com>
In-Reply-To: <87eglrchph.fsf@gmail.com>
Reply-To: musl@lists.openwall.com
To: musl@lists.openwall.com

On Thu, Jun 04, 2015 at 11:00:10PM +0200, Christian Neukirchen wrote:
> Rich Felker writes:
>
> > On Thu, May 21, 2015 at 10:22:03PM -0400, Rich Felker wrote:
> >> Any new opinions on the topic? Or interest in re-emphasizing a
> >> previously stated opinion? :)
> >
> > No new opinions on this?
> > I've tentatively added drafting a new
> > proposed byte-based C locale patch as a roadmap item for this release
> > cycle, not necessarily to commit it, but as a way to re-evaluate
> > whether it's still costly to implement.
>
> Will it support regexec on 8-bit binary data?

Yes, as long as the program has done one of the following:

- Not called setlocale at all.
- Called setlocale with an explicit "C" argument or in environment.
- Called uselocale with a locale_t for "C".

> We found out file(1)
> needs this.

Indeed, aside from the Austin Group issue 663, having this topic come
up several times in real-world usage is the motivation for
reconsidering it. I believe file(1) _attempts_ to do this right,
making use of uselocale.

Rich
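
For illustration, the third condition listed in the message (calling
uselocale with a locale_t for "C") could look roughly like the sketch
below. This is not code from file(1) or from musl; the helper name
match_in_c_locale is hypothetical, and whether a high byte such as 0xff
matches "." depends on the libc treating the "C" locale as byte-based,
which is exactly what the thread is discussing.

```c
#define _POSIX_C_SOURCE 200809L
#include <locale.h>
#include <regex.h>
#include <stddef.h>

/*
 * Hypothetical helper (an assumption for illustration only).
 * Switches the calling thread to the "C" locale with uselocale,
 * compiles `pattern` as a POSIX extended regex, and runs regexec
 * on the NUL-terminated byte string `data`.
 *
 * Returns 1 on match, 0 on no match, -1 if setup failed.
 */
int match_in_c_locale(const char *pattern, const char *data)
{
    /* Build a locale object for "C" covering all categories. */
    locale_t c_loc = newlocale(LC_ALL_MASK, "C", (locale_t)0);
    if (c_loc == (locale_t)0)
        return -1;

    /* Install it for this thread only; remember the previous one. */
    locale_t old = uselocale(c_loc);

    regex_t re;
    int result = -1;
    if (regcomp(&re, pattern, REG_EXTENDED | REG_NOSUB) == 0) {
        result = (regexec(&re, data, 0, NULL, 0) == 0);
        regfree(&re);
    }

    /* Restore the previous thread locale before freeing ours. */
    uselocale(old);
    freelocale(c_loc);
    return result;
}
```

A caller might pass the pattern "a.b" and the bytes 'a', 0xff, 'b'
(written in C as "a\xff" "b", split so the hex escape does not swallow
the 'b'). Note that regexec takes a NUL-terminated string, so data with
embedded zero bytes needs separate handling.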