From mboxrd@z Thu Jan 1 00:00:00 1970
X-Msuck: nntp://news.gmane.org/gmane.linux.lib.musl.general/3201
Path: news.gmane.org!not-for-mail
From: Rich Felker
Newsgroups: gmane.linux.lib.musl.general
Subject: Re: High-priority library replacements?
Date: Thu, 25 Apr 2013 08:18:01 -0400
Message-ID: <20130425121801.GL20323@brightrain.aerifal.cx>
References: <20130425041553.GA13951@brightrain.aerifal.cx> <20130425055516.GA82796@intma.in> <5178DC8B.8010705@gmail.com>
Reply-To: musl@lists.openwall.com
NNTP-Posting-Host: plane.gmane.org
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-Trace: ger.gmane.org 1366892295 22014 80.91.229.3 (25 Apr 2013 12:18:15 GMT)
X-Complaints-To: usenet@ger.gmane.org
NNTP-Posting-Date: Thu, 25 Apr 2013 12:18:15 +0000 (UTC)
To: musl@lists.openwall.com
Original-X-From: musl-return-3205-gllmg-musl=m.gmane.org@lists.openwall.com Thu Apr 25 14:18:20 2013
Envelope-to: gllmg-musl@plane.gmane.org
Original-Received: from mother.openwall.net ([195.42.179.200]) by plane.gmane.org with smtp (Exim 4.69) (envelope-from ) id 1UVL7v-0005aT-KF for gllmg-musl@plane.gmane.org; Thu, 25 Apr 2013 14:18:15 +0200
Original-Received: (qmail 7680 invoked by uid 550); 25 Apr 2013 12:18:14 -0000
Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm
Precedence: bulk
Original-Received: (qmail 7672 invoked from network); 25 Apr 2013 12:18:14 -0000
Content-Disposition: inline
In-Reply-To: <5178DC8B.8010705@gmail.com>
User-Agent: Mutt/1.5.21 (2010-09-15)
Xref: news.gmane.org gmane.linux.lib.musl.general:3201

On Thu, Apr 25, 2013 at 09:34:35AM +0200, Jens Staal wrote:
> On 2013-04-25 07:55, Kurt H Maier wrote:
> >On Thu, Apr 25, 2013 at 12:15:53AM -0400, Rich Felker wrote:
> >>
> >>1. A light, C, UTF-8-only Unicode library.
> >
> >The only one I know of that might not already be common knowledge is
> >Connor Lane Smith's libutf: https://github.com/cls/libutf
>
> Does it work as a drop-in replacement for unixy utf/wchar
> implementations

This is exactly what libutf does. It's not a Unicode library in the
form I described; it's just a workaround for broken systems lacking
UTF-8 locales, for which it replaces all the important functions
(mb*towc and wc*tomb, etc.) with versions that convert UTF-8. I
believe it also replaces the is* and to* functions and maybe a few
others. But it doesn't help with any Unicode processing needs the
application might have outside the scope of C or POSIX, and it's not
meant to be used on systems that already have proper UTF-8 support at
the system library level.

Rich
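
For illustration only, here is roughly what an mbtowc-style function
that converts UTF-8 regardless of locale might look like. This is a
minimal sketch, not libutf's code: the name utf8_decode_sketch and its
signature are made up for this example, and it handles only the
decoding direction.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper (not libutf's actual API): decode one UTF-8
 * sequence from s, reading at most n bytes, and store the code point
 * in *out. Returns the number of bytes consumed, or -1 if the input
 * is invalid or truncated. */
int utf8_decode_sketch(uint32_t *out, const unsigned char *s, size_t n)
{
    uint32_t cp, min;
    int len, i;

    if (n == 0)
        return -1;

    if (s[0] < 0x80) {                 /* 1 byte: plain ASCII */
        *out = s[0];
        return 1;
    } else if ((s[0] & 0xE0) == 0xC0) { /* 2-byte lead: 110xxxxx */
        cp = s[0] & 0x1F; len = 2; min = 0x80;
    } else if ((s[0] & 0xF0) == 0xE0) { /* 3-byte lead: 1110xxxx */
        cp = s[0] & 0x0F; len = 3; min = 0x800;
    } else if ((s[0] & 0xF8) == 0xF0) { /* 4-byte lead: 11110xxx */
        cp = s[0] & 0x07; len = 4; min = 0x10000;
    } else {
        return -1;                     /* stray continuation or bad lead */
    }

    if ((size_t)len > n)
        return -1;                     /* truncated sequence */

    for (i = 1; i < len; i++) {
        if ((s[i] & 0xC0) != 0x80)
            return -1;                 /* not a continuation byte */
        cp = (cp << 6) | (s[i] & 0x3F);
    }

    /* Reject overlong encodings, surrogates, and out-of-range values. */
    if (cp < min || cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF))
        return -1;

    *out = cp;
    return len;
}
```

A caller would loop over a buffer, advancing by the returned length.
Note that this covers only byte-to-code-point conversion, which is the
point made above: nothing here does Unicode processing beyond what
the C and POSIX multibyte interfaces already cover.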