From: He X
Newsgroups: gmane.linux.lib.musl.general
Subject: Re: Re: a bug in bindtextdomain() and strip '.UTF-8'
Date: Mon, 30 Jan 2017 00:14:49 +0800
References: <20170129133946.GT17692@port70.net> <20170129140747.GJ1533@brightrain.aerifal.cx> <20170129155507.GK1533@brightrain.aerifal.cx>
In-Reply-To: <20170129155507.GK1533@brightrain.aerifal.cx>
Reply-To: musl@lists.openwall.com
To: musl@lists.openwall.com

I can't wait. Can I work on this and prepare a patch for these issues if
Masanori Ogino is busy at the moment? I'd like to see them fixed in the
official musl repo as soon as possible.

Rejecting non-UTF-8 locales might also be the right choice, since
'LANG=zh_CN.GBK ./a.out' (where a.out calls setlocale(LC_*, "")) segfaulted
for me with glibc.

Wang He

2017-01-29 23:55 GMT+08:00 Rich Felker :

> On Sun, Jan 29, 2017 at 10:48:34PM +0800, He X wrote:
> > 1. agreed with rich, nplurals is important too; compiling the kernel,
> > cannot update the patch
> > 2. no other ways, musl will use generic config 100%, and then the
> > exception, the run time error is hardcoded there; but i doubt if this
> > really breaks binaries, the function is only called by libstdc++ itself.
> > you cant only update the config, but does not update libstdc++. libstdc++
> > exported the same abi for common binaries, wont break most dynamic-loaded
> > binary in my view.
> >
> > btw, with 'p-> to q->', 'strip .UTF-8'(these two in the first thread), and
> > these two patches, fcitx, chromium are working well.
> >
> > but there're some names like 'de_DE@euro', 'zh_CN.GBK', these should be
> > stripped, either, any good ideas?
>
> This has all been discussed before; see this email and others in the
> thread:
>
> http://www.openwall.com/lists/musl/2016/05/11/8
>
> Masanori Ogino was going to work on some follow-up research, testing,
> and/or implementation but didn't get around to it. I'm not aware of
> any newer findings that contradict the direction suggested in that
> thread.
>
> For your specific examples, de_DE@euro would be searched in de_DE,
> de@euro, and finally de; zh_CN.GBK would be invalid (non-UTF-8
> encodings not permitted) but it's not clear to me how it should be
> handled (rejection or rewriting at setlocale time, stripping .GBK at
> translation load time, or leaving .GBK there and letting translation
> fail).
>
> Rich
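
For readers following the thread, the fallback order Rich describes for de_DE@euro can be sketched in C roughly as below. This is only an illustration under stated assumptions, not musl's code: the function name locale_candidates, the 64-byte name limit, and the choice to try the full name before the fallbacks are all invented for this sketch. It strips a '.UTF-8' codeset and rejects any other codeset outright, which is just one of the handling options listed in the quoted message.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

#define LOCALE_NAME_MAX 64 /* hypothetical limit, chosen for this sketch */
#define MAX_CANDIDATES 4

/*
 * Hypothetical helper (not a musl or gettext API): split a name of the form
 * ll[_CC][.codeset][@modifier] and fill out[] with the names to try, most
 * specific first. Returns the number of candidates, or 0 if the name is
 * rejected (empty, too long, or carrying a codeset other than UTF-8).
 */
static int locale_candidates(const char *name,
                             char out[MAX_CANDIDATES][LOCALE_NAME_MAX])
{
    assert(name != NULL && out != NULL);

    size_t len = strlen(name);
    if (len == 0 || len >= LOCALE_NAME_MAX)
        return 0;

    /* Split off "@modifier"; mod keeps the leading '@'. */
    const char *at = strchr(name, '@');
    const char *end = at ? at : name + len;
    const char *mod = at ? at : "";

    /* Split off ".codeset"; assumption: only UTF-8 is accepted. */
    const char *dot = memchr(name, '.', (size_t)(end - name));
    if (dot) {
        size_t cslen = (size_t)(end - (dot + 1));
        if (cslen != 5 || memcmp(dot + 1, "UTF-8", 5) != 0)
            return 0;
        end = dot;
    }

    /* Split the language "ll" from the territory "_CC" (kept with its '_'). */
    const char *us = memchr(name, '_', (size_t)(end - name));
    int langlen = (int)((us ? us : end) - name);
    int terrlen = us ? (int)(end - us) : 0;
    if (langlen == 0)
        return 0;

    int n = 0;
    if (terrlen && *mod) /* ll_CC@mod */
        snprintf(out[n++], LOCALE_NAME_MAX, "%.*s%.*s%s",
                 langlen, name, terrlen, us, mod);
    if (terrlen)         /* ll_CC */
        snprintf(out[n++], LOCALE_NAME_MAX, "%.*s%.*s",
                 langlen, name, terrlen, us);
    if (*mod)            /* ll@mod */
        snprintf(out[n++], LOCALE_NAME_MAX, "%.*s%s", langlen, name, mod);
    /* ll */
    snprintf(out[n++], LOCALE_NAME_MAX, "%.*s", langlen, name);
    return n;
}
```

A caller would declare char out[MAX_CANDIDATES][LOCALE_NAME_MAX], call locale_candidates(name, out), and try each returned name in order when looking for a translation file. Returning 0 for zh_CN.GBK models rejection only; stripping '.GBK' at load time instead would be a small change to the codeset branch.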