From mboxrd@z Thu Jan 1 00:00:00 1970
From: Eleftherios Kritikos
Newsgroups: gmane.linux.lib.musl.general
Subject: Re: [PATCH] Update ctype data to Unicode 12.1.0
Date: Mon, 14 Oct 2019 14:51:08 +0100
References: <20191012212742.29880-1-el01049@gmail.com>
 <20191012223947.GH16318@brightrain.aerifal.cx>
 <20191014130709.GL16318@brightrain.aerifal.cx>
In-Reply-To: <20191014130709.GL16318@brightrain.aerifal.cx>
Reply-To: musl@lists.openwall.com
To: Rich Felker
Cc: musl@lists.openwall.com
Mime-Version: 1.0
Content-Type: multipart/alternative; boundary="00000000000092aa3a0594df2956"

--00000000000092aa3a0594df2956
Content-Type: text/plain; charset="UTF-8"

Based on what I read here:

https://stackoverflow.com/questions/12590255/what-does-stdc-iso-10646-exactly-mean

and here:

https://standards.iso.org/ittf/PubliclyAvailableStandards/index.html

it seems that the latest ISO/IEC 10646 standard that most closely
matches Unicode 12.1.0 is ISO/IEC 10646:2017/Amd 2:2019 (fifth edition
2017, amendment 2).

According to the first page of the document here:

https://standards.iso.org/ittf/PubliclyAvailableStandards/c073773_ISO_IEC_10646_2017_Amd_2_2019%20(E).zip

this amendment is dated 2019-06. So I would guess that the correct
value should be:

```
#define __STDC_ISO_10646__ 201906L
```

Take all this with a grain of salt, as this is the first time I am
looking at ISO/IEC 10646.

Thanks for looking into this!

Regards,
Lefteris

On Mon, Oct 14, 2019 at 2:07 PM Rich Felker <dalias@libc.org> wrote:

> On Sat, Oct 12, 2019 at 11:56:44PM +0100, Eleftherios Kritikos wrote:
> > Yes. I also created a merge request for musl-chartable-tools
> > https://github.com/richfelker/musl-chartable-tools/pull/2
>
> Thanks. stdc-predef.h also needs to be updated with a new value for
> __STDC_ISO_10646__. Do you know the right yyyymm value it should have
> for this version of Unicode?
>
> Rich
>
>
> > On Sat, 12 Oct 2019, 11:40 pm Rich Felker, <dalias@libc.org> wrote:
> >
> > > On Sat, Oct 12, 2019 at 10:27:42PM +0100, u_quark wrote:
> > > > ---
> > > >  src/ctype/alpha.h      | 159 +++++++++++++++++++++-------------------
> > > >  src/ctype/nonspacing.h |  88 ++++++++++++-----------
> > > >  src/ctype/punct.h      | 160 ++++++++++++++++++++++-------------------
> > > >  src/ctype/wide.h       |  26 +++----
> > > >  4 files changed, 232 insertions(+), 201 deletions(-)
> > >
> > > Is this done just by dropping the new Unicode files into
> > > musl-chartable-tools and running make?
> > >
> > > Rich
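
P.S. To make the yyyymm convention for the proposed value concrete, here is a minimal sketch of how a program might read __STDC_ISO_10646__ and split it into year and month. The helper names (iso10646_level, iso10646_year, iso10646_month, format_iso10646) are hypothetical and only for illustration; they are not part of musl or of any standard header.

```c
#include <stdio.h>

/* Hypothetical helpers: split a yyyymm value such as 201906L. */
int iso10646_year(long yyyymm)  { return (int)(yyyymm / 100); }
int iso10646_month(long yyyymm) { return (int)(yyyymm % 100); }

/* Returns the value of __STDC_ISO_10646__, or 0 if the
 * implementation does not define the macro. */
long iso10646_level(void)
{
#ifdef __STDC_ISO_10646__
    return __STDC_ISO_10646__;
#else
    return 0L;
#endif
}

/* Formats a yyyymm value as "yyyy-mm" into buf (at most n bytes,
 * including the terminating NUL); returns what snprintf returns. */
int format_iso10646(long yyyymm, char *buf, size_t n)
{
    return snprintf(buf, n, "%04d-%02d",
                    iso10646_year(yyyymm), iso10646_month(yyyymm));
}
```

With the value suggested above, format_iso10646(201906L, buf, sizeof buf) would write "2019-06" into buf. What iso10646_level() returns depends on the libc and compiler in use.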

--00000000000092aa3a0594df2956--