From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.org/gmane.linux.lib.musl.general/11920 Path: news.gmane.org!.POSTED!not-for-mail From: Reini Urban Newsgroups: gmane.linux.lib.musl.general Subject: Re: [PATCH] towupper/towlower: Update to Unicode 9.0 Date: Wed, 13 Sep 2017 12:05:19 +0200 Message-ID: <13F34D7B-8E99-483A-A5F5-F139D0D906B9@cpan.org> References: Reply-To: musl@lists.openwall.com NNTP-Posting-Host: blaine.gmane.org Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\)) Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Trace: blaine.gmane.org 1505297139 29080 195.159.176.226 (13 Sep 2017 10:05:39 GMT) X-Complaints-To: usenet@blaine.gmane.org NNTP-Posting-Date: Wed, 13 Sep 2017 10:05:39 +0000 (UTC) To: musl@lists.openwall.com Original-X-From: musl-return-11934-gllmg-musl=m.gmane.org@lists.openwall.com Wed Sep 13 12:05:31 2017 Return-path: Envelope-to: gllmg-musl@m.gmane.org Original-Received: from mother.openwall.net ([195.42.179.200]) by blaine.gmane.org with smtp (Exim 4.84_2) (envelope-from ) id 1ds4Xz-0007NK-19 for gllmg-musl@m.gmane.org; Wed, 13 Sep 2017 12:05:31 +0200 Original-Received: (qmail 26177 invoked by uid 550); 13 Sep 2017 10:05:35 -0000 Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-ID: Original-Received: (qmail 26140 invoked from network); 13 Sep 2017 10:05:34 -0000 DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20161025; h=sender:mime-version:subject:from:in-reply-to:date :content-transfer-encoding:message-id:references:to; bh=j3EvB5Wyv1oHTVKQ5+hpL2Fo0kXj3E7QJgygUwXaNqE=; b=TG5OkdGfj8PRIQP1Wh1wwLMg6uv2f4lGsu7dlFFPr81AIYDuYFaU9maTXTTf6j71EE MBnPkWomxswlhGZA82n4ySoD2XAVfRO6UQMl9GYF1mSePNnJ+EAPi0T6oDb4X7ufCo0e ySs7IM34BpM4QNrNlZdp3WbLw6dUmh+dzrhlJHe3bAFjQW2ACp/naV5o7Dxux43PD/hd AijWIAf1uSVAD95ngAgdLNsKOyCxV1L3cKgXgfThEnR+eiXC8+hAByr6gCGTfJoXvPp4 4+LkvRjOc367cWuUbftBx7TnXXKcjUqjmve++feqaO67NrynhSaMCKlNwtSUrNiDp0Uw FHvQ== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20161025; h=x-gm-message-state:sender:mime-version:subject:from:in-reply-to :date:content-transfer-encoding:message-id:references:to; bh=j3EvB5Wyv1oHTVKQ5+hpL2Fo0kXj3E7QJgygUwXaNqE=; b=Ut6ZOucGymZvgFGliP64ig5pPqD736z6UZ19d88Xnu9vEwO0+aNkPVTxCi0QHgwrh7 ts28v+aNy4QS263OKcFtwuTrD4z0ZLcSWaxCKafXjvSmbsaAOn6wYwqQyVqahRR9444a j3lqOHIW3GCtzLiORJCN3CEWk8WxUADY5gd3eqiQ62mQBDYRQlsawCSN4TMgT1BsbeOA ygEgfvIVM1It1uEep+9YDpYhRCpV56ie3wlAGSrrp509PZwJIJoR1Keca9Szqzd1Dn1L 3pbhgYySxNsCDJRKIIeswQv3F7KtXjcIliC0pz/6CLUdofpHrjf5Dm65yp4zDPhk0QtI IN/Q== X-Gm-Message-State: AHPjjUhp3aPS4h1kjQ+99ttCM3zCM7wDmwqOkQZ16muTFRqeVGn6Bdmr FN4og72v/gKn+QsvaRQ= X-Google-Smtp-Source: ADKCNb4y8IVd4MSKp6WC+qNZvVWXNWIVWXXOnJrussLUhcWii+qYUTW5a/+FaUqNG+B4la6Dxat2PQ== X-Received: by 10.80.144.42 with SMTP id b39mr6564020eda.204.1505297122557; Wed, 13 Sep 2017 03:05:22 -0700 (PDT) Original-Sender: Reini Urban In-Reply-To: X-Mailer: Apple Mail (2.3124) Xref: news.gmane.org gmane.linux.lib.musl.general:11920 Archived-At: Wait a bit with that. I think I found some more Unicode 9.0 issues with = the tables, and I=E2=80=99ve found a huge performance opportunity by sorting the 3 = tables (mostly pairs),=20 and break the loops earlier. This should come close to glibc table performance then, without the huge = memory costs they have. I=E2=80=99ll write a perl regression testing script not to miss any more = mappings, and maybe improve the current musl logic. This will need 1-2 days. I=E2=80=99ll also use it for cperl then. Reini Urban rurban@cpan.org > On Sep 13, 2017, at 10:25 AM, Reini Urban wrote: >=20 > taken from my safeclib (MIT licensed) and cross-checked with the perl = unicode tables >=20 > <0001-towupper-towlower-Update-to-Unicode-9.0.patch>