From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on starla X-Spam-Level: X-Spam-Status: No, score=-1.1 required=3.0 tests=DKIM_SIGNED,DKIM_VALID, DKIM_VALID_AU,MAILING_LIST_MULTI,SPF_HELO_PASS,SPF_PASS, T_SCC_BODY_TEXT_LINE autolearn=ham autolearn_force=no version=3.4.6 Received: from nue.mailmanlists.eu (nue.mailmanlists.eu [IPv6:2a01:4f8:1c0c:6b10::1]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (No client certificate requested) by dcvr.yhbt.net (Postfix) with ESMTPS id 87FBE1F406 for ; Tue, 26 Dec 2023 06:52:30 +0000 (UTC) Authentication-Results: dcvr.yhbt.net; dkim=pass (1024-bit key; secure) header.d=ml.ruby-lang.org header.i=@ml.ruby-lang.org header.a=rsa-sha256 header.s=mail header.b=TBDSgBb8; dkim=fail reason="signature verification failed" (2048-bit key; unprotected) header.d=ruby-lang.org header.i=@ruby-lang.org header.a=rsa-sha256 header.s=s1 header.b=Xp+5Fcd8; dkim-atps=neutral Received: from nue.mailmanlists.eu (localhost [127.0.0.1]) by nue.mailmanlists.eu (Postfix) with ESMTP id 21512819C5; Tue, 26 Dec 2023 06:52:23 +0000 (UTC) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/simple; d=ml.ruby-lang.org; s=mail; t=1703573543; bh=Phrgyl+2TLqIdTujHUEiflzOVXJJw0RO7DoGbcy/yEM=; h=Date:References:To:Reply-To:Subject:List-Id:List-Archive: List-Help:List-Owner:List-Post:List-Subscribe:List-Unsubscribe: From:Cc:From; b=TBDSgBb8l2jm2f1CW8y9ouT58mThP4EzSGkb0lV6Ld1qCZJ4vL4/ZihwErBcT/lYE nk0CfhIPD09NsQrF7H3p9IojciJSi2pe7EYSUfxKEl4NLSKugtHuv7xmABk/CH2UJO wN5cTYM2VK29YLcNuDHyXIhCHT7kCfhsl59x4sh8= Received: from csnrwnwx.outbound-mail.sendgrid.net (csnrwnwx.outbound-mail.sendgrid.net [198.37.146.154]) by nue.mailmanlists.eu (Postfix) with ESMTPS id 0B6A9819C1 for ; Tue, 26 Dec 2023 06:52:18 +0000 (UTC) Authentication-Results: nue.mailmanlists.eu; dkim=pass (2048-bit key; unprotected) header.d=ruby-lang.org header.i=@ruby-lang.org header.a=rsa-sha256 header.s=s1 header.b=Xp+5Fcd8; dkim-atps=neutral DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=ruby-lang.org; h=from:references:subject:mime-version:content-type: content-transfer-encoding:list-id:to:cc:content-type:from:subject:to; s=s1; bh=579bkJcrGhvJHhxbs/s4Sf8MGjGoDfYKnpHJrPx1kEI=; b=Xp+5Fcd8v2T3HIeMGQIkR89AV9+TklfdsVp4g0IjBUrQHBJw2Ni37fx9IAdNu9zuzX/G KtRPNukVpWCdyXBdhhi7iGGon6ZDHTjijAWOTx9KPByyoi5Gmvgmhlg8scu5W0d+vF2L0G r39KdgUFqOSWnpVz8JdIZjiR3BQag4TfjQfZWKwnAt4WEb5nH486/guaLu8L9lKVAKU05Z s7Yi5nEcGZJgAVlaCn9EH1wBwOL+slKdPodxOQwj0JKGcGddbZ6Hcckmg8z7MRru8m7HPU v5yPNz0h02Hsl621geoGtBaXLSIw5efsCVIyOu2dOckXw2JQb6jfHJrhK8gX3IXg== Received: by filterdrecv-656b5b4c75-nw64g with SMTP id filterdrecv-656b5b4c75-nw64g-1-658A7820-D 2023-12-26 06:52:16.772494545 +0000 UTC m=+6005539.643421555 Received: from herokuapp.com (unknown) by geopod-ismtpd-3 (SG) with ESMTP id ZwycH9QZR16SpNEL9N7zog for ; Tue, 26 Dec 2023 06:52:16.737 +0000 (UTC) Date: Tue, 26 Dec 2023 06:52:16 +0000 (UTC) Message-ID: References: Mime-Version: 1.0 X-Redmine-Project: ruby-master X-Redmine-Issue-Tracker: Feature X-Redmine-Issue-Id: 19908 X-Redmine-Issue-Author: nobu X-Redmine-Issue-Assignee: duerst X-Redmine-Sender: duerst X-Mailer: Redmine X-Redmine-Host: bugs.ruby-lang.org X-Redmine-Site: Ruby Issue Tracking System X-Auto-Response-Suppress: All Auto-Submitted: auto-generated X-Redmine-MailingListIntegration-Message-Ids: 92430 X-SG-EID: =?us-ascii?Q?sZFLhNnqjcW9Ht8ByfkgOQbWPVETXXFxLuf0UPwpQ=2FbXrx066LnwciY0taRhC1?= =?us-ascii?Q?++A6DTJJ4lt=2FzJkwBQUmmbJXhB+C4nQ033MjSGj?= =?us-ascii?Q?je1hQtW7+G3RM=2FRzJHxTQPC5X6W0Kt99u4R6ddV?= =?us-ascii?Q?GzgX771kuyrEA=2FxBfkcPnb7clupiOHDa3zDGeW8?= =?us-ascii?Q?HyyLicl3KFRYKN4KCcLHT275+aKJnQb+h35ACuJ?= =?us-ascii?Q?imJWeReR5YkUOgPPbn1PdXRDhfM7dDkzpqjvFCZ?= =?us-ascii?Q?B1BkrEgR61OrirFb84tag=3D=3D?= To: ruby-core@ml.ruby-lang.org X-Entity-ID: b/2+PoftWZ6GuOu3b0IycA== Message-ID-Hash: AAQWNRW3YQDFMQMZTNH47PGDSJJDUCVR X-Message-ID-Hash: AAQWNRW3YQDFMQMZTNH47PGDSJJDUCVR X-MailFrom: bounces+313651-b711-ruby-core=ml.ruby-lang.org@em5188.ruby-lang.org X-Mailman-Rule-Misses: dmarc-mitigation; no-senders; approved; emergency; loop; banned-address; member-moderation; nonmember-moderation; administrivia; implicit-dest; max-recipients; max-size; news-moderation; no-subject; digests; suspicious-header X-Mailman-Version: 3.3.3 Precedence: list Reply-To: Ruby developers Subject: [ruby-core:115899] [Ruby master Feature#19908] Update to Unicode 15.1 List-Id: Ruby developers Archived-At: List-Archive: List-Help: List-Owner: List-Post: List-Subscribe: List-Unsubscribe: From: duerst via ruby-core Cc: duerst Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Issue #19908 has been updated by duerst (Martin D=FCrst). There is a serious issue than just whether using an '_' or an '=3D' in the = property: Unicode 15.1 makes some serious changes to grapheme clusters. Our implementation (function 'node_extended_grapheme_cluster' in regparse.c= ) is based on Unicode 11.0, in particular https://www.unicode.org/reports/t= r29/tr29-33.html#Grapheme_Cluster_Boundaries. This is quite a bit different= from the current version at https://www.unicode.org/reports/tr29/tr29-43.h= tml#Grapheme_Cluster_Boundaries. One major difference is that for Unicode 1= 1.0, there was a regular expression for grapheme clusters, which I just imp= lemented in the above function. Unicode 15.1 just says that it's possible t= o use a regular expression, but doesn't give this regular expression. >From reading through https://www.unicode.org/versions/Unicode15.1.0/#Migrat= ion, that's the main issue affecting Ruby. ---------------------------------------- Feature #19908: Update to Unicode 15.1 https://bugs.ruby-lang.org/issues/19908#change-105854 * Author: nobu (Nobuyoshi Nakada) * Status: Assigned * Priority: Normal * Assignee: duerst (Martin D=FCrst) ---------------------------------------- The Unicode 15.1 is released. The current enc-unicode.rb seems to fail because of `Indic_Conjunct_break` = properties with values. I'm not sure how these properties should be handled well. `/\p{InCB_Liner}/` or `/\p{InCB=3DLiner}/` as the comments in that file? https://github.com/nobu/ruby/tree/unicode-15.1 is the former. --=20 https://bugs.ruby-lang.org/ ______________________________________________ ruby-core mailing list -- ruby-core@ml.ruby-lang.org To unsubscribe send an email to ruby-core-leave@ml.ruby-lang.org ruby-core info -- https://ml.ruby-lang.org/mailman3/postorius/lists/ruby-c= ore.ml.ruby-lang.org/