From mboxrd@z Thu Jan 1 00:00:00 1970 X-Spam-Checker-Version: SpamAssassin 3.4.4 (2020-01-24) on inbox.vuxu.org X-Spam-Level: X-Spam-Status: No, score=-3.3 required=5.0 tests=MAILING_LIST_MULTI, RCVD_IN_DNSWL_MED,RCVD_IN_MSPIKE_H3,RCVD_IN_MSPIKE_WL autolearn=ham autolearn_force=no version=3.4.4 Received: (qmail 27177 invoked from network); 30 Jul 2020 00:05:43 -0000 Received: from mother.openwall.net (195.42.179.200) by inbox.vuxu.org with ESMTPUTF8; 30 Jul 2020 00:05:43 -0000 Received: (qmail 24078 invoked by uid 550); 30 Jul 2020 00:05:41 -0000 Mailing-List: contact musl-help@lists.openwall.com; run by ezmlm Precedence: bulk List-Post: List-Help: List-Unsubscribe: List-Subscribe: List-ID: Reply-To: musl@lists.openwall.com Received: (qmail 24060 invoked from network); 30 Jul 2020 00:05:40 -0000 Date: Wed, 29 Jul 2020 20:05:28 -0400 From: Rich Felker To: "A. Wilcox" Cc: bruno@clisp.org, bug-bison@gnu.org, musl@lists.openwall.com Message-ID: <20200730000527.GS6949@brightrain.aerifal.cx> References: MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.5.21 (2010-09-15) Subject: Re: [musl] Building Bison 3.7 with musl (was Re: portability issues with unicodeio) On Wed, Jul 29, 2020 at 06:23:19PM -0500, A. Wilcox wrote: > Seeing some weird behaviour here building Bison 3.7 on musl libc. > > Something seems to be "intelligent" enough to know that \u2022 is a > bullet character, and is replacing it with "*" instead of ".", causing > all the tests to fail: > > awilcox on gwyn [17] bison: LC_ALL=C /bin/printf '\u2022\n' | od -t x1 > 0000000 2a 0a > 0000002 I don't think the '*' has anything to do with it being a bullet character. It's just the implementation-defined replacement character musl's iconv uses. I would guess the code in bison and coreutils printf is assuming the non-conforming glibc behavior for iconv of returning an error if a character from the input is not exactly representable in the output, rather than making replacements and returning the number of inexact conversions made. Rich