mailing list of musl libc
 help / color / mirror / code / Atom feed
* [musl] iconv EUC-JP support is incomplete
@ 2025-01-15 17:16 Nick Wellnhofer
  2025-01-18 11:34 ` Rich Felker
  0 siblings, 1 reply; 3+ messages in thread
From: Nick Wellnhofer @ 2025-01-15 17:16 UTC (permalink / raw)
  To: musl

Hello,

I ran into this after getting reports of libxml2 test failures on musl. EUC-JP also supports JIS X 0212 as third code set with characters encoded as three-byte sequences starting with 0x8F. It seems that musl's iconv implementation doesn't support these characters.

Here's the work-around in our test suite: https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5

Nick


^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [musl] iconv EUC-JP support is incomplete
  2025-01-15 17:16 [musl] iconv EUC-JP support is incomplete Nick Wellnhofer
@ 2025-01-18 11:34 ` Rich Felker
  2025-04-14 11:03   ` Szabolcs Nagy
  0 siblings, 1 reply; 3+ messages in thread
From: Rich Felker @ 2025-01-18 11:34 UTC (permalink / raw)
  To: Nick Wellnhofer; +Cc: musl

On Wed, Jan 15, 2025 at 06:16:00PM +0100, Nick Wellnhofer wrote:
> Hello,
> 
> I ran into this after getting reports of libxml2 test failures on
> musl. EUC-JP also supports JIS X 0212 as third code set with
> characters encoded as three-byte sequences starting with 0x8F. It
> seems that musl's iconv implementation doesn't support these
> characters.
> 
> Here's the work-around in our test suite:
> https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5

Thanks. Do you have any references for EUC-JP being expected to
include these extensions? Does the WHATWG definition specify them as
supported?

Rich

^ permalink raw reply	[flat|nested] 3+ messages in thread

* Re: [musl] iconv EUC-JP support is incomplete
  2025-01-18 11:34 ` Rich Felker
@ 2025-04-14 11:03   ` Szabolcs Nagy
  0 siblings, 0 replies; 3+ messages in thread
From: Szabolcs Nagy @ 2025-04-14 11:03 UTC (permalink / raw)
  To: Rich Felker; +Cc: Nick Wellnhofer, musl

* Rich Felker <dalias@libc.org> [2025-01-18 06:34:31 -0500]:

> On Wed, Jan 15, 2025 at 06:16:00PM +0100, Nick Wellnhofer wrote:
> > Hello,
> > 
> > I ran into this after getting reports of libxml2 test failures on
> > musl. EUC-JP also supports JIS X 0212 as third code set with
> > characters encoded as three-byte sequences starting with 0x8F. It
> > seems that musl's iconv implementation doesn't support these
> > characters.
> > 
> > Here's the work-around in our test suite:
> > https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5
> 
> Thanks. Do you have any references for EUC-JP being expected to
> include these extensions? Does the WHATWG definition specify them as
> supported?

fwiw i see 0x8F specified in

https://encoding.spec.whatwg.org/#euc-jp

the logic seems return a single code point
for a 0x8F, 0xA1-0xFE, 0xA1-0xFE sequence.

^ permalink raw reply	[flat|nested] 3+ messages in thread

end of thread, other threads:[~2025-04-14 11:03 UTC | newest]

Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-01-15 17:16 [musl] iconv EUC-JP support is incomplete Nick Wellnhofer
2025-01-18 11:34 ` Rich Felker
2025-04-14 11:03   ` Szabolcs Nagy

Code repositories for project(s) associated with this public inbox

	https://git.vuxu.org/mirror/musl/

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).