* [musl] iconv EUC-JP support is incomplete
@ 2025-01-15 17:16 Nick Wellnhofer
2025-01-18 11:34 ` Rich Felker
0 siblings, 1 reply; 3+ messages in thread
From: Nick Wellnhofer @ 2025-01-15 17:16 UTC (permalink / raw)
To: musl
Hello,
I ran into this after getting reports of libxml2 test failures on musl. EUC-JP also supports JIS X 0212 as third code set with characters encoded as three-byte sequences starting with 0x8F. It seems that musl's iconv implementation doesn't support these characters.
Here's the work-around in our test suite: https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5
Nick
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [musl] iconv EUC-JP support is incomplete
2025-01-15 17:16 [musl] iconv EUC-JP support is incomplete Nick Wellnhofer
@ 2025-01-18 11:34 ` Rich Felker
2025-04-14 11:03 ` Szabolcs Nagy
0 siblings, 1 reply; 3+ messages in thread
From: Rich Felker @ 2025-01-18 11:34 UTC (permalink / raw)
To: Nick Wellnhofer; +Cc: musl
On Wed, Jan 15, 2025 at 06:16:00PM +0100, Nick Wellnhofer wrote:
> Hello,
>
> I ran into this after getting reports of libxml2 test failures on
> musl. EUC-JP also supports JIS X 0212 as third code set with
> characters encoded as three-byte sequences starting with 0x8F. It
> seems that musl's iconv implementation doesn't support these
> characters.
>
> Here's the work-around in our test suite:
> https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5
Thanks. Do you have any references for EUC-JP being expected to
include these extensions? Does the WHATWG definition specify them as
supported?
Rich
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: [musl] iconv EUC-JP support is incomplete
2025-01-18 11:34 ` Rich Felker
@ 2025-04-14 11:03 ` Szabolcs Nagy
0 siblings, 0 replies; 3+ messages in thread
From: Szabolcs Nagy @ 2025-04-14 11:03 UTC (permalink / raw)
To: Rich Felker; +Cc: Nick Wellnhofer, musl
* Rich Felker <dalias@libc.org> [2025-01-18 06:34:31 -0500]:
> On Wed, Jan 15, 2025 at 06:16:00PM +0100, Nick Wellnhofer wrote:
> > Hello,
> >
> > I ran into this after getting reports of libxml2 test failures on
> > musl. EUC-JP also supports JIS X 0212 as third code set with
> > characters encoded as three-byte sequences starting with 0x8F. It
> > seems that musl's iconv implementation doesn't support these
> > characters.
> >
> > Here's the work-around in our test suite:
> > https://gitlab.gnome.org/GNOME/libxml2/-/commit/afe8f8451459750d330e8bca9ee3b02aeadeebb5
>
> Thanks. Do you have any references for EUC-JP being expected to
> include these extensions? Does the WHATWG definition specify them as
> supported?
fwiw i see 0x8F specified in
https://encoding.spec.whatwg.org/#euc-jp
the logic seems return a single code point
for a 0x8F, 0xA1-0xFE, 0xA1-0xFE sequence.
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~2025-04-14 11:03 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2025-01-15 17:16 [musl] iconv EUC-JP support is incomplete Nick Wellnhofer
2025-01-18 11:34 ` Rich Felker
2025-04-14 11:03 ` Szabolcs Nagy
Code repositories for project(s) associated with this public inbox
https://git.vuxu.org/mirror/musl/
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).