* Charset niggles
@ 1999-02-02 23:55 Lars Magne Ingebrigtsen
1999-02-03 0:42 ` Stephen J. Turnbull
0 siblings, 1 reply; 3+ messages in thread
From: Lars Magne Ingebrigtsen @ 1999-02-02 23:55 UTC (permalink / raw)
Cc: xemacs-mule
(I'm now playing in Emacs 20.3, but I'm guessing this applies to all
MULE Emacsen.)
On Amharic and Tigrana, char-charset says `ethiopic', and the
prefered-coding-system is reported as `iso-2022-jp'. However, when I
decode what I get, I don't get anything; just escape sequences. But
if I decode what I got as `iso-2022-jp-2', I get back the Amharic and
Tigrana.
On the other hand, the Japanese in the HELLO file can be encoded and
decoded as `iso-2022-jp' OK.
Er... Does this mean that it's just a faulty prefered-coding-system
in Emacs 20.3 for `ethiopic'?
--
(domestic pets only, the antidote for overdose, milk.)
larsi@ifi.uio.no * Lars Magne Ingebrigtsen
^ permalink raw reply [flat|nested] 3+ messages in thread
* Charset niggles
1999-02-02 23:55 Charset niggles Lars Magne Ingebrigtsen
@ 1999-02-03 0:42 ` Stephen J. Turnbull
1999-02-03 1:18 ` Lars Magne Ingebrigtsen
0 siblings, 1 reply; 3+ messages in thread
From: Stephen J. Turnbull @ 1999-02-03 0:42 UTC (permalink / raw)
Cc: ding, xemacs-mule
>>>>> "Lars" == Lars Magne Ingebrigtsen <larsi@ifi.uio.no> writes:
Lars> (I'm now playing in Emacs 20.3, but I'm guessing this
Lars> applies to all MULE Emacsen.)
I don't think Ethiopic is supported before Emacs 20.x, and XEmacs
includes the code but does not load it by default (due to a bug
probably in some X servers). With no Ethiopic speakers on board,
XEmacs can hardly say it's "supported" :-( but we'd love to get bug
reports (and even more so ecstatic reports from happy users :-).
Lars> On Amharic and Tigrana, char-charset says `ethiopic', and
Lars> the prefered-coding-system is reported as `iso-2022-jp'.
Lars> However, when I decode what I get, I don't get anything;
Lars> just escape sequences. But if I decode what I got as
Lars> `iso-2022-jp-2', I get back the Amharic and Tigrana.
Yes. I believe that iso-2022-jp-2 (aka iso-2022-cjk aka
iso-2022-int-1, if I recall correctly) is a very slight redesign of
iso-2022-jp to make sure that non-Japanese character sets are handled
correctly.
ITRC the history of the RFCs was that people "accidently" got good
results from using iso-2022-jp with non-Japanese character sets (not
surprising, that robustness was an important design consideration in
the ISO-2022 spec), and the -jp-2 version was specifically created
with this in mind. Possibly some recent Emacs tightened up the Mule
definition of -jp to "Japanese-only."
I wouldn't be surprised if practically every "unusual" character set
defaults to `iso-2022-jp' for that historical reason, and nobody's
systematically gone about cleaning it up yet.
Lars> Er... Does this mean that it's just a faulty
Lars> prefered-coding-system in Emacs 20.3 for `ethiopic'?
Yes. Look at the iso-2022-int* coding systems; that's more likely to
make sense to non-Japanese users, and should work as well.
--
University of Tsukuba Tennodai 1-1-1 Tsukuba 305-8573 JAPAN
Institute of Policy and Planning Sciences Tel/fax: +81 (298) 53-5091
__________________________________________________________________________
__________________________________________________________________________
What are those two straight lines for? "Free software rules."
^ permalink raw reply [flat|nested] 3+ messages in thread
* Re: Charset niggles
1999-02-03 0:42 ` Stephen J. Turnbull
@ 1999-02-03 1:18 ` Lars Magne Ingebrigtsen
0 siblings, 0 replies; 3+ messages in thread
From: Lars Magne Ingebrigtsen @ 1999-02-03 1:18 UTC (permalink / raw)
Cc: xemacs-mule
"Stephen J. Turnbull" <turnbull@sk.tsukuba.ac.jp> writes:
> Lars> Er... Does this mean that it's just a faulty
> Lars> prefered-coding-system in Emacs 20.3 for `ethiopic'?
>
> Yes. Look at the iso-2022-int* coding systems; that's more likely to
> make sense to non-Japanese users, and should work as well.
Thanks; I've now sent a patch to Stallman for defaulting Ethiopic to
iso-2022-7bit, which seems to do the trick.
(Er -- shouldn't that be "Ethiopian"?)
--
(domestic pets only, the antidote for overdose, milk.)
larsi@gnus.org * Lars Magne Ingebrigtsen
^ permalink raw reply [flat|nested] 3+ messages in thread
end of thread, other threads:[~1999-02-03 1:18 UTC | newest]
Thread overview: 3+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
1999-02-02 23:55 Charset niggles Lars Magne Ingebrigtsen
1999-02-03 0:42 ` Stephen J. Turnbull
1999-02-03 1:18 ` Lars Magne Ingebrigtsen
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).