caml-list - the Caml user's mailing list
 help / color / mirror / Atom feed
From: skaller <skaller@maxtal.com.au>
To: Francis Dupont <Francis.Dupont@inria.fr>
Cc: STARYNKEVITCH Basile <Basile.Starynkevitch@cea.fr>, caml-list@inria.fr
Subject: Re: localization, internationalization and Caml
Date: Sun, 17 Oct 1999 21:27:09 +1000	[thread overview]
Message-ID: <3809B28D.F0300290@maxtal.com.au> (raw)
In-Reply-To: <199910150826.KAA12903@givry.inria.fr>

Francis Dupont wrote:
> 
>  In your previous mail you wrote:
> 
>         The current 'support' for 8 bit characters in ocaml should be
>    deprecated immediately. It is an extremely bad thing to have, since
>    Latin-1 et al are archaic 8 bit standards incompatible with the
>    international standard for ISO10646 communication, namely
>    the UTF-8 encoding.
> 
> => there is a rather strong opposition against UTF-8 in France
> because it is not a natural encoding (ie. if ASCII maps to ASCII
> it is not the case for ISO 8859-* characters, imagine a new UTF-X
> encoding maps ASCII to strange things and you'd be able to understand
> our concern).

	I do understand the concern, but the decision on
the International Standards has been made. The transition
for ISO 8859-x clients will involve some pain. Better to
start going through the pain now :-(

>    Yes, I know Latin-1 is useful now for French.
> 
> => it is more than useful, Latin-1 (soon ISO IS 8859-15) is necessary
> if you need really readable texts in French.

	No, what you mean is that with _current technology_
there is plenty of support for 8 bit characters, using code pages,
so that Latin-1 is well supported.

	For example, there are a lot of text editors that
accept 8 bit characters, and even permit switching code pages.
There are almost none that work with ISO10646 or unicode,
let alone accept UTF-8 encoding. (Yudit is the only one I know of).

>    The way forward may well be to provide an input filter to convert
>    Latin-1 (or any other encoding) to UTF8, and have ocaml process that.
> 
> => my problem is the output of the filter will be no more readable when
> I've put too much French in the program (in comments for instance).

	You will have no problem with the right tools: the difference
will be transparent. Of course, you will need the right tools.
For example, you will need a browser like Internet Explorer 5,
which processes UTF-8 encoding correctly.

	I agree that this is a problem, but supporting
Latin-1, or any other archaic standard, is not going
to help move forward. It is bad enough that most vendors
only support Unicode, which is a small, almost filled,
16 bit subset of the full 31 bit ISO-10646 Standard.
 
>    This requires almost no changes to the compiler: the design should
>    open the set of characters acceptable in identifiers, probably
>    to some subset of the set recommended in one of the ISO10646 related
>    documents; the other change required is to accept \uXXXX and \UXXXXXXXX
>    escapes in strings. String processing functions should generally
>    continue to be 8 bit [per octet]: full internationalisation of client
>    string handling functions is a very complex, non-trivial, task]
> 
> => I believe internationalization should not be done by countries
> where English is the only used language: this is at least awkward...

	I believe people with international concerns can work
together no matter what their native language. Some English
speakers may be concerned, some, like me, are somewhat
embarrased to be non-fluent in _any_ other language.
[I speak a smattering of high school German]

	However, Australia, where I live, has migrants
from all over the world and support for many languages
is an important issue here. Particularly Asian languages.
And ISO-8859-x is not much help there :-)

-- 
John Skaller, mailto:skaller@maxtal.com.au
1/10 Toxteth Rd Glebe NSW 2037 Australia
homepage: http://www.maxtal.com.au/~skaller
downloads: http://www.triode.net.au/~skaller




  reply	other threads:[~1999-10-18 14:14 UTC|newest]

Thread overview: 21+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
1999-10-13 12:12 STARYNKEVITCH Basile
1999-10-14 22:20 ` skaller
1999-10-15  8:26   ` Francis Dupont
1999-10-17 11:27     ` skaller [this message]
1999-10-17 15:54       ` Francis Dupont
1999-10-19 18:48         ` skaller
1999-10-15 13:53 Gerard Huet
1999-10-15 20:28 ` Gerd Stolpmann
1999-10-19 18:06   ` skaller
1999-10-20 21:05     ` Gerd Stolpmann
1999-10-21  4:42       ` skaller
1999-10-21 12:05       ` Matías Giovannini
1999-10-21 15:35         ` skaller
1999-10-21 16:27           ` Matías Giovannini
1999-10-21 16:36             ` skaller
1999-10-21 17:21               ` Matías Giovannini
1999-10-23  9:53               ` Benoit Deboursetty
1999-10-25 21:06                 ` Jan Skibinski
1999-10-26 18:02                 ` skaller
1999-10-17 14:29 ` Xavier Leroy
1999-10-19 18:36   ` skaller

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=3809B28D.F0300290@maxtal.com.au \
    --to=skaller@maxtal.com.au \
    --cc=Basile.Starynkevitch@cea.fr \
    --cc=Francis.Dupont@inria.fr \
    --cc=caml-list@inria.fr \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).