Gnus development mailing list
 help / color / mirror / Atom feed
* Using iso-8859-15 with Emacs21 & Gnus
@ 2001-10-27 21:50 Raymond Scholz
  2001-10-27 22:51 ` Florian Weimer
  0 siblings, 1 reply; 34+ messages in thread
From: Raymond Scholz @ 2001-10-27 21:50 UTC (permalink / raw)


Hi there!

I'm trying to migrate to iso-8859-15 until January 2002 (time of the €
becoming legal tender in some European countries).

That's what I've done so far:

* XFree 4.1.0 with iso-8859-15 fonts.
* Emacs 21.1
* Oort Gnus
* (set-language-environment "Latin-9")

My own messages are properly encoded in iso-8859-15.  But quoting
parts of other messages that are encoded iso-8859-1 causes Gnus to
create a multipart/mixed message with the quoted parts encoded in
iso-8859-1 and added parts in iso-8859-15.

I know that it's difficult or even impossible for Gnus to determine
whether iso-8859-1 would suffice to encode the whole message.  What I
would prefer is telling Gnus to encode the whole message in
iso-8859-15.  How?

Cheers, Ray
-- 
Hfr fgebat rapelcgvba!



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-10-27 21:50 Using iso-8859-15 with Emacs21 & Gnus Raymond Scholz
@ 2001-10-27 22:51 ` Florian Weimer
  2001-10-28 13:12   ` Raymond Scholz
  0 siblings, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-10-27 22:51 UTC (permalink / raw)


Raymond Scholz <ray-2001@zonix.de> writes:

> I know that it's difficult or even impossible for Gnus to determine
> whether iso-8859-1 would suffice to encode the whole message.  What I
> would prefer is telling Gnus to encode the whole message in
> iso-8859-15.  How?

Dave Love has posted a Lisp package for Latin unification to
gnu.emacs.sources.  This might solve this kind of problem.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-10-27 22:51 ` Florian Weimer
@ 2001-10-28 13:12   ` Raymond Scholz
  2001-11-01 11:26     ` Florian Weimer
  0 siblings, 1 reply; 34+ messages in thread
From: Raymond Scholz @ 2001-10-28 13:12 UTC (permalink / raw)



I forgot to mention that I didn't use Mule-UCS before.  With Mule-UCS
0.84 utf-8 messages are being created.

Florian Weimer <fw@deneb.enyo.de> writes:

> Dave Love has posted a Lisp package for Latin unification to
> gnu.emacs.sources.  This might solve this kind of problem.

Tried that but it makes no difference.  With or without Mule-UCS.  But
I may be ignorant of the usage of Dave's package.

Cheers, Ray
-- 
Real programmers confuse Christmas and Halloween because
DEC 25 = OCT 31!



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-10-28 13:12   ` Raymond Scholz
@ 2001-11-01 11:26     ` Florian Weimer
  2001-11-01 14:37       ` ShengHuo ZHU
  2001-11-01 17:51       ` news
  0 siblings, 2 replies; 34+ messages in thread
From: Florian Weimer @ 2001-11-01 11:26 UTC (permalink / raw)


Raymond Scholz <ray-2001@zonix.de> writes:

> I forgot to mention that I didn't use Mule-UCS before.  With Mule-UCS
> 0.84 utf-8 messages are being created.

Can vanilla Gnus 5.9.0 deal correctly with ISO-8859-15 messages?  Or
do you get multiple parts when you reply to such a message?

Anyway, the current behavior of CVS Gnus + Emacs 21 is not acceptable.
How shall we fix it?  Unify latin-iso8859-1 and latin-iso8859-15
character sets?  Extend the iso-latin-1 and iso-latin-9 coding systems
so that they can encode part of the other charset?  Extend the UTF-8
charset, to make it encode latin-iso8859-15 as well?

The GNU Emacs folks don't seem to be willing to do much work in this
direction because general Unicode support should arrive RSN.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 11:26     ` Florian Weimer
@ 2001-11-01 14:37       ` ShengHuo ZHU
  2001-11-01 21:06         ` Florian Weimer
  2001-11-01 21:14         ` Raymond Scholz
  2001-11-01 17:51       ` news
  1 sibling, 2 replies; 34+ messages in thread
From: ShengHuo ZHU @ 2001-11-01 14:37 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

> Raymond Scholz <ray-2001@zonix.de> writes:
>
>> I forgot to mention that I didn't use Mule-UCS before.  With Mule-UCS
>> 0.84 utf-8 messages are being created.
>
> Can vanilla Gnus 5.9.0 deal correctly with ISO-8859-15 messages?  Or
> do you get multiple parts when you reply to such a message?
>
> Anyway, the current behavior of CVS Gnus + Emacs 21 is not acceptable.
> How shall we fix it?  Unify latin-iso8859-1 and latin-iso8859-15
> character sets?  Extend the iso-latin-1 and iso-latin-9 coding systems
> so that they can encode part of the other charset?  Extend the UTF-8
> charset, to make it encode latin-iso8859-15 as well?

I've put some code into the Gnus CVS to handle this issue.  Please
try. BTW, which charsets are compatible with latin-iso8859-15?

ShengHuo



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 11:26     ` Florian Weimer
  2001-11-01 14:37       ` ShengHuo ZHU
@ 2001-11-01 17:51       ` news
  2001-11-01 21:06         ` Florian Weimer
  1 sibling, 1 reply; 34+ messages in thread
From: news @ 2001-11-01 17:51 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

> The GNU Emacs folks don't seem to be willing to do much work in this
> direction because general Unicode support should arrive RSN.

Meaning the next release of GNU Emacs?

Chris



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 14:37       ` ShengHuo ZHU
@ 2001-11-01 21:06         ` Florian Weimer
  2001-11-01 23:30           ` ShengHuo ZHU
  2001-11-01 21:14         ` Raymond Scholz
  1 sibling, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-11-01 21:06 UTC (permalink / raw)


[-- Attachment #1: Type: text/plain, Size: 550 bytes --]

ShengHuo ZHU <zsh@cs.rochester.edu> writes:

>> Anyway, the current behavior of CVS Gnus + Emacs 21 is not acceptable.
>> How shall we fix it?  Unify latin-iso8859-1 and latin-iso8859-15
>> character sets?  Extend the iso-latin-1 and iso-latin-9 coding systems
>> so that they can encode part of the other charset?  Extend the UTF-8
>> charset, to make it encode latin-iso8859-15 as well?
>
> I've put some code into the Gnus CVS to handle this issue.  Please
> try. 

It doesn't work, I'm afraid.  For example, ISO-8859-1 '¼' is changed

[-- Attachment #2: Type: text/plain, Size: 233 bytes --]

to ISO-8859-15 'Œ', which is clearly wrong.

> BTW, which charsets are compatible with latin-iso8859-15?

Strictly speaking, none of the other ISO 8859 ones.  But most
characters (not all) can be encoded using ISO-8859-1, too.

^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 17:51       ` news
@ 2001-11-01 21:06         ` Florian Weimer
  0 siblings, 0 replies; 34+ messages in thread
From: Florian Weimer @ 2001-11-01 21:06 UTC (permalink / raw)


(Chris Beggy ) news@kippona.com writes:

> Florian Weimer <fw@deneb.enyo.de> writes:
>
>> The GNU Emacs folks don't seem to be willing to do much work in this
>> direction because general Unicode support should arrive RSN.
>
> Meaning the next release of GNU Emacs?

I don't know.  Certainly, this isn't a 21.x issue.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 14:37       ` ShengHuo ZHU
  2001-11-01 21:06         ` Florian Weimer
@ 2001-11-01 21:14         ` Raymond Scholz
  2001-11-01 22:26           ` Florian Weimer
  1 sibling, 1 reply; 34+ messages in thread
From: Raymond Scholz @ 2001-11-01 21:14 UTC (permalink / raw)


ShengHuo ZHU <zsh@cs.rochester.edu> writes:

> I've put some code into the Gnus CVS to handle this issue.  Please
> try. 

No multipart mails anymore, just text/plain with iso-8859-15.  Great!

> BTW, which charsets are compatible with latin-iso8859-

,----[ man iso-8859-15 ]
| NAME
|        iso_8859-15  -  the  ISO  8859-15 character set encoded in
|        octal, decimal, and hexadecimal
| 
| DESCRIPTION
|        The ISO 8859 standard includes several 8-bit extensions to
|        the  ASCII  character  set  (also  known  as ISO 646-IRV).
|        Especially important is ISO 8859-1,  the  "Latin  Alphabet
|        No.  1",  which  has  become  widely  implemented  and may
|        already  be  seen   as   the   de-facto   standard   ASCII
|        replacement.   However,  it lacks the EURO symbol and does
|        not fully cover Finnish and  French.   ISO  8859-15  is  a
|        modification of ISO 8859-1 that covers these needs.
| 
|        ISO  8859-15  supports  the following languages: Albanian,
|        Basque, Breton, Catalan, Danish, Dutch, English, Estonian,
|        Faroese,   Finnish,  French,  Frisian,  Galician,  German,
|        Greenlandic,  Icelandic,  Irish  Gaelic,  Italian,  Latin,
|        Luxemburgish,   Norwegian,   Portuguese,   Rhaeto-Romanic,
|        Scottish Gaelic, Spanish, and Swedish.
`----

Note that some characters of iso-8859-1 changed their position in
iso-8859-15, so it is not fully backward compatible from this point of
view.

Cheers, Ray
-- 
rscholz:x:587:100:Raymond Scholz:/home/rscholz:/usr/bin/emacs



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 21:14         ` Raymond Scholz
@ 2001-11-01 22:26           ` Florian Weimer
  0 siblings, 0 replies; 34+ messages in thread
From: Florian Weimer @ 2001-11-01 22:26 UTC (permalink / raw)


Raymond Scholz <ray-2001@zonix.de> writes:

> Note that some characters of iso-8859-1 changed their position in
> iso-8859-15, so it is not fully backward compatible from this point of
> view.

Not quite, they where simply replaced:

-0xA4   0x00A4  #       CURRENCY SIGN
+0xA4   0x20AC  #       EURO SIGN
-0xA6   0x00A6  #       BROKEN BAR
+0xA6   0x0160  #       LATIN CAPITAL LETTER S WITH CARON
-0xA8   0x00A8  #       DIAERESIS
+0xA8   0x0161  #       LATIN SMALL LETTER S WITH CARON
-0xB4   0x00B4  #       ACUTE ACCENT
+0xB4   0x017D  #       LATIN CAPITAL LETTER Z WITH CARON
-0xB8   0x00B8  #       CEDILLA
+0xB8   0x017E  #       LATIN SMALL LETTER Z WITH CARON
-0xBC   0x00BC  #       VULGAR FRACTION ONE QUARTER
-0xBD   0x00BD  #       VULGAR FRACTION ONE HALF
-0xBE   0x00BE  #       VULGAR FRACTION THREE QUARTERS
+0xBC   0x0152  #       LATIN CAPITAL LIGATURE OE
+0xBD   0x0153  #       LATIN SMALL LIGATURE OE
+0xBE   0x0178  #       LATIN CAPITAL LETTER Y WITH DIAERESIS



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 21:06         ` Florian Weimer
@ 2001-11-01 23:30           ` ShengHuo ZHU
  2001-11-02 19:36             ` Florian Weimer
  0 siblings, 1 reply; 34+ messages in thread
From: ShengHuo ZHU @ 2001-11-01 23:30 UTC (permalink / raw)


[-- Attachment #1: Type: text/plain, Size: 595 bytes --]

Florian Weimer <fw@deneb.enyo.de> writes:

> ShengHuo ZHU <zsh@cs.rochester.edu> writes:
>
>>> Anyway, the current behavior of CVS Gnus + Emacs 21 is not acceptable.
>>> How shall we fix it?  Unify latin-iso8859-1 and latin-iso8859-15
>>> character sets?  Extend the iso-latin-1 and iso-latin-9 coding systems
>>> so that they can encode part of the other charset?  Extend the UTF-8
>>> charset, to make it encode latin-iso8859-15 as well?
>>
>> I've put some code into the Gnus CVS to handle this issue.  Please
>> try. 
>
> It doesn't work, I'm afraid.  For example, ISO-8859-1 '¼' is changed

[-- Attachment #2: Type: text/plain, Size: 69 bytes --]

> to ISO-8859-15 'Œ', which is clearly wrong.

Fixed.

ShengHuo

^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-01 23:30           ` ShengHuo ZHU
@ 2001-11-02 19:36             ` Florian Weimer
  2001-11-02 22:36               ` ShengHuo ZHU
  0 siblings, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-11-02 19:36 UTC (permalink / raw)


[-- Attachment #1: Type: text/plain, Size: 207 bytes --]

ShengHuo ZHU <zsh@cs.rochester.edu> writes:

>>> I've put some code into the Gnus CVS to handle this issue.  Please
>>> try. 
>>
>> It doesn't work, I'm afraid.  For example, ISO-8859-1 '¼' is changed

[-- Attachment #2: Type: text/plain, Size: 379 bytes --]

>> to ISO-8859-15 'Œ', which is clearly wrong.
>
> Fixed.

Yes, thanks a lot.  However, I think you should unifiy to ISO-8859-1
instead of ISO-8859-15 because the standard utf-8.el can deal with
ISO-8859-1, but not ISO-8859-15.

BTW, there is a typo in the docstrings for 'mm-hack-charsets', and the
docstring for 'mm-iso-8859-15-compatible' seems to be a bit unclear.

^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-02 19:36             ` Florian Weimer
@ 2001-11-02 22:36               ` ShengHuo ZHU
  2001-11-03 10:37                 ` Florian Weimer
  2001-11-03 12:54                 ` Per Abrahamsen
  0 siblings, 2 replies; 34+ messages in thread
From: ShengHuo ZHU @ 2001-11-02 22:36 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

[...]

> Yes, thanks a lot.  However, I think you should unifiy to ISO-8859-1
> instead of ISO-8859-15

I think that in most cases ISO-8859-15 is used because of the euro
sign, which is not convertible to ISO-8859-1.

> because the standard utf-8.el can deal with ISO-8859-1, but not
> ISO-8859-15.

That's the problem of utf-8.el.

> BTW, there is a typo in the docstrings for 'mm-hack-charsets', and the
> docstring for 'mm-iso-8859-15-compatible' seems to be a bit unclear.

Fixed.

ShengHuo



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-02 22:36               ` ShengHuo ZHU
@ 2001-11-03 10:37                 ` Florian Weimer
  2001-11-03 14:53                   ` ShengHuo ZHU
  2001-11-03 12:54                 ` Per Abrahamsen
  1 sibling, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-11-03 10:37 UTC (permalink / raw)


ShengHuo ZHU <zsh@cs.rochester.edu> writes:

>> because the standard utf-8.el can deal with ISO-8859-1, but not
>> ISO-8859-15.
>
> That's the problem of utf-8.el.

It completely breaks UTF-8 support for Gnus using Emacs 21.  If you
reply to a UTF-8 message, chances are good that it contains ISO-8859-1
characters, and if I understand your code correctly, these characters
are mapped to ISO-8859-15, so the reply cannot be encoded using UTF-8
by Emacs.

But I agree that utf-8.el should support more charsets.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-02 22:36               ` ShengHuo ZHU
  2001-11-03 10:37                 ` Florian Weimer
@ 2001-11-03 12:54                 ` Per Abrahamsen
  2001-11-03 23:31                   ` Paul Jarc
  1 sibling, 1 reply; 34+ messages in thread
From: Per Abrahamsen @ 2001-11-03 12:54 UTC (permalink / raw)


ShengHuo ZHU <zsh@cs.rochester.edu> writes:

> I think that in most cases ISO-8859-15 is used because of the euro
> sign, which is not convertible to ISO-8859-1.

I might misunderstand you, but ISO-8859-1.is used a lot more than
ISO-8859-15 right now.  It is not clear that this will change in the
future, since use is likely to migrate to UTF8 rather than
ISO-8859-15. 




^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 10:37                 ` Florian Weimer
@ 2001-11-03 14:53                   ` ShengHuo ZHU
  2001-11-03 16:36                     ` Kai Großjohann
  2001-11-03 19:57                     ` Karl Eichwalder
  0 siblings, 2 replies; 34+ messages in thread
From: ShengHuo ZHU @ 2001-11-03 14:53 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

> ShengHuo ZHU <zsh@cs.rochester.edu> writes:
>
>>> because the standard utf-8.el can deal with ISO-8859-1, but not
>>> ISO-8859-15.
>>
>> That's the problem of utf-8.el.
>
> It completely breaks UTF-8 support for Gnus using Emacs 21.  If you
> reply to a UTF-8 message, chances are good that it contains ISO-8859-1
> characters, and if I understand your code correctly, these characters
> are mapped to ISO-8859-15, so the reply cannot be encoded using UTF-8
> by Emacs.

The point is that if a message contains ISO-8859-15, Gnus converts
ISO-8859-1 to ISO-8859-15.

The cases are that if a message contains

1. no ISO-8859-15 chars, no conversion. Fine for utf-8.el.

2. some inconvertible ISO-8859-15 chars (e.g. euro sign), convert
   ISO-8859-1 to ISO-8859-15. No harm, utf-8.el can't encode it either
   way.  

3. no inconvertible ISO-8859-15 chars, convert ISO-8859-1 to
   ISO-8859-15. It would be better to convert them to ISO-8859-1. But
   this case doesn't happen much. Why users input ISO-8859-15 chars if
   they can do it with ISO-8859-1 chars?

To autodetect the case 3 will lose the efficiency.

ShengHuo



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 14:53                   ` ShengHuo ZHU
@ 2001-11-03 16:36                     ` Kai Großjohann
  2001-11-03 18:29                       ` Graham Murray
  2001-11-03 19:57                     ` Karl Eichwalder
  1 sibling, 1 reply; 34+ messages in thread
From: Kai Großjohann @ 2001-11-03 16:36 UTC (permalink / raw)


ShengHuo ZHU <zsh@cs.rochester.edu> writes:

> 3. no inconvertible ISO-8859-15 chars, convert ISO-8859-1 to
>    ISO-8859-15. It would be better to convert them to ISO-8859-1. But
>    this case doesn't happen much. Why users input ISO-8859-15 chars if
>    they can do it with ISO-8859-1 chars?

Presumably, users will switch from a _default_ encoding of Latin-1
now to a default encoding of Latin-9 later.  Most people probably
don't want to use Latin-1 for most things, and Latin-9 only for the
Euro sign -- they want to use Latin-9 in all cases.

At least that's what I would want, as a (more or less) typical German.

kai
-- 
Lisp is kinda like tpircstsoP



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 16:36                     ` Kai Großjohann
@ 2001-11-03 18:29                       ` Graham Murray
  2001-11-03 20:28                         ` Florian Weimer
  2001-11-04 18:24                         ` Kai Großjohann
  0 siblings, 2 replies; 34+ messages in thread
From: Graham Murray @ 2001-11-03 18:29 UTC (permalink / raw)


Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes:

> Presumably, users will switch from a _default_ encoding of Latin-1
> now to a default encoding of Latin-9 later.  Most people probably
> don't want to use Latin-1 for most things, and Latin-9 only for the
> Euro sign -- they want to use Latin-9 in all cases.

Though does USEFOR not specify a default of utf-8?



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 14:53                   ` ShengHuo ZHU
  2001-11-03 16:36                     ` Kai Großjohann
@ 2001-11-03 19:57                     ` Karl Eichwalder
  1 sibling, 0 replies; 34+ messages in thread
From: Karl Eichwalder @ 2001-11-03 19:57 UTC (permalink / raw)


ShengHuo ZHU <zsh@cs.rochester.edu> writes:

> 3. no inconvertible ISO-8859-15 chars, convert ISO-8859-1 to
>    ISO-8859-15. It would be better to convert them to ISO-8859-1. But
>    this case doesn't happen much. Why users input ISO-8859-15 chars if
>    they can do it with ISO-8859-1 chars?

Happens automatically when you call Emacs from an ISO-8859-15 locale:

    LANG=de_DE.ISO-8859-15  emacs

or

    LANG=de_DE@euro  emacs

-- 
ke@suse.de (work) / keichwa@gmx.net (home):              |
http://www.suse.de/~ke/                                  |      ,__o
Free Translation Project:                                |    _-\_<,
http://www.iro.umontreal.ca/contrib/po/HTML/             |   (*)/'(*)



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 20:28                         ` Florian Weimer
@ 2001-11-03 20:28                           ` Per Abrahamsen
  2001-11-03 20:38                             ` Russ Allbery
  2001-11-03 21:57                             ` Using iso-8859-15 with Emacs21 & Gnus Florian Weimer
  2001-11-04 18:29                           ` Kai Großjohann
  1 sibling, 2 replies; 34+ messages in thread
From: Per Abrahamsen @ 2001-11-03 20:28 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

> Graham Murray <graham@barnowl.demon.co.uk> writes:
>
>> Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes:
>>
>>> Presumably, users will switch from a _default_ encoding of Latin-1
>>> now to a default encoding of Latin-9 later.  Most people probably
>>> don't want to use Latin-1 for most things, and Latin-9 only for the
>>> Euro sign -- they want to use Latin-9 in all cases.
>>
>> Though does USEFOR not specify a default of utf-8?
>
> No.  USEFOR does not specify any default charset.  

Yes it does.  UTF-8 for headers and US-ASCII for the body.

> Anyway, I think Kai was talking about the default charset for posting,
> which is properly MIME-declared by Gnus.

Agreed.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 18:29                       ` Graham Murray
@ 2001-11-03 20:28                         ` Florian Weimer
  2001-11-03 20:28                           ` Per Abrahamsen
  2001-11-04 18:29                           ` Kai Großjohann
  2001-11-04 18:24                         ` Kai Großjohann
  1 sibling, 2 replies; 34+ messages in thread
From: Florian Weimer @ 2001-11-03 20:28 UTC (permalink / raw)


Graham Murray <graham@barnowl.demon.co.uk> writes:

> Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes:
>
>> Presumably, users will switch from a _default_ encoding of Latin-1
>> now to a default encoding of Latin-9 later.  Most people probably
>> don't want to use Latin-1 for most things, and Latin-9 only for the
>> Euro sign -- they want to use Latin-9 in all cases.
>
> Though does USEFOR not specify a default of utf-8?

No.  USEFOR does not specify any default charset.  It doesn't even
encourage reading agents to implement hierarchy-specific default
charsets, which is required in practice.

Anyway, I think Kai was talking about the default charset for posting,
which is properly MIME-declared by Gnus.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 20:28                           ` Per Abrahamsen
@ 2001-11-03 20:38                             ` Russ Allbery
  2001-11-03 21:11                               ` USEFOR (was: Re: Using iso-8859-15 with Emacs21 & Gnus) Per Abrahamsen
  2001-11-03 21:57                             ` Using iso-8859-15 with Emacs21 & Gnus Florian Weimer
  1 sibling, 1 reply; 34+ messages in thread
From: Russ Allbery @ 2001-11-03 20:38 UTC (permalink / raw)


Per Abrahamsen <abraham@dina.kvl.dk> writes:
> Florian Weimer <fw@deneb.enyo.de> writes:

>> No.  USEFOR does not specify any default charset.  

> Yes it does.  UTF-8 for headers and US-ASCII for the body.

Why on earth would you require UTF-8 handling for the headers and not take
advantage of it for the body?  Oh, wait, right, because we're rewriting
the mail standards without really thinking.

It's this kind of thing that really makes me think it's best to ignore
USEFOR completely for right now.  The lunatics are running the asylum over
there.

-- 
Russ Allbery (rra@stanford.edu)             <http://www.eyrie.org/~eagle/>



^ permalink raw reply	[flat|nested] 34+ messages in thread

* USEFOR (was: Re: Using iso-8859-15 with Emacs21 & Gnus)
  2001-11-03 20:38                             ` Russ Allbery
@ 2001-11-03 21:11                               ` Per Abrahamsen
  2001-11-03 21:34                                 ` Russ Allbery
  2001-11-03 22:01                                 ` USEFOR Florian Weimer
  0 siblings, 2 replies; 34+ messages in thread
From: Per Abrahamsen @ 2001-11-03 21:11 UTC (permalink / raw)


Russ Allbery <rra@stanford.edu> writes:

> Per Abrahamsen <abraham@dina.kvl.dk> writes:
>> Florian Weimer <fw@deneb.enyo.de> writes:
>
>>> No.  USEFOR does not specify any default charset.  
>
>> Yes it does.  UTF-8 for headers and US-ASCII for the body.
>
> Why on earth would you require UTF-8 handling for the headers and not take
> advantage of it for the body?

Because leaving out the Content-Type header for messages with UTF-8
content in the body cause problems for pre-standard clients and has no
real advantages.

The situation is very different for headers, where both the option of
qp-encoding UTF-8 content in the headers and the option of not
qp-encoding UTF-8 content in the header will cause real problems for
pre-standard newsreaders.  And the option of qp-encoding the header
will cause problems on pre-standard servers.

The cases are not parallel, since MIME does not allow you to specify
character set in headers without QP or BASE64 encoding the content.
In the body, the two functions are ortogonal.

I agree that USEFOR in general are doing too much design and too
little standardization, but I think they took the least bad option in
this case.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: USEFOR (was: Re: Using iso-8859-15 with Emacs21 & Gnus)
  2001-11-03 21:11                               ` USEFOR (was: Re: Using iso-8859-15 with Emacs21 & Gnus) Per Abrahamsen
@ 2001-11-03 21:34                                 ` Russ Allbery
  2001-11-03 22:01                                 ` USEFOR Florian Weimer
  1 sibling, 0 replies; 34+ messages in thread
From: Russ Allbery @ 2001-11-03 21:34 UTC (permalink / raw)


Per Abrahamsen <abraham@dina.kvl.dk> writes:
> Russ Allbery <rra@stanford.edu> writes:

>> Why on earth would you require UTF-8 handling for the headers and not
>> take advantage of it for the body?

> Because leaving out the Content-Type header for messages with UTF-8
> content in the body cause problems for pre-standard clients and has no
> real advantages.

Ah, yeah, good point.

> I agree that USEFOR in general are doing too much design and too little
> standardization, but I think they took the least bad option in this
> case.

Yeah, you're right, they did.

-- 
Russ Allbery (rra@stanford.edu)             <http://www.eyrie.org/~eagle/>



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 20:28                           ` Per Abrahamsen
  2001-11-03 20:38                             ` Russ Allbery
@ 2001-11-03 21:57                             ` Florian Weimer
  2001-11-03 22:09                               ` Russ Allbery
  1 sibling, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-11-03 21:57 UTC (permalink / raw)


Per Abrahamsen <abraham@dina.kvl.dk> writes:

>> No.  USEFOR does not specify any default charset.  
>
> Yes it does.  UTF-8 for headers and US-ASCII for the body.

For the headers, I agree, but for the body, no.  There's no such
default in the current draft.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: USEFOR
  2001-11-03 21:11                               ` USEFOR (was: Re: Using iso-8859-15 with Emacs21 & Gnus) Per Abrahamsen
  2001-11-03 21:34                                 ` Russ Allbery
@ 2001-11-03 22:01                                 ` Florian Weimer
  2001-11-05 10:46                                   ` USEFOR Per Abrahamsen
  1 sibling, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-11-03 22:01 UTC (permalink / raw)


Per Abrahamsen <abraham@dina.kvl.dk> writes:

> Because leaving out the Content-Type header for messages with UTF-8
> content in the body cause problems for pre-standard clients and has no
> real advantages.

In particular since many hierarchies have already a declared a charset
to be the default if no MIME headers are present.  Unfortunately,
USEFOR is completely ignoring this issue and fails to mention that
reading agents need hierarchy-specific charset defaults.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 21:57                             ` Using iso-8859-15 with Emacs21 & Gnus Florian Weimer
@ 2001-11-03 22:09                               ` Russ Allbery
  2001-11-04  0:56                                 ` Florian Weimer
  0 siblings, 1 reply; 34+ messages in thread
From: Russ Allbery @ 2001-11-03 22:09 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

> For the headers, I agree, but for the body, no.  There's no such default
> in the current draft.

It's inherited from MIME.

-- 
Russ Allbery (rra@stanford.edu)             <http://www.eyrie.org/~eagle/>



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 12:54                 ` Per Abrahamsen
@ 2001-11-03 23:31                   ` Paul Jarc
  0 siblings, 0 replies; 34+ messages in thread
From: Paul Jarc @ 2001-11-03 23:31 UTC (permalink / raw)


Per Abrahamsen <abraham@dina.kvl.dk> wrote:
> ShengHuo ZHU <zsh@cs.rochester.edu> writes:
>> I think that in most cases ISO-8859-15 is used because of the euro
>> sign, which is not convertible to ISO-8859-1.
>
> I might misunderstand you, but ISO-8859-1.is used a lot more than
> ISO-8859-15 right now.

I think ShengHuo meant: in most of the cases where -15 is used, it is
used for the euro sign; not: -15 is used more often than -1.

So unifying -15 and -1 to -1 is likely to cause lossage: the original
-15 part probably used to euro, and the unified message won't be able
to.


paul



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 22:09                               ` Russ Allbery
@ 2001-11-04  0:56                                 ` Florian Weimer
  2001-11-04 10:31                                   ` Russ Allbery
  0 siblings, 1 reply; 34+ messages in thread
From: Florian Weimer @ 2001-11-04  0:56 UTC (permalink / raw)


Russ Allbery <rra@stanford.edu> writes:

> Florian Weimer <fw@deneb.enyo.de> writes:
>
>> For the headers, I agree, but for the body, no.  There's no such default
>> in the current draft.
>
> It's inherited from MIME.

How do you know? ;-)

The draft encourages implementations to recognize UTF-8 even though it
is not declared (and fails to mention the already existing practice of
hierarchy-specific default charsets).



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-04  0:56                                 ` Florian Weimer
@ 2001-11-04 10:31                                   ` Russ Allbery
  0 siblings, 0 replies; 34+ messages in thread
From: Russ Allbery @ 2001-11-04 10:31 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:
> Russ Allbery <rra@stanford.edu> writes:
>> Florian Weimer <fw@deneb.enyo.de> writes:

>>> For the headers, I agree, but for the body, no.  There's no such
>>> default in the current draft.

>> It's inherited from MIME.

> How do you know? ;-)

How do I know which?  That news messages are supposed to be MIME messages?
The draft says that pretty clearly, I thought, although it then mentions a
bunch of exceptions like UTF-8 headers.

> The draft encourages implementations to recognize UTF-8 even though it
> is not declared (and fails to mention the already existing practice of
> hierarchy-specific default charsets).

The draft completely ignores all sorts of existing practice that the
authors seem to find inconvenient.  *sigh*

-- 
Russ Allbery (rra@stanford.edu)             <http://www.eyrie.org/~eagle/>



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 18:29                       ` Graham Murray
  2001-11-03 20:28                         ` Florian Weimer
@ 2001-11-04 18:24                         ` Kai Großjohann
  1 sibling, 0 replies; 34+ messages in thread
From: Kai Großjohann @ 2001-11-04 18:24 UTC (permalink / raw)


Graham Murray <graham@barnowl.demon.co.uk> writes:

> Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes:
>
>> Presumably, users will switch from a _default_ encoding of Latin-1
>> now to a default encoding of Latin-9 later.  Most people probably
>> don't want to use Latin-1 for most things, and Latin-9 only for the
>> Euro sign -- they want to use Latin-9 in all cases.
>
> Though does USEFOR not specify a default of utf-8?

I meant that I now use (set-language-environment "Latin-1"), and once
I see a real need for the Euro sign I might switch to
(set-language-environment "Latin-9"), instead.

This change affects all of Emacs, so USEFOR is only marginally
relevant, here.

kai
-- 
I like BOTH kinds of music.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-03 20:28                         ` Florian Weimer
  2001-11-03 20:28                           ` Per Abrahamsen
@ 2001-11-04 18:29                           ` Kai Großjohann
  2001-11-04 23:27                             ` Florian Weimer
  1 sibling, 1 reply; 34+ messages in thread
From: Kai Großjohann @ 2001-11-04 18:29 UTC (permalink / raw)
  Cc: ding

Florian Weimer <fw@deneb.enyo.de> writes:

> Anyway, I think Kai was talking about the default charset for posting,
> which is properly MIME-declared by Gnus.

Well, err.  Hm.  I wasn't thinking of this explicitly, but in effect,
that's what I was saying: right now, Gnus uses Latin-1 for German
Umlauts, and after I switch to Latin-9, Gnus should use Latin-9 for
the Umlauts, too.

In the current design of Mule, this would be achieved by the
following chain of events:

* I change (set-language-environment "Latin-1") to
  (set-language-environment "Latin-9").

* Now typing `" a' inserts a Latin-9 ä which Emacs distinguishes from
  a Latin-1 ä.

* Gnus sees non-Ascii Latin-9 characters in my post and chooses
  iso-8859-15 as charset.

After changing Mule to unify the various Latin-* charsets, Emacs does
not distinguish the two variants of ä anymore.  Not sure what that
means to the whole procedure.  Maybe Latin-9 is preferred and used
whenever possible, after set-language-environment?  Maybe.

kai
-- 
I like BOTH kinds of music.



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: Using iso-8859-15 with Emacs21 & Gnus
  2001-11-04 18:29                           ` Kai Großjohann
@ 2001-11-04 23:27                             ` Florian Weimer
  0 siblings, 0 replies; 34+ messages in thread
From: Florian Weimer @ 2001-11-04 23:27 UTC (permalink / raw)


Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes:

> After changing Mule to unify the various Latin-* charsets, Emacs does
> not distinguish the two variants of ä anymore.  Not sure what that
> means to the whole procedure.  Maybe Latin-9 is preferred and used
> whenever possible, after set-language-environment?  Maybe.

If unification is present, the whole MIME charset decision process in
Gnus has to be redesigned (and many things can then work in a very
straightforward way).



^ permalink raw reply	[flat|nested] 34+ messages in thread

* Re: USEFOR
  2001-11-03 22:01                                 ` USEFOR Florian Weimer
@ 2001-11-05 10:46                                   ` Per Abrahamsen
  0 siblings, 0 replies; 34+ messages in thread
From: Per Abrahamsen @ 2001-11-05 10:46 UTC (permalink / raw)


Florian Weimer <fw@deneb.enyo.de> writes:

> In particular since many hierarchies have already a declared a charset
> to be the default if no MIME headers are present.  Unfortunately,
> USEFOR is completely ignoring this issue and fails to mention that
> reading agents need hierarchy-specific charset defaults.

They don't.  It is a quality of implementation issue.  



^ permalink raw reply	[flat|nested] 34+ messages in thread

end of thread, other threads:[~2001-11-05 10:46 UTC | newest]

Thread overview: 34+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2001-10-27 21:50 Using iso-8859-15 with Emacs21 & Gnus Raymond Scholz
2001-10-27 22:51 ` Florian Weimer
2001-10-28 13:12   ` Raymond Scholz
2001-11-01 11:26     ` Florian Weimer
2001-11-01 14:37       ` ShengHuo ZHU
2001-11-01 21:06         ` Florian Weimer
2001-11-01 23:30           ` ShengHuo ZHU
2001-11-02 19:36             ` Florian Weimer
2001-11-02 22:36               ` ShengHuo ZHU
2001-11-03 10:37                 ` Florian Weimer
2001-11-03 14:53                   ` ShengHuo ZHU
2001-11-03 16:36                     ` Kai Großjohann
2001-11-03 18:29                       ` Graham Murray
2001-11-03 20:28                         ` Florian Weimer
2001-11-03 20:28                           ` Per Abrahamsen
2001-11-03 20:38                             ` Russ Allbery
2001-11-03 21:11                               ` USEFOR (was: Re: Using iso-8859-15 with Emacs21 & Gnus) Per Abrahamsen
2001-11-03 21:34                                 ` Russ Allbery
2001-11-03 22:01                                 ` USEFOR Florian Weimer
2001-11-05 10:46                                   ` USEFOR Per Abrahamsen
2001-11-03 21:57                             ` Using iso-8859-15 with Emacs21 & Gnus Florian Weimer
2001-11-03 22:09                               ` Russ Allbery
2001-11-04  0:56                                 ` Florian Weimer
2001-11-04 10:31                                   ` Russ Allbery
2001-11-04 18:29                           ` Kai Großjohann
2001-11-04 23:27                             ` Florian Weimer
2001-11-04 18:24                         ` Kai Großjohann
2001-11-03 19:57                     ` Karl Eichwalder
2001-11-03 12:54                 ` Per Abrahamsen
2001-11-03 23:31                   ` Paul Jarc
2001-11-01 21:14         ` Raymond Scholz
2001-11-01 22:26           ` Florian Weimer
2001-11-01 17:51       ` news
2001-11-01 21:06         ` Florian Weimer

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).