Gnus development mailing list
 help / color / mirror / Atom feed
* iso-8859-8/windows-1255 (Hebrew) and iso-8859-9/windows-1254 (Turkish)
@ 2006-09-28 16:44 Reiner Steib
  2006-10-04 21:49 ` Reiner Steib
  0 siblings, 1 reply; 2+ messages in thread
From: Reiner Steib @ 2006-09-28 16:44 UTC (permalink / raw)


Hi,

I've learned that windows-1254 is a superset of iso-8859-9 (Turkish)
and windows-1255 is a superset of iso-8859-8 (Hebrew).

So it would probably make sense to add these pairs to
`mm-charset-synonym-alist' and `mm-charset-override-alist' as well, in
the same way we do for iso-8859-1/windows-1252 (West Europe), see the
patch below.  In western Europe, articles are often mislabeled as
iso-8859-1, when in fact the article is windows-1252.

Does anyone (especially people using those charsets) see a problem
with this change?

--8<---------------cut here---------------start------------->8---
--- mm-util.el	16 Jul 2006 14:33:12 -0000	7.54
+++ mm-util.el	28 Sep 2006 16:37:05 -0000
@@ -208,12 +208,22 @@
     ;; BIG-5HKSCS is similar to, but different than, BIG-5.
     ,@(unless (mm-coding-system-p 'big5-hkscs)
 	'((big5-hkscs . big5)))
-    ;; Windows-1252 is actually a superset of Latin-1.  See also
-    ;; `gnus-article-dumbquotes-map'.
+    ;; Windows-1252 is actually a superset of iso-8859-1 (West Europe).  See
+    ;; also `gnus-article-dumbquotes-map'.
     ,@(unless (mm-coding-system-p 'windows-1252)
 	(if (mm-coding-system-p 'cp1252)
 	    '((windows-1252 . cp1252))
 	  '((windows-1252 . iso-8859-1))))
+    ;; Windows-1254 is a superset of iso-8859-9 (Turkish).
+    ,@(unless (mm-coding-system-p 'windows-1254)
+	(if (mm-coding-system-p 'cp1254)
+	    '((windows-1254 . cp1254))
+	  '((windows-1254 . iso-8859-9))))
+    ;; Windows-1255 is a superset of iso-8859-8 (Hebrew).
+    ,@(unless (mm-coding-system-p 'windows-1255)
+	(if (mm-coding-system-p 'cp1255)
+	    '((windows-1255 . cp1255))
+	  '((windows-1255 . iso-8859-8))))
     ;; Windows-1250 is a variant of Latin-2 heavily used by Microsoft
     ;; Outlook users in Czech republic. Use this to allow reading of their
     ;; e-mails. cp1250 should be defined by M-x codepage-setup.
@@ -237,7 +247,9 @@
   "A mapping from unknown or invalid charset names to the real charset names.")
 
 (defcustom mm-charset-override-alist
-  `((iso-8859-1 . windows-1252))
+  '((iso-8859-1 . windows-1252)
+    (iso-8859-8 . windows-1255)
+    (iso-8859-9 . windows-1254))
   "A mapping from undesired charset names to their replacement.
 
 You may add pairs like (iso-8859-1 . windows-1252) here,
@@ -245,6 +257,8 @@
 superset of iso-8859-1."
   :type '(list (set :inline t
 		    (const (iso-8859-1 . windows-1252))
+		    (const (iso-8859-8 . windows-1255))
+		    (const (iso-8859-9 . windows-1254))
 		    (const (undecided  . windows-1252)))
 	       (repeat :inline t
 		       :tag "Other options"
--8<---------------cut here---------------end--------------->8---

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo---  |  PGP key available  |  http://rsteib.home.pages.de/




^ permalink raw reply	[flat|nested] 2+ messages in thread

* Re: iso-8859-8/windows-1255 (Hebrew) and iso-8859-9/windows-1254 (Turkish)
  2006-09-28 16:44 iso-8859-8/windows-1255 (Hebrew) and iso-8859-9/windows-1254 (Turkish) Reiner Steib
@ 2006-10-04 21:49 ` Reiner Steib
  0 siblings, 0 replies; 2+ messages in thread
From: Reiner Steib @ 2006-10-04 21:49 UTC (permalink / raw)


On Thu, Sep 28 2006, Reiner Steib wrote:

> I've learned that windows-1254 is a superset of iso-8859-9 (Turkish)
> and windows-1255 is a superset of iso-8859-8 (Hebrew).
>
> So it would probably make sense to add these pairs to
> `mm-charset-synonym-alist' and `mm-charset-override-alist' as well, in
> the same way we do for iso-8859-1/windows-1252 (West Europe), see the
> patch below.  In western Europe, articles are often mislabeled as
> iso-8859-1, when in fact the article is windows-1252.

Committed (trunk).

Bye, Reiner.
-- 
       ,,,
      (o o)
---ooO-(_)-Ooo---  |  PGP key available  |  http://rsteib.home.pages.de/




^ permalink raw reply	[flat|nested] 2+ messages in thread

end of thread, other threads:[~2006-10-04 21:49 UTC | newest]

Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2006-09-28 16:44 iso-8859-8/windows-1255 (Hebrew) and iso-8859-9/windows-1254 (Turkish) Reiner Steib
2006-10-04 21:49 ` Reiner Steib

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).