From mboxrd@z Thu Jan 1 00:00:00 1970
X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/38090
Path: main.gmane.org!not-for-mail
From: Daniel Pittman
Newsgroups: gmane.emacs.gnus.general
Subject: Re: Have Emacs guess the charset?
Date: Mon, 20 Aug 2001 11:26:43 +1000
Organization: Not today, thank you, Mother.
Message-ID: <87r8u7zfl8.fsf@inanna.rimspace.net>
References: <871ym71ulx.fsf@inanna.rimspace.net>
NNTP-Posting-Host: coloc-standby.netfonds.no
Mime-Version: 1.0
Content-Type: text/plain; charset=us-ascii
X-Trace: main.gmane.org 1035173725 18976 80.91.224.250 (21 Oct 2002 04:15:25 GMT)
X-Complaints-To: usenet@main.gmane.org
NNTP-Posting-Date: Mon, 21 Oct 2002 04:15:25 +0000 (UTC)
Return-Path:
Original-Received: (qmail 23183 invoked from network); 20 Aug 2001 01:27:34 -0000
Original-Received: from melancholia.rimspace.net (HELO melancholia.danann.net) (203.36.211.210)
  by gnus.org with SMTP; 20 Aug 2001 01:27:34 -0000
Original-Received: from localhost (melancholia.rimspace.net [203.36.211.210])
  by melancholia.danann.net (Postfix) with ESMTP id C60A12A834
  for ; Mon, 20 Aug 2001 11:27:16 +1000 (EST)
Original-Received: by localhost (Postfix, from userid 1000)
  id 37DD98217B; Mon, 20 Aug 2001 11:26:43 +1000 (EST)
Original-To: ding@gnus.org
In-Reply-To: (Lars Magne Ingebrigtsen's message of "Mon, 20 Aug 2001 02:36:14 +0200")
X-Homepage: http://danann.net/
User-Agent: Gnus/5.090004 (Oort Gnus v0.04) XEmacs/21.5 (artichoke)
Original-Lines: 43
Xref: main.gmane.org gmane.emacs.gnus.general:38090
X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:38090

On Mon, 20 Aug 2001, Lars Magne Ingebrigtsen wrote:
> Daniel Pittman writes:
>
>> `detect-coding-region'
>
> Thanks.
>
> I've just tried it in an (unmarked) big5 message.  (Well, I'm guessing
> it was big5.)  The function returned the following list:
>
> (iso-latin-1-unix raw-text-unix chinese-big5-unix no-conversion)
>
> Which means that the correct answer is the third-most-likely guess...
> Is big5 particularly difficult to guess, or is the function bad at
> guessing?

I think that the answer is probably "both", but I am not really certain.
I don't know too much about MULE, but both Kai and I have tried to
support it at various times with TRAMP.

So, my recollection of BIG5 encoding is that it is a double-byte
encoding with lead bytes in the 128-255 range, and no iso-2022 escape
sequences to mark the shifts to and from ASCII. That means it's probably
not that easy to pick.

Which, you understand, does not make the function all that smart. Under
XEmacs, it's pretty simplistic in its detection of possible coding
system matches.

Er, you did call it on the region that *DID NOT* include the ASCII email
headers, right?

If that's true, I guess that you are pretty much short of luck. :(

        Daniel

-- 
There is censorship in this country, all right, make no mistake about
that, but also make no mistake about its source...While the government
will not censor, apparently the networks will. The irreparable damage to
the public is all the same.
        -- Nicholas Johnson, Federal Communications Commissioner,
           _New York Times_, (April 8, 1969)
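
A minimal sketch of why the guess is hard, written in Python rather than Emacs Lisp and not taken from the thread: it only checks which codecs accept a Big5-encoded byte string without error. The sample text and the helper name `decodes_as` are assumptions made for illustration. Since every byte value is legal Latin-1, a detector that only checks structural validity cannot rule Latin-1 out, which would be consistent with `iso-latin-1-unix` appearing ahead of `chinese-big5-unix` in the list quoted above.

```python
# Hypothetical sample: two Chinese characters that exist in Big5.
text = "中文"
data = text.encode("big5")


def decodes_as(raw: bytes, codec: str) -> bool:
    """Return True if raw decodes under codec without raising an error."""
    try:
        raw.decode(codec)
        return True
    except UnicodeDecodeError:
        return False


# Try a few candidate codecs against the same bytes.
candidates = [c for c in ("ascii", "latin-1", "big5") if decodes_as(data, c)]
print(candidates)
```

Both `latin-1` and `big5` accept the bytes, so validity alone does not pick a winner; a detector needs some further heuristic (byte-pair frequencies, for instance) to rank them.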