From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/38135 Path: main.gmane.org!not-for-mail From: ShengHuo ZHU Newsgroups: gmane.emacs.gnus.general Subject: Re: Have Emacs guess the charset? Date: Mon, 20 Aug 2001 17:07:24 -0700 Message-ID: <2nhev2uvgj.fsf@piglet.jia.vnet> References: <871ym71ulx.fsf@inanna.rimspace.net> NNTP-Posting-Host: coloc-standby.netfonds.no Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="=-=-=" X-Trace: main.gmane.org 1035173763 19236 80.91.224.250 (21 Oct 2002 04:16:03 GMT) X-Complaints-To: usenet@main.gmane.org NNTP-Posting-Date: Mon, 21 Oct 2002 04:16:03 +0000 (UTC) Return-Path: Return-Path: Original-Received: (qmail 25738 invoked from network); 21 Aug 2001 00:08:01 -0000 Original-Received: from unknown (HELO zsh.2y.net) (131.241.79.80) by gnus.org with SMTP; 21 Aug 2001 00:08:01 -0000 Original-Received: (from zsh@localhost) by zsh.2y.net (8.11.4/8.11.2) id f7L07RW22195; Mon, 20 Aug 2001 17:07:27 -0700 Original-To: ding@gnus.org X-Attribution: ZSH X-Face: 'IF:e51ib'Qbl^(}l^&4-J`'P!@[4~O|&k#:@Gld#b/]oMq&`&FVY._3+b`mzp~Jeve~/#/ ERD!OTe<86UhyN=l`mrPY)M7_}`Ktt\K+58Z!hu7>qU,i.N7TotU[FYE(f1;}`g2xj!u*l`^&=Q!g{ *q|ddto|nkt"$r,K$[)"|6,elPH= GJ6Q Mail-Copies-To: never In-Reply-To: <871ym71ulx.fsf@inanna.rimspace.net> (Daniel Pittman's message of "Mon, 20 Aug 2001 09:46:18 +1000") User-Agent: Gnus/5.090004 (Oort Gnus v0.04) Emacs/21.0.105 Original-Lines: 30 Xref: main.gmane.org gmane.emacs.gnus.general:38135 X-Report-Spam: http://spam.gmane.org/gmane.emacs.gnus.general:38135 --=-=-= Content-Type: text/plain; charset=iso-8859-1 Content-Transfer-Encoding: 8bit Daniel Pittman writes: > On Sun, 19 Aug 2001, Lars Magne Ingebrigtsen wrote: > > Kai.Grossjohann@CS.Uni-Dortmund.DE (Kai Großjohann) writes: >> >>> Sometimes I get email which has no predeclared charset. Emacs assumes >>> Latin-1 in those cases. This is good in general. But is there a way >>> to have Emacs inspect the current message and suggest a better >>> charset? >>> >>> In particular, I sometimes know there is Chinese in it, but I don't >>> know if it's GB or Big5 encoded. So I try both until I see a >>> character I recognize. Is there a way to have Emacs/Gnus guess >>> whether it's GB or Big5? >> >> Surely there must be some Mule functions for guessing what charset >> some text is in, but I have no idea what it's called. Anybody? > > `detect-coding-region' I tested the function on the attached files in both Emacs 21 and XEmacs 21.4. I found the one in XEmacs did a good job, but the one in Emacs is almost useless. ShengHuo --=-=-= Content-Type: application/octet-stream Content-Disposition: attachment; filename=Big5.tex Content-Transfer-Encoding: base64 JSBUaGlzIGlzIHRoZSBmaWxlIEJpZzUudGV4IG9mIHRoZSBDSksgcGFja2FnZQolICAgZm9yIHRl c3RpbmcgQ2hpbmVzZSAoaW4gQmlnIDUgZW5jb2RpbmcpLgolCiUgd3JpdHRlbiBieSBXZXJuZXIg TGVtYmVyZyA8d2xAZ251Lm9yZz4KJQolIFZlcnNpb24gNC4yLjAgKDEzLURlYy0xOTk4KQolCiUK JSBwcm9jZXNzIHRoaXMgZmlsZSB3aXRoIGJnNWxhdGV4CgpcZG9jdW1lbnRjbGFzc1sxMnB0XXth cnRpY2xlfSAKClx1c2VwYWNrYWdle0NKS30KCgpcYmVnaW57ZG9jdW1lbnR9CgpcYmVnaW57Q0pL Kn17Qmc1fXtzb25nfQpcQ0pLdGlsZGUKClxub2luZGVudCClu7FgsN2w3bWqtrB+KEZBUSBsaXN0 KX6sT7FxpECox7hnsWCzUbDdqOyqurDdw0SkzqjkvkG37aq6uNEKtaqkpKFBpUik6KtLqrqnzqah ukutbqbTpViquqFDuPKkV6RAqqmko6ZQqrqsT6FBqOS9c7HGtbK6Y6R3uf2ps6fvxdyhQwpcdGV4 dGJme6azw/a3c7WyumOqurLTuGChQaVpsNGm0qF1pnCm875cxaqlu7Ddtaq2sKTOpEa40ajkvXOx xrWyumOhdrjTCra1pKSquruhqfqhQ30KClxlbmR7Q0pLKn0KClxlbmR7ZG9jdW1lbnR9Cg== --=-=-= Content-Type: application/octet-stream Content-Disposition: attachment; filename=GB.tex Content-Transfer-Encoding: base64 JSBUaGlzIGlzIHRoZSBmaWxlIEdCLnRleCBvZiB0aGUgQ0pLIHBhY2thZ2UKJSAgIGZvciB0ZXN0 aW5nIENoaW5lc2UgKGluIEdCIGVuY29kaW5nKS4KJQolIHdyaXR0ZW4gYnkgV2VybmVyIExlbWJl cmcgPHdsQGdudS5vcmc+CiUKJSBWZXJzaW9uIDQuMi4wICgxMy1EZWMtMTk5OCkKClxkb2N1bWVu dGNsYXNzWzEycHRde2FydGljbGV9CgpcdXNlcGFja2FnZXtDSkt9CgoKXGJlZ2lue2RvY3VtZW50 fQoKXGJlZ2lue0NKSyp9e0dCfXtzb25nfQpcQ0pLdGlsZGUKClxub2luZGVudCCxvrOjzsrOyrTw vK9+KEZBUSBsaXN0KX7Kx7TT0rvQqb6ts6Oxu87Ktb21xM7KzOK8sMbkysq1sbXEveIKtPDW0KOs 0tS3vbHjtcTQzsq91arSqrb4s/a1xKGjuPrJz9K7sOayu82stcTKx6OsxuSx4MXFveG5udLRs7m1 17jEseShowpcdGV4dGJme9PQudjQwr3hubm1xM+4vdqjrL/Jss6/vKG4yOe6ztTEtsGxvs7KtPC8 r7ywwcu94sbkseDFxb3hubmhubjDCs/u1tC1xMu1w/eho30KClxlbmR7Q0pLKn0KClxlbmR7ZG9j dW1lbnR9Cg== --=-=-=--