From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/74756 Path: news.gmane.org!not-for-mail From: Lars Magne Ingebrigtsen Newsgroups: gmane.emacs.gnus.general Subject: Re: numeric entities Date: Mon, 06 Dec 2010 15:44:02 +0100 Organization: Programmerer Ingebrigtsen Message-ID: References: NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 8bit X-Trace: dough.gmane.org 1291646703 19467 80.91.229.12 (6 Dec 2010 14:45:03 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 6 Dec 2010 14:45:03 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M23112@lists.math.uh.edu Mon Dec 06 15:44:59 2010 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PPcJ6-0004HT-BA for ding-account@gmane.org; Mon, 06 Dec 2010 15:44:48 +0100 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1PPcIt-0004GE-RS; Mon, 06 Dec 2010 08:44:35 -0600 Original-Received: from mx1.math.uh.edu ([129.7.128.32]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1PPcIs-0004Fz-2p for ding@lists.math.uh.edu; Mon, 06 Dec 2010 08:44:34 -0600 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx1.math.uh.edu with esmtp (Exim 4.72) (envelope-from ) id 1PPcIn-0006DN-Fc for ding@lists.math.uh.edu; Mon, 06 Dec 2010 08:44:33 -0600 Original-Received: from lo.gmane.org ([80.91.229.12]) by quimby.gnus.org with esmtp (Exim 3.36 #1 (Debian)) id 1PPcIj-00028y-00 for ; Mon, 06 Dec 2010 15:44:25 +0100 Original-Received: from list by lo.gmane.org with local (Exim 4.69) (envelope-from ) id 1PPcIf-0003wD-TD for ding@gnus.org; Mon, 06 Dec 2010 15:44:21 +0100 Original-Received: from cm-84.215.34.171.getinternet.no ([84.215.34.171]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 06 Dec 2010 15:44:21 +0100 Original-Received: from larsi by cm-84.215.34.171.getinternet.no with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Mon, 06 Dec 2010 15:44:21 +0100 X-Injected-Via-Gmane: http://gmane.org/ Mail-Followup-To: ding@gnus.org Original-Lines: 27 Original-X-Complaints-To: usenet@dough.gmane.org X-Gmane-NNTP-Posting-Host: cm-84.215.34.171.getinternet.no Face: iVBORw0KGgoAAAANSUhEUgAAADAAAAAwBAMAAAClLOS0AAAAFVBMVEX3+fj+/v46OTfK3eD9 //2cvMJ0kJT7jaMzAAACeElEQVQ4jVWTT4+bMBDFoYScsUL2XFEtZ9LZyTnRpns2cWbOKdTz/T9C nw1stxaKovl5/vj5ubg6x8I2tu5zHWl4uOIGYGanf3FXVUW5ANG3L3FXkD8lcOXI/2eEqrwWO+dq s7iCpsRPHXhwqObqub2u8ep7KnUpmgTa2iK5S2LILbdGu4oEU5FwIJLJ5mGbQBCXZamZ9hIIXyhy YAXSm07r3wIFGDs9YcndptcNoBmHM4+p7jvFyF+Bxm2YenSXlEtFOqDOt90KDtdPXQBEiMKzoUNL FVUYewN8NvWDCx+inlOTDKAuDbd3lAp7pnGvwrwBjgJ1Wwkaxvosa0aayjOmaivyFxou4ZlFKdLV koXxALEqpj3XomEDJnRq4pml5r1IeM4LaDlcXdlElci8r9WPfmneQit5tFFmgfj4/AqcxLv9xqUW CFxcWbl13FaMq0fy1a14H6hsmnXcVnz5S2yyaNDfbJqSNwAageYviMRAavM5TtPr4wsQE01mDbj4 H2MSsU3g2JlGnAGuVA1ySiBnHKMmTwyVKvQN2aIN9g0vAAgVvFhjXADm+ZYnmiuespfGJEkj83BE uOsMppr6iEYZOLGZOaJW32mYut7OdMoiir/d8HjihMls6jrknNp8DoU5e+0Mm7XDMl17wCU/Tbr+ 3ud4F+Nbk7XSQPtcaQE9iU+lXLZ5b3eEpgReiZ8bmKXL5VdAf3YJqHk9d6nWsiKRHzJIzbte7ivA y3gWi9txOwDa59ZpX/QL2LUfqCCp95QkoyDZVzq2L9gqy9V2UDvSAgKaQ4l7jqei4S3LPrO3pAQO kyfrLVwBDjifwiV4w5yf2UChdH8BzbLx+2zcUr0AAAAASUVORK5CYII= Mail-Copies-To: never X-Now-Playing: Marine Girls's _Lazy Ways + Beach Party_: "A Place In The Sun" User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:AgRM2V2pFRMqKM22/v1l1lAXZuE= X-Spam-Score: -1.9 (-) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:74756 Archived-At: Katsumi Yamaoka writes: > When reading html articles, I sometimes see numeric entities like > "›". Currently `shr' and `gnus-w3m' render it as "\233", but > it should be "›", i.e. U+8250. Here is a conversion table stolen > from emacs-w3m (#155 is there as #x9B): > > (defvar mm-url-extra-numeric-entities It's this mostly the same as `gnus-article-dumbquotes-map'? Looks somewhat bigger, though. So perhaps that should be installed, and then `article-treat-dumbquotes' could just use that map instead? > I can implement it in mm-url.el, that is effective to `gnus-w3m', > but I hesitate to use it in `mm-shr' before calling > `libxml-parse-html-region'. WDYT? (IOW, isn't it better to make > `libxml-parse-html-region' do it by itself? It's too much for me > though.) I think `libxml-parse-html-region' should just mainly parse what it's given, for greater flexibility. But perhaps `mm-shr' and `gnus-w3m' should just convert these automatically -- they never actually make much sense. -- (domestic pets only, the antidote for overdose, milk.) larsi@gnus.org * Lars Magne Ingebrigtsen