From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.emacs.gnus.general/73215 Path: news.gmane.org!not-for-mail From: Katsumi Yamaoka Newsgroups: gmane.emacs.gnus.general Subject: Re: shr.el: folding Japanese text Date: Mon, 18 Oct 2010 14:19:10 +0900 Organization: Emacsen advocacy group Message-ID: References: <8762xcvbsp.fsf@lifelogs.com> <87eibxkxa3.fsf@anar.kanru.info> NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: dough.gmane.org 1287379266 18622 80.91.229.12 (18 Oct 2010 05:21:06 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Mon, 18 Oct 2010 05:21:06 +0000 (UTC) To: ding@gnus.org Original-X-From: ding-owner+M21588@lists.math.uh.edu Mon Oct 18 07:21:05 2010 Return-path: Envelope-to: ding-account@gmane.org Original-Received: from util0.math.uh.edu ([129.7.128.18]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1P7i9g-0004jT-Ck for ding-account@gmane.org; Mon, 18 Oct 2010 07:21:04 +0200 Original-Received: from localhost ([127.0.0.1] helo=lists.math.uh.edu) by util0.math.uh.edu with smtp (Exim 4.63) (envelope-from ) id 1P7i8V-00044G-Cz; Mon, 18 Oct 2010 00:19:51 -0500 Original-Received: from mx1.math.uh.edu ([129.7.128.32]) by util0.math.uh.edu with esmtps (TLSv1:AES256-SHA:256) (Exim 4.63) (envelope-from ) id 1P7i8T-000440-P0 for ding@lists.math.uh.edu; Mon, 18 Oct 2010 00:19:49 -0500 Original-Received: from quimby.gnus.org ([80.91.231.51]) by mx1.math.uh.edu with esmtp (Exim 4.72) (envelope-from ) id 1P7i8S-0001pp-Gu for ding@lists.math.uh.edu; Mon, 18 Oct 2010 00:19:49 -0500 Original-Received: from orlando.hostforweb.net ([216.246.45.90]) by quimby.gnus.org with esmtp (Exim 3.36 #1 (Debian)) id 1P7i8R-00022J-00 for ; Mon, 18 Oct 2010 07:19:47 +0200 Original-Received: from localhost ([127.0.0.1]:48158) by orlando.hostforweb.net with esmtpa (Exim 4.69) (envelope-from ) id 1P7i7s-0005XX-9z for ding@gnus.org; Mon, 18 Oct 2010 00:19:13 -0500 X-Hashcash: 1:20:101018:ding@gnus.org::9U+fHolbg3QEda61:00001WMa X-Face: #kKnN,xUnmKia.'[pp`;Omh}odZK)?7wQSl"4o04=EixTF+V[""w~iNbM9ZL+.b*_CxUmFk B#Fu[*?MZZH@IkN:!"\w%I_zt>[$nm7nQosZ<3eu;B:$Q_:p!',P.c0-_Cy[dz4oIpw0ESA^D*1Lw= L&i*6&( User-Agent: Gnus/5.110011 (No Gnus v0.11) Emacs/24.0.50 (gnu/linux) Cancel-Lock: sha1:sB4no6WY4W83q6r8culKI8Z72oo= X-AntiAbuse: This header was added to track abuse, please include it with any abuse report X-AntiAbuse: Primary Hostname - orlando.hostforweb.net X-AntiAbuse: Original Domain - gnus.org X-AntiAbuse: Originator/Caller UID/GID - [47 12] / [47 12] X-AntiAbuse: Sender Address Domain - jpl.org X-Source: X-Source-Args: X-Source-Dir: X-Spam-Score: -1.9 (-) List-ID: Precedence: bulk Xref: news.gmane.org gmane.emacs.gnus.general:73215 Archived-At: Lars Magne Ingebrigtsen wrote: > The rendering of the HTML here seems suboptimal: > > It's an article that talks about Chinese words, so it's mostly English > text with some Chinese here and there. The spaces have been removed > from the start and end of the Chinese text, which makes it look > awkward. Should the space stripping be altered in some way? I've improved the way to examine whether space is necessary between characters. Now space will not be inserted if the previous character is wide and categorized as kinsoku-bol[1], or both before and behind characters are categorized as nospace[2]. This is better for Japanese and ASCII mixture text, too. :) [1] (aref (char-category-set CHAR) ?>) [2] (aref fill-nospace-between-words-table CHAR)