From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.text.pandoc/10211 Path: news.gmane.org!not-for-mail From: Daniel Staal Newsgroups: gmane.text.pandoc Subject: Re: Excessive memory usage with --normalize Date: Sun, 29 Jun 2014 23:52:27 -0400 Message-ID: References: <20140630031155.GB16744@localhost.hsd1.ca.comcast.net> Reply-To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org NNTP-Posting-Host: plane.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed X-Trace: ger.gmane.org 1404100355 15806 80.91.229.3 (30 Jun 2014 03:52:35 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 30 Jun 2014 03:52:35 +0000 (UTC) To: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-X-From: pandoc-discuss+bncBCGYLPE23UARB7F5YOOQKGQE3T7OBEQ-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Mon Jun 30 05:52:29 2014 Return-path: Envelope-to: gtp-pandoc-discuss@m.gmane.org Original-Received: from mail-ob0-f185.google.com ([209.85.214.185]) by plane.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1X1Sdp-00082W-Lv for gtp-pandoc-discuss@m.gmane.org; Mon, 30 Jun 2014 05:52:29 +0200 Original-Received: by mail-ob0-f185.google.com with SMTP id wm4sf1626870obc.2 for ; Sun, 29 Jun 2014 20:52:28 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlegroups.com; s=20120806; h=date:from:to:subject:message-id:in-reply-to:references:mime-version :x-original-sender:x-original-authentication-results:reply-to :precedence:mailing-list:list-id:list-post:list-help:list-archive :sender:list-subscribe:list-unsubscribe:content-type :content-disposition; bh=F2/13mACI4ZrRZgVQo3Jzuy1lqq589KBq+SfeuJoSGA=; b=TctdJAVbl4KFMPcxsBUdCXMcSNwIRaxweVxUd6aI8rni+x8j2aJjgiFmLI6cptTTUD Dy+dQFeXDFLEd2vG60dFhP8IA/m6H2nfZWj1uwR5k1RuyIkrMrqw8xbo9gJAa4NWVxtu saRg9Xr8ffAWYoKk4wgoaZt81VYPnnIDQHfBvdrnvnYp7kWCTBwV4bmC2Ikc9ue61bN6 yLEcTh79KhSUp9LALdBC57vcwfPhLgVb4opBq+d0AjPdnkfXRIMZN8GZWMUUBdGFs0un S8kZZtOFX4MXR5ugpQDw5rH5lka2Wi42+wjT6AkyEhjUbTd6VFCAbvyLLjFdyau5jWr6 dWHQ== X-Received: by 10.140.88.103 with SMTP id s94mr194qgd.42.1404100348659; Sun, 29 Jun 2014 20:52:28 -0700 (PDT) X-BeenThere: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org Original-Received: by 10.140.36.71 with SMTP id o65ls908279qgo.4.gmail; Sun, 29 Jun 2014 20:52:28 -0700 (PDT) X-Received: by 10.52.230.71 with SMTP id sw7mr18030516vdc.9.1404100348252; Sun, 29 Jun 2014 20:52:28 -0700 (PDT) Original-Received: from mail.magehandbook.com (173-8-4-45-WashingtonDC.hfc.comcastbusiness.net. [173.8.4.45]) by gmr-mx.google.com with ESMTP id m2si2020934qcr.2.2014.06.29.20.52.28 for ; Sun, 29 Jun 2014 20:52:28 -0700 (PDT) Received-SPF: none (google.com: DStaal-Jdbf3xiKgS8@public.gmane.org does not designate permitted sender hosts) client-ip=173.8.4.45; Original-Received: from [192.168.1.50] (Mac-Pro.magehandbook.com [192.168.1.50]) by mail.magehandbook.com (Postfix) with ESMTP id 3h1vsl4sMQzm9 for ; Sun, 29 Jun 2014 23:52:27 -0400 (EDT) In-Reply-To: <20140630031155.GB16744-bi+AKbBUZKbivNSvqvJHCtPlBySK3R6THiGdP5j34PU@public.gmane.org> X-Mailer: Mulberry/4.0.8 (Mac OS X) X-Original-Sender: dstaal-Jdbf3xiKgS8@public.gmane.org X-Original-Authentication-Results: gmr-mx.google.com; spf=neutral (google.com: DStaal-Jdbf3xiKgS8@public.gmane.org does not designate permitted sender hosts) smtp.mail=DStaal-Jdbf3xiKgS8@public.gmane.org Precedence: list Mailing-list: list pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org; contact pandoc-discuss+owners-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-ID: X-Google-Group-Id: 1007024079513 List-Post: , List-Help: , List-Archive: Original-Sender: pandoc-discuss-/JYPxA39Uh5TLH3MbocFFw@public.gmane.org List-Subscribe: , List-Unsubscribe: , Content-Disposition: inline Xref: news.gmane.org gmane.text.pandoc:10211 Archived-At: --As of June 29, 2014 8:11:55 PM -0700, John MacFarlane is alleged to have said: > I tried pandoc without --normalize and it converted Pride > and Prejudice well. I think you just need to preprocess > the HTML and remove a few crufty bits, like the div at > the end of each chapter with four
tags. --As for the rest, it is mine. Yeah, I was mostly trying different options to see what worked best. I just felt I should report this because it was such an outlier. Daniel T. Staal --------------------------------------------------------------- This email copyright the author. Unless otherwise noted, you are expressly allowed to retransmit, quote, or otherwise use the contents for non-commercial purposes. This copyright will expire 5 years after the author's death, or in 30 years, whichever is longer, unless such a period is in excess of local copyright law. ---------------------------------------------------------------