From: Aditya Mahajan
Newsgroups: gmane.comp.tex.context
Subject: Re: EPUB XHTML Format
Date: Thu, 5 Sep 2013 14:20:30 -0400 (EDT)
To: mailing list for ConTeXt users

On Thu, 5 Sep 2013, honyk wrote:

> On 2013-09-04 Thangalin wrote:
>>
>> What needs to happen to take a minimal ConTeXt file (such as the
>> attached) to produce a minimum viable EPUB that:
>>
> It is always difficult to parse and further process not well structured
> plain text without advanced semantics. Garbage in, garbage out.

The typical ConTeXt document has a lot of structure, and the XML export
generates well-structured XML output.
That can be used directly in most modern browsers, which handle XML+CSS
well. However, most (all?) EPUB readers don't. So the question is whether
ConTeXt could instead generate XHTML.

> If you need both EPUB and PDF, start with a semantically rich XML
> vocabulary, e.g. DocBook. In this case you can relatively easily transform
> (XSLT) input data into almost any format. These basic outputs like EPUB or
> PDF (via XSL-FO) you can get out-of-the-box. The ConTeXt output can be
> generated using dbcontext: http://dblatex.sourceforge.net/
>
> In sum, use XML as your primary source and from it derive everything else.

I haven't used XML-only toolchains. Is it possible to handle:

- Automatic section numbering, taking care of different conversions.
- Automatic index generation and sorting.
- Inserting hyphenation points at the appropriate places in the generated
  output (so that the browser can effectively rely on TeX's hyphenation
  algorithm to do line breaking).
- Converting TeX math to MathML.

The current ConTeXt XML export can translate a well-formed ConTeXt document
into an XML document with the above features.

Aditya
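
For reference, here is a minimal sketch of a test file that exercises the
four features discussed in this message (section numbering, index, hyphenation
points, math). It assumes a ConTeXt MkIV installation; the titles, text, and
index entry are placeholders made up for illustration, and the hyphen key of
\setupexport is an assumption that should be checked against your version.

```tex
% Illustrative test file; titles, text, and index entries are placeholders.
\setupbackend[export=yes]   % ask MkIV to write an XML export alongside the PDF
\setupexport[hyphen=yes]    % assumption: keep TeX's hyphenation points in the export

\starttext

\startchapter[title={First chapter}]

\startsection[title={A numbered section}]
Some text that mentions hyphenation\index{hyphenation} and
an inline formula $a^2 + b^2 = c^2$.

% A display formula, to check the math part of the export.
\startformula
  \int_0^1 x^2 \, dx = \frac{1}{3}
\stopformula
\stopsection

\stopchapter

% Unnumbered title holding the automatically sorted index.
\starttitle[title={Index}]
\placeindex
\stoptitle

\stoptext
```

Running context on this file should write the XML export in addition to the
PDF. The export can then be inspected for the section numbers, the sorted
index, the hyphenation points, and the MathML.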