From: Hans Hagen
Newsgroups: gmane.comp.tex.context
Subject: Re: EPUB XHTML Format
Date: Wed, 04 Sep 2013 11:20:54 +0200
Message-ID: <5226FB76.8030602@wxs.nl>
Reply-To: mailing list for ConTeXt users
To: ntg-context@ntg.nl

On 9/4/2013 3:19 AM, Thangalin wrote:
> Hi,
>
> The attached t.tex file produces the attached t.xhtml file.
> I have looked at the following documents:
>
>   * http://en.wikipedia.org/wiki/EPUB#Open_Publication_Structure_2.0..1
>   * http://en.wikipedia.org/wiki/DTBook
>   * http://www.idpf.org/epub/20/spec/OPS_2.0.1_draft.htm
>   * http://www.w3.org/TR/xhtml11/doctype.html
>   * http://www.w3.org/TR/html5/sections.html
>
> It seems that the macros in t.tex are being written out as XML elements,
> verbatim. It is my understanding that these XML elements, however, do
> not conform to the minimal content models associated with XHTML 1.1.

you get a representation in xml indeed, not verbatim but as close as
possible to the generic (parent) structure elements in context

of course we could alternatively export all as
but i don't like that too much; html itself is not rich enough for our
purpose

> What needs to happen to take a minimal ConTeXt file (such as the
> attached) to produce a minimum viable EPUB that:
>
>   * Generates XHTML headers (including )

not needed as we're 'standalone'

>   * Produces images as img tags, rather than float tags.

the css can deal with them (info is written to files for that)

the only really problematic thing is hyperlinks, as css has no provision
for that, so there's an option to inject ...

>   * Uses typical XHTML tags for elements (e.g.,
>     for ordered lists).

xhtml has no typical tags .. it's xml + css (or xslt) ... unfortunately
browsers have messed up html so much (extensions, too tolerant support
for unmatched tags, different rendering models) that xhtml never really
took off

the export of context is in fact just xml, and by tagging it as xhtml we
can apply css to it; but if someone has a workflow for producing epub,
an option is to postprocess that xml file into whatever epub one wants
(i.e. the export is generic and carries as much info as possible)

> Ideally, I would like to do something such as:
>
>   * context t.tex
>   * mtxrun --script epub --make t.specification
>
> to generate an EPUB that passes validation of epubcheck
> , with an output XHTML
> file that more closely matches the XHTML specification.

Every time we look into epub there's another issue ... it's not a
standard but a reverse-engineered application mess (happens often with
xml: turn some application data structures into xml and call it a
standard)

I only tested (long ago already) with some firefox plugin (i don't have
a recent epub device, only an old first generation one which is dead
slow, never really used, probably broken by now) and i refuse to buy a
new one till resolution is decent (and i only want generic devices, not
something bound to some shop)

> How can I help?

by testing

as i have no real use/demand for epub it's not something i look into on
a daily basis

Hans

-----------------------------------------------------------------
Hans Hagen | PRAGMA ADE
Ridderstraat 27 | 8061 GH Hasselt | The Netherlands
tel: 038 477 53 69 | voip: 087 875 68 74 | www.pragma-ade.com
                                         | www.pragma-pod.nl
-----------------------------------------------------------------
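A minimal sketch of the postprocessing route mentioned in the reply (take the generic export XML and turn it into whatever XHTML an EPUB workflow wants), assuming Python and its standard `xml.etree.ElementTree` module. The element names in `TAG_MAP`, the sample input, and the `convert` helper are hypothetical stand-ins for illustration; they are not taken from the actual ConTeXt export, and the result is not claimed to pass epubcheck.

```python
import xml.etree.ElementTree as ET

XHTML_NS = "http://www.w3.org/1999/xhtml"

# Hypothetical mapping from export element names to XHTML element names.
# Adjust it to whatever names the real export file contains.
TAG_MAP = {
    "document": "body",
    "itemgroup": "ol",
    "item": "li",
    "paragraph": "p",
}


def local_name(tag):
    """Strip a '{namespace}' prefix from an ElementTree tag, if present."""
    return tag.rsplit("}", 1)[-1]


def to_xhtml(element):
    """Return an XHTML copy of element.

    Unmapped elements become <div class="original-name"> so the structure
    stays visible to CSS. Attributes of the source elements are dropped in
    this sketch.
    """
    name = local_name(element.tag)
    target = TAG_MAP.get(name)
    if target is None:
        converted = ET.Element("{%s}div" % XHTML_NS, {"class": name})
    else:
        converted = ET.Element("{%s}%s" % (XHTML_NS, target))
    converted.text = element.text
    converted.tail = element.tail
    for child in element:
        converted.append(to_xhtml(child))
    return converted


def convert(source, title="untitled"):
    """Wrap the converted tree in <html><head><title> and serialize it."""
    ET.register_namespace("", XHTML_NS)  # serialize XHTML as default namespace
    html = ET.Element("{%s}html" % XHTML_NS)
    head = ET.SubElement(html, "{%s}head" % XHTML_NS)
    ET.SubElement(head, "{%s}title" % XHTML_NS).text = title
    html.append(to_xhtml(ET.fromstring(source)))
    return ET.tostring(html, encoding="unicode")


if __name__ == "__main__":
    # Hypothetical input, not real export output.
    sample = (
        "<document><section><itemgroup>"
        "<item>one</item><item>two</item>"
        "</itemgroup></section></document>"
    )
    print(convert(sample, title="t"))
```

Keeping the original element name as a `class` on the fallback `div` means no structural information is thrown away: anything the mapping does not cover can still be styled with CSS or handled in a later pass.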