From: Brooks Moses
Newsgroups: gmane.comp.tex.context
Subject: Re: ConTeXt to XML?
Date: Tue, 19 Jul 2005 12:57:43 -0700
Message-ID: <4.3.1.2.20050719120930.01d1c628@cits1.stanford.edu>
Reply-To: mailing list for ConTeXt users

At 09:18 AM 7/19/2005, Elena Fraboschi wrote:
>So, I have been delving into ConTeXt, and I like its syntax: far
>"cleaner" than LaTeX.
>I have also read that there is work in progress
>to convert XML to ConTeXt --- my question is, any thoughts, hints,
>recommendations about reverse-engineering, that is, from ConTeXt to
>XML? If the IUMJ switched production to ConTeXt, we would still want
>to preserve an "archival" copy in XML.

I'm reminded of a presentation that Kaveh Bazargan (from River Valley
Technologies) gave at the PracticalTeX 2004 conference on a LaTeX/XML
process they use, but unfortunately he hasn't provided any paper to go
with his presentation. One thing I specifically remember, though, is
that, since they were also archiving XML, their process converted the
authors' LaTeX to XML (with the mathematics in MathML, rather than as
embedded LaTeX as I know some publishers do), and then converted that
XML back to LaTeX and thence to PDF (or PostScript, possibly) for
actual production -- thus guaranteeing that the archival XML would in
fact reproduce the production versions exactly.

With a workflow like that, I'm not certain how much conversion from
ConTeXt to XML will be necessary -- you'll probably, at least for the
time being, still be getting author submissions in LaTeX. You could
then convert that to XML using a version of your present process,
adjust the XML as needed to fit your standards, archive the XML, and
then feed the XML into an automated process that converts it to
ConTeXt and thence to PDF, HTML, and whatever else you need.

The only requirement, then, is that the XML -> ConTeXt -> PDF workflow
not involve any manual adjustments to the ConTeXt code -- all manual
changes would need to be made in the XML. (That may be a good idea
anyway, as it guarantees that the XML is always the "true" version.)

>(As of now we translate LaTeX to XML using "hermes", but it would
>not work with ConTeXt.) If nothing is contemplated in the
>area of *from ConTeXt to XML*, might this strategy work
>
>ConTeXt -> pdf - pdf to XML ?
I suspect that, insofar as it worked, you'd probably lose most of the
metadata (this is a section header, this is a subsection header, etc.)
unless it was a PDF to XML converter that was very specific to your
particular PDF files.

...

On a different note, it may be worth pointing out that ConTeXt's
support for typesetting of complicated equations -- that is, the sort
of stuff for which one really wants the AMSmath package in LaTeX --
really isn't especially great; it's pretty much limited to the
capabilities of Plain TeX. I'm currently working (very slowly) on
trying to improve this by doing a port of AMSmath to ConTeXt, but I
have no idea how compatible that will be with ConTeXt's MathML/XML
support, nor when I'd have it to a point where it would be ready for
production work.

The other alternative I know of for doing math in XML is embedding
bits of LaTeX code within the XML. Since most "conversion to non-PDF
formats" involves converting the math to bitmap images anyway (or, at
least, any conversion to HTML for the web does!), I don't think that's
really complicating things much. It's certainly possible in theory to
embed LaTeX-coded equations in ConTeXt, and there's a small bit of
comment on this on the Wiki. In practice, it may take a bit of coding
to make work, but the amount won't be excessive.

...

And, on yet a third note, you might find it useful to talk to Steve
Grathwohl at the Duke University Press -- he was also at the
PracticalTeX 2004 conference, and had clearly "caught the ConTeXt
bug", and I suspect he may well have some useful practical experience
in this area.

- Brooks
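
P.S. To make the "automated XML -> ConTeXt" step above a bit more
concrete, here is a rough sketch in Python of what such a converter
might look like when the math is embedded as TeX code in the XML. It
is only an illustration under assumptions of my own: the element names
(article, section, title, p, math, formula) are hypothetical and are
not the IUMJ's schema, hermes output, or River Valley's process, and
the escaping covers only a handful of TeX special characters.

```python
import xml.etree.ElementTree as ET

# TeX special characters that this sketch escapes in running text.
# Backslash, tilde and caret are deliberately NOT handled here.
TEX_ESCAPES = {"%": r"\%", "#": r"\#", "&": r"\&", "$": r"\$",
               "_": r"\_", "{": r"\{", "}": r"\}"}


def tex_escape(text):
    """Escape a few TeX special characters, one character at a time."""
    return "".join(TEX_ESCAPES.get(ch, ch) for ch in text)


def convert_paragraph(par):
    """Turn a <p> element (hypothetical schema) into one ConTeXt paragraph."""
    parts = [tex_escape(par.text or "")]
    for child in par:
        if child.tag == "math":
            # Embedded TeX math is passed through untouched, as inline math.
            parts.append("$" + (child.text or "").strip() + "$")
        else:
            # Unknown inline markup: keep only its text content.
            parts.append(tex_escape("".join(child.itertext())))
        parts.append(tex_escape(child.tail or ""))
    return "".join(parts).strip()


def xml_to_context(xml_source):
    """Convert the hypothetical <article> XML into ConTeXt source text."""
    root = ET.fromstring(xml_source)
    lines = [r"\starttext", ""]
    for section in root.iter("section"):
        title = section.findtext("title", default="")
        lines += [r"\section{" + tex_escape(title) + "}", ""]
        for child in section:
            if child.tag == "p":
                lines += [convert_paragraph(child), ""]
            elif child.tag == "formula":
                # Display math, again passed through as raw TeX.
                lines += [r"\startformula", (child.text or "").strip(),
                          r"\stopformula", ""]
    lines.append(r"\stoptext")
    return "\n".join(lines)


SAMPLE = r"""<article>
  <section>
    <title>Energy &amp; mass</title>
    <p>Einstein showed that <math>E = mc^2</math> holds 100% of the time.</p>
    <formula>\int_0^1 x^2 \, dx = \frac{1}{3}</formula>
  </section>
</article>"""

if __name__ == "__main__":
    print(xml_to_context(SAMPLE))
```

The point of the design is that the generated ConTeXt file is a
throwaway product: nobody edits it by hand, so any correction goes
into the XML and the converter is simply run again.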