From: Christopher Creutzig
Newsgroups: gmane.comp.tex.context
Subject: Re: DOC/RTF to ConTeXt via XML
Date: Wed, 28 Sep 2005 10:02:03 +0200
Message-ID: <433A4DFB.2040405@creutzig.de>
References: <4372F02C@webmail.colostate.edu>

Idris Samawi Hamid wrote:
> Ok, you guys have lost me now-) Maybe the best thing to do is try something

Just ignore the detail of what xslt can and can't do for the moment.
That just influences the choice of tools for one particular step, and
we all agree that there are tools for this step.

> it to ConTeXt. From what I gather so far the process goes something like
>
> doc => rtf
> rtf => OO.o
> OO.o => xml

No need for rtf. That would lose lots of information anyway, wouldn't it?

> \startHans
> converting open office xml is not always easy; stay away from tab's and use
> high level constructs as much as possible
> \stopHans

I'm not really sure what Hans meant by this. I assume he does have a
valid point, since so far I have only had a short and theoretical look
at the format, but I can only guess what it is. Hans, could you give an
example or two?

> From this discussion it seems that I (as an xml ignoramous) would be better
> off converting to ConTeXt code rather than processing pure xml blocks (but
> maybe I'm wrong).

XML is much, much easier to parse than just about anything else. That
means that whatever your conversion process uses, you can simply reuse
an XML parser in whatever language you want to use. (Interpreting the
file may be easy or hard, depending on the XML structure at hand.) The
only exception I can see right now would be a rather large and
error-prone “Visual” Basic program to create a sort of export filter
for Word to write ConTeXt. I certainly don't think that's easier.

> Once I get a sane xml file (this seems to be the biggest problem) what is the
> best tool to convert this to ConTeXt?

It depends on who is going to write the conversion. Of the languages
I've used so far, it's probably easiest to do in xslt, but if you are,
or have at hand, a programmer who's good at ruby but would have to
learn xslt first, the whole thing may not be big enough to warrant
learning another language. Unless that programmer wants to, which would
be a very good sign. Learning a new language per year is not really a
bad idea.

> We are all extremely busy, of course, but if anyone finds this interesting I
> can send a sample doc article from my journal. Maybe we can do a MyWay or
> something to document this process for ourselves and others, as well as find

It might be a pretty specific thing, though.
My guess is that you could make more progress by thinking about what
sort of structure you would like to have, rather than looking at what
you have right now.

Christopher
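
The XML-to-ConTeXt step discussed above can be sketched roughly as
follows. This is only an illustration, not the method anyone in the
thread used: it is written in Python rather than xslt or ruby, and the
element names (article, section, title, para, emph) are a hypothetical,
simplified structure, not the real OpenOffice format. The escape table
is deliberately small and does not cover every TeX special character
(tilde and caret are left alone, for instance).

```python
import xml.etree.ElementTree as ET

# Characters that need escaping in TeX input; deliberately incomplete.
TEX_ESCAPES = {
    "\\": r"\letterbackslash{}",
    "%": r"\%", "#": r"\#", "$": r"\$", "&": r"\&",
    "_": r"\_", "{": r"\{", "}": r"\}",
}


def escape(text):
    """Escape TeX special characters; None is treated as empty text."""
    return "".join(TEX_ESCAPES.get(ch, ch) for ch in text or "")


def inline(elem):
    """Render mixed content: plain text plus (hypothetical) <emph> children."""
    parts = [escape(elem.text)]
    for child in elem:
        if child.tag == "emph":
            parts.append("{\\em " + inline(child) + "}")
        else:
            parts.append(inline(child))  # unknown inline tag: keep its text only
        parts.append(escape(child.tail))
    return "".join(parts)


def convert(xml_source):
    """Turn the simplified article XML into a small ConTeXt document."""
    root = ET.fromstring(xml_source)
    out = ["\\starttext"]
    for section in root.iter("section"):
        title = section.find("title")
        if title is not None:
            out.append("\\section{" + inline(title) + "}")
        for para in section.findall("para"):
            out.append("")  # a blank line starts a new paragraph
            out.append(" ".join(inline(para).split()))
    out.append("")
    out.append("\\stoptext")
    return "\n".join(out)


# Hypothetical sample input, made up for this sketch.
SAMPLE = """<article>
  <section>
    <title>Tools &amp; steps</title>
    <para>XML is <emph>much</emph> easier to parse.</para>
  </section>
</article>"""

if __name__ == "__main__":
    print(convert(SAMPLE))
```

The parsing itself is a single library call; nearly all of the code is
about deciding what each element should become, which is why the
structure you want to end up with matters more than the input format.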