From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/28609 Path: news.gmane.org!not-for-mail From: "John R. Culleton" Newsgroups: gmane.comp.tex.context Subject: Re: Ugly hack for multiple MSWord docs. Date: Thu, 15 Jun 2006 18:46:56 -0400 Organization: WexfordPress Message-ID: <200606151846.56818.john@wexfordpress.com> References: <200606131829.58862.john@wexfordpress.com> <200606151435.11543.john@wexfordpress.com> <44919F2B.3070202@wxs.nl> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1150409785 29557 80.91.229.2 (15 Jun 2006 22:16:25 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Thu, 15 Jun 2006 22:16:25 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Fri Jun 16 00:16:20 2006 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1Fr08O-0005pG-LM for gctc-ntg-context-518@m.gmane.org; Fri, 16 Jun 2006 00:16:16 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id AC8CC12784; Fri, 16 Jun 2006 00:16:16 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 00417-03; Fri, 16 Jun 2006 00:16:16 +0200 (CEST) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id AF309127A7; Thu, 15 Jun 2006 23:51:04 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 13328127A7 for ; Thu, 15 Jun 2006 23:51:03 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 30476-05 for ; Thu, 15 Jun 2006 23:51:00 +0200 (CEST) Original-Received: from mta10.adelphia.net (mta10.adelphia.net [68.168.78.202]) by ronja.ntg.nl (Postfix) with SMTP id 05AD012792 for ; Thu, 15 Jun 2006 23:50:59 +0200 (CEST) Original-Received: from 69-174-128-193.frdrmd.adelphia.net ([69.174.128.193]) by mta10.adelphia.net (InterMail vM.6.01.05.02 201-2131-123-102-20050715) with ESMTP id <20060615215058.ODOG12693.mta10.adelphia.net@69-174-128-193.frdrmd.adelphia.net> for ; Thu, 15 Jun 2006 17:50:58 -0400 Original-To: mailing list for ConTeXt users User-Agent: KMail/1.9.1 In-Reply-To: <44919F2B.3070202@wxs.nl> Content-Disposition: inline X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.7 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:28609 Archived-At: On Thursday 15 June 2006 13:55, Hans Hagen wrote: > John R. Culleton wrote: > > On Thursday 15 June 2006 08:50, Hans Hagen wrote: > >> John R. Culleton wrote: > >>> Someday there will be an elegant solution to the MSWord to > >>> Context problem. For now there is my ugly hack as described here. > >> > >> maybe the word xml output, since that can be parsed > >> > >> Hans > > > > Interesting suggestion. I don't have a copy of MSWord. And my > > clients are naive so that asking them to save in exotic formats > > is likely to be unproductive. > > > > Open Office does not save as xml. Abiword, however does. In a > > hm, open offices uses xml as storage format, just save in oo format and > unzip the file and you will end up with xml files > > (however, the xml is typical office xml, complete with tab elements that > spoil the idea) The abiword xml is neat and parsimonious thus: ------------------------------------------------------------------
Now is the time for all good men.
------------------------------------------------ The Open Office file unzipped is a lot more verbose and a lot less readable. There are five files in fact. The file content.xml will in fact compile correctly via texexec and yield the expected result. The character count in that file alone is three times that of the corresponding Abiword xml output shown above. The experiments continue... -- John Culleton Books with answers to marketing and publishing questions: http://wexfordpress.com/tex/shortlist.pdf Book coaches, consultants and packagers: http://wexfordpress.com/tex/packagers.pdf