ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
From: "John R. Culleton" <john@wexfordpress.com>
Subject: Re: Ugly hack for multiple MSWord docs.
Date: Thu, 15 Jun 2006 18:46:56 -0400	[thread overview]
Message-ID: <200606151846.56818.john@wexfordpress.com> (raw)
In-Reply-To: <44919F2B.3070202@wxs.nl>

On Thursday 15 June 2006 13:55, Hans Hagen wrote:
> John R. Culleton wrote:
> > On Thursday 15 June 2006 08:50, Hans Hagen wrote:
> >> John R. Culleton wrote:
> >>> Someday there will be an elegant solution to the MSWord to
> >>> Context problem. For now there is my ugly hack as described here.
> >>
> >> maybe the word xml output, since that can be parsed
> >>
> >> Hans
> >
> > Interesting suggestion. I don't have a copy of MSWord. And my
> > clients are naive so that asking them to save in exotic formats
> > is likely to be unproductive.
> >
> > Open Office does not save as xml. Abiword, however does. In a
>
> hm, open offices uses xml as storage format, just save in oo format and
> unzip the file and you will end up with xml files
>
> (however, the xml is typical office xml, complete with tab elements that
> spoil the idea)

The abiword xml is neat and parsimonious thus:

------------------------------------------------------------------

<!DOCTYPE book PUBLIC "-//OASIS//DTD DocBook XML V4.2//EN"
	"http://www.oasis-open.org/docbook/xml/4.2/docbookx.dtd">

<book>
<!-- 
================================================================================ 
-->
<!-- This DocBook file was created by AbiWord.										-->
<!-- AbiWord is a free, Open Source word processor.								   -->
<!-- You may obtain more information about AbiWord at www.abisource.com			   
-->
<!-- 
================================================================================ 
-->


	<chapter>
		<title></title>
		<section role="unnumbered">
			<title></title>
			<para>Now is the time for all good men.</para>
		</section>
	</chapter>
</book>
------------------------------------------------

The Open Office file unzipped is a lot more verbose and  a lot
less readable. There are five files in fact. The file content.xml
will in fact compile correctly via texexec and yield the expected
result. The character count in that file alone is three times
that of the corresponding Abiword xml output shown above.  

The experiments continue...
-- 
John Culleton
Books with answers to marketing and publishing questions:
http://wexfordpress.com/tex/shortlist.pdf

Book coaches, consultants and packagers:
http://wexfordpress.com/tex/packagers.pdf

  reply	other threads:[~2006-06-15 22:46 UTC|newest]

Thread overview: 7+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2006-06-13 22:29 John R. Culleton
2006-06-15 12:50 ` Hans Hagen
2006-06-15 18:35   ` John R. Culleton
2006-06-15 17:55     ` Hans Hagen
2006-06-15 22:46       ` John R. Culleton [this message]
2006-06-19  7:44         ` luigi scarso
2006-06-15 16:45 ` Bob Kerstetter

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=200606151846.56818.john@wexfordpress.com \
    --to=john@wexfordpress.com \
    --cc=ntg-context@ntg.nl \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).