ntg-context - mailing list for ConTeXt users
 help / color / mirror / Atom feed
From: Wolfgang Schuster <schuster.wolfgang@googlemail.com>
To: mailing list for ConTeXt users <ntg-context@ntg.nl>
Subject: Re: question for the xml-experts
Date: Sat, 14 Feb 2009 19:25:22 +0100	[thread overview]
Message-ID: <BCB8606A-2172-4BC3-BDEF-FB3BBC97B7D3@gmail.com> (raw)
In-Reply-To: <4C416126-1F10-4206-BD3F-9377AC7C81CC@uni-bonn.de>

Hi Thomas,

why don't you take a look at the OpenOffice export function, I saw it's
possible to convert a document to xhtml and this could be a start for  
you.

Wolfgang

Am 14.02.2009 um 18:40 schrieb Thomas A. Schmitz:

> Hi all,
>
> this is not a question about direct technical details, but more of a  
> conceptual problem, and I would love to have your input and ideas on  
> this. I will be editing several edited volumes in my field  
> (humanities, classics). From experience, I know that it's impossible  
> to make scholars in the humanities adhere to standards. Each and  
> every one of them will turn in a paper (most of them written in half  
> a dozen different versions of Word) with its own idiosyncracies. At  
> my last conference, I asked them to please use Unicode for their  
> Greek passages, and I got blank looks and the question "What the  
> hell is Unicode?"
>
> So: I want to extract the content of these papers and process it  
> with ConTeXt. I thought the easiest route might be convert them to  
> OpenOffice odt and then use the content.xml as a starting point.  
> Since the formatting will be unusable anyways, it doesn't make sense  
> to process the odt directly; instead, I want to transform the xml  
> via xslt to a simplified format and then process that with ConTeXt.  
> I have just discovered the tool xalan ( http://xml.apache.org/xalan-c/index.html 
>  ) which allows me to use an xslt style sheet and direct the output  
> to a new file. I will then need to clean up these xml files and  
> write a mkiv xml setup for them.
>
> So for those who know much more about this sort of workflow: does  
> that make sense? Is there any better way to achieve these results,  
> i.e., have the content of a couple of papers in Word and/or rtf  
> format and typeset it in a consistent ConTeXt environment? Is there  
> any tool better than xslt to convert the OpenOffice xml than xslt  
> (anything in lua that can parse xml)? Anything better than xalan to  
> convert xm -> xml? I'm just beginning to plan this, so I'd be most  
> grateful for any pointers.
>
> Thanks for reading this long message, all best
>
> Thomas
___________________________________________________________________________________
If your question is of interest to others as well, please add an entry to the Wiki!

maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context
webpage  : http://www.pragma-ade.nl / http://tex.aanhet.net
archive  : https://foundry.supelec.fr/projects/contextrev/
wiki     : http://contextgarden.net
___________________________________________________________________________________


  reply	other threads:[~2009-02-14 18:25 UTC|newest]

Thread overview: 17+ messages / expand[flat|nested]  mbox.gz  Atom feed  top
2009-02-14 17:40 Thomas A. Schmitz
2009-02-14 18:25 ` Wolfgang Schuster [this message]
2009-02-14 18:37   ` Thomas A. Schmitz
2009-02-15  9:39   ` luigi scarso
2009-02-15 17:17     ` Thomas A. Schmitz
2009-02-17 22:07       ` luigi scarso
2009-02-19  8:54         ` Thomas A. Schmitz
2009-02-19  9:24           ` luigi scarso
2009-02-19 10:39           ` luigi scarso
2009-02-19 11:53             ` Thomas A. Schmitz
2009-02-19 14:10               ` luigi scarso
2009-02-20 15:09                 ` Thomas A. Schmitz
2009-02-20 15:35                   ` luigi scarso
2009-02-19 17:02           ` luigi scarso
2009-02-14 18:31 ` Patrick Gundlach
2009-02-14 19:06   ` Thomas A. Schmitz
2009-02-15 10:14 ` Khaled Hosny

Reply instructions:

You may reply publicly to this message via plain-text email
using any one of the following methods:

* Save the following mbox file, import it into your mail client,
  and reply-to-all from there: mbox

  Avoid top-posting and favor interleaved quoting:
  https://en.wikipedia.org/wiki/Posting_style#Interleaved_style

* Reply using the --to, --cc, and --in-reply-to
  switches of git-send-email(1):

  git send-email \
    --in-reply-to=BCB8606A-2172-4BC3-BDEF-FB3BBC97B7D3@gmail.com \
    --to=schuster.wolfgang@googlemail.com \
    --cc=ntg-context@ntg.nl \
    /path/to/YOUR_REPLY

  https://kernel.org/pub/software/scm/git/docs/git-send-email.html

* If your mail client supports setting the In-Reply-To header
  via mailto: links, try the mailto: link
Be sure your reply has a Subject: header at the top and a blank line before the message body.
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).