From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/22644 Path: news.gmane.org!not-for-mail From: Mojca Miklavec Newsgroups: gmane.comp.tex.context Subject: Re: ConTeXt to RTF Conversion Date: Wed, 21 Sep 2005 22:09:32 +0200 Message-ID: <4331BDFC.7070808@gmail.com> References: Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1127333514 5860 80.91.229.2 (21 Sep 2005 20:11:54 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 21 Sep 2005 20:11:54 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Wed Sep 21 22:11:45 2005 Return-path: Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1EIAuZ-00021r-Gl for gctc-ntg-context-518@m.gmane.org; Wed, 21 Sep 2005 22:09:47 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 212341278F; Wed, 21 Sep 2005 22:09:47 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 07673-05-3; Wed, 21 Sep 2005 22:09:43 +0200 (CEST) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 896A9127B0; Wed, 21 Sep 2005 22:09:43 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id C7A6E127B0 for ; Wed, 21 Sep 2005 22:09:41 +0200 (CEST) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 07673-05-2 for ; Wed, 21 Sep 2005 22:09:40 +0200 (CEST) Original-Received: from avs1.arnes.si (avs1.arnes.si [193.2.1.74]) by ronja.ntg.nl (Postfix) with ESMTP id C66361278F for ; Wed, 21 Sep 2005 22:09:40 +0200 (CEST) Original-Received: from localhost (avs1.arnes.si [193.2.1.74]) by avs1.arnes.si (Postfix) with ESMTP id 6EB1D36A017; Wed, 21 Sep 2005 22:09:40 +0200 (CEST) Original-Received: from avs1.arnes.si ([193.2.1.74]) by localhost (avs1.arnes.si [193.2.1.74]) (amavisd-new, port 10024) with ESMTP id 32887-03; Wed, 21 Sep 2005 22:09:40 +0200 (CEST) Original-Received: from [194.249.10.216] (ar20-24i.dial-up.arnes.si [194.249.10.216]) by avs1.arnes.si (Postfix) with ESMTP id DCBA936A00C; Wed, 21 Sep 2005 22:09:34 +0200 (CEST) User-Agent: Mozilla/4.5-4.75 (Windows; U; Windows NT 5.1; sl-SI; rv:1.4) Gecko/20030624 Netscape/7.1 X-Accept-Language: sl, en, en-us, de Original-To: ntg@louspringer.com, mailing list for ConTeXt users In-Reply-To: X-Virus-Scanned: by amavisd-new at arnes.si X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on smtp.ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:22644 Archived-At: Louis F.Springer wrote: > What are the options for conversion from ConTeXt to other formats, if > any? I'm particularly interested in rtf and/or html. The main problem is: PDF is extremely complex format (and Hans tries to explore almost every single capability of it) and you can't grant a good conversion for obscure documents. You can't get html out of the box, but there are some options depending on what your documents look like, what effort are you ready to invest and what quality of HTML/RTF you expect. (I can't imagine a tool which would satisfactory convert the ConTeXt manuals into HTML without manual intervention.) 1. The best quality can be achieved if you prepare all your stuff in XML and then write both a stylesheet for conversion into HTML and "a couple" of ConTeXt definitions to handle formating for output in PDF documents. I never did that (I consider it too complex and time consuming), but if you're ready to go that way and sacrifice some time, there are some people on the list who can help you. 2. exTeX is going to natively support HTML output. The question is when a stable version is going to appear and if ConTeXt will ever support it / if they will support ConTeXt. (not a satisfactory answer yet) 3. latex2html, latex2word, tth - like No, no such tool yet and I doubt that anyone is going to write it soon. 4. PDF -> HTML / PDF -> RTF conversion Currently the best possibility if you have simple documents, you don't want to spend too much time on it and you don't mind too much about high quality of the produced HTML/RTF. I often use pdftotext and then manually reformat everything (not for my own documents however), but I was impressed by the quality of what ABBYY PDF Transformer was able to do with sample documents that I saw (it converts a table in PDF into a table in Word, preserves images and page layout, ...). About the accents that Idris mentioned: As long as the accented characters aren't faked, there is a way to get them out of PDF. Mojca