From: Henning Hraban Ramm
Newsgroups: gmane.comp.tex.context
Subject: Re: Microsoft Word -> Context
Date: Tue, 3 Apr 2007 23:26:08 +0200
Message-ID: <16A563D8-4C4F-4E22-ABA7-5E17DF67BB5F@fiee.net>
References: <627014675.20070402204746@gmail.com> <4612002D.9040501@iki.fi>
To: mailing list for ConTeXt users

On 2007-04-03 at 09:20, Mari Voipio wrote:

> Note! If your files contain graphics, for ConTeXt you have to ask
> people to send them in separately as pdf, png or jpg (instead of
> putting them inline in the Word file). I have found *this* hard to
> achieve once in a while and I still often spend substantial time
> chasing down originals of graphics I get in Word files.

A good way is to save the docs as OpenOffice docs, unzip them and collect the images from their folder. But pictures in Word documents are crap anyway, most of the time.

For my main project at work (a city magazine, typeset with InDesign) I got everything as Word docs until a few issues ago. After struggling with useless text formatting (hyperlinks! blech!) we copy-pasted only the plain text and redid the formatting manually.

Now I have written an editorial system as a web application, where the authors have to fill in fixed text boxes (title, intro, text, infos, author etc.).
If everything's ready, I pull the whole lot from the database and apply formatting (InDesign tagged text, but it could be anything) to ease the layout work. Event timetable data works similarly, but via XML. (Why? InDesign can place images with XML, but not with TaggedText, and we need some icons in the calendar. We could use XML for everything, but InDesign is much faster with TaggedText.)

Of course that's no solution for most Word-to-ConTeXt cases, just a side note...

And BTW: I really like InDesign as a layout app, but its text handling (regarding XML or TaggedText import) is horrible! (Badly coded: it doesn't understand different line endings or different text encodings, only incomplete UTF-16 without BOM and predeclared Win or Mac line endings... XML is always whitespace-sensitive...)

Enough OT.

> [I've found that generally my fellow office workers don't want to deal
> with *anything* like this, but professional translators have no
> problems with ConTeXt code; and anybody with html-by-hand experience
> usually gets the drift very fast.]

Unfortunately even my HTML-coding colleagues fear the command line. And providing GUIs for my nice automation scripts (e.g. a CD cover generator with ConTeXt) is tedious...

> For example about now I have to start writing a product manual where
> some parts of text come from an old Word file. I'll probably just cut
> and paste what I need from the pdf file, but it's still faster than
> fighting with Word over the original 9 MB (!) doc - and consistency
> can be guaranteed, unlike if I used Word, because the old file is done
> with Word95 and 97 and we now use Word 2003 where the list functions
> and styles work slightly differently and don't open quite as they used
> to be.

Yup, I get a lot of crashes if the Word versions don't match. I then use TextEdit.app to extract the text, but (as with most other Word converters) you still have to clean up the hyperlink and versioning crap afterwards.

Greetlings from Lake Constance!
Hraban
---
http://www.fiee.net/texnique/
http://wiki.contextgarden.net
https://www.cacert.org (I'm an assurer)
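
As a rough illustration of the "save as OpenOffice, unzip, collect the images" trick mentioned above, here is a minimal Python sketch. It assumes the Word file has already been saved as an OpenDocument file (.odt), which is an ordinary zip archive, and that the embedded images sit in its "Pictures/" folder, as OpenOffice normally stores them. The function name, the suffix list and the file names in the usage note are made up for the example; nothing here comes from the original workflow.

```python
import zipfile
from pathlib import Path

# Assumption: these are the image types we care about; extend as needed.
IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg", ".gif", ".svg", ".wmf", ".emf"}


def extract_images(odt_path, out_dir):
    """Copy every image found under Pictures/ in an .odt file into out_dir.

    Returns the list of paths that were written.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    saved = []
    with zipfile.ZipFile(odt_path) as archive:
        for name in archive.namelist():
            member = Path(name)
            # Skip everything outside Pictures/ (text, styles, thumbnails)
            # and anything that does not look like an image file.
            if not name.startswith("Pictures/"):
                continue
            if member.suffix.lower() not in IMAGE_SUFFIXES:
                continue
            target = out / member.name
            target.write_bytes(archive.read(name))
            saved.append(target)
    return saved
```

Calling extract_images("article.odt", "images") (both names hypothetical) would leave the pictures in an images/ folder next to the script. Whether they are good enough for print is another matter, and vector formats such as WMF/EMF still need converting before ConTeXt can use them.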