From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/65352 Path: news.gmane.org!not-for-mail From: Cecil Westerhof Newsgroups: gmane.comp.tex.context Subject: Re: Translating PDF-files Date: Wed, 19 Jan 2011 15:35:42 +0100 Message-ID: References: Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============0108630803==" X-Trace: dough.gmane.org 1295447790 8715 80.91.229.12 (19 Jan 2011 14:36:30 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 19 Jan 2011 14:36:30 +0000 (UTC) To: mailing list for ConTeXt users Original-X-From: ntg-context-bounces@ntg.nl Wed Jan 19 15:36:26 2011 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from balder.ntg.nl ([195.12.62.10]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PfZ91-0003G2-H8 for gctc-ntg-context-518@m.gmane.org; Wed, 19 Jan 2011 15:36:19 +0100 Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id A3643CAA11; Wed, 19 Jan 2011 15:36:18 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id I+nPzoqVuoZp; Wed, 19 Jan 2011 15:36:16 +0100 (CET) Original-Received: from balder.ntg.nl (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 0F97BCA9EE; Wed, 19 Jan 2011 15:36:16 +0100 (CET) Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 7651ECA9EE for ; Wed, 19 Jan 2011 15:36:14 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id dRWezoEDRrAo for ; Wed, 19 Jan 2011 15:36:03 +0100 (CET) Original-Received: from filter2-ams.mf.surf.net (filter2-ams.mf.surf.net [192.87.102.70]) by balder.ntg.nl (Postfix) with ESMTP id 3160ACA9BC for ; Wed, 19 Jan 2011 15:36:03 +0100 (CET) Original-Received: from mail-fx0-f41.google.com (mail-fx0-f41.google.com [209.85.161.41]) by filter2-ams.mf.surf.net (8.14.3/8.14.3/Debian-5+lenny1) with ESMTP id p0JEa2AI023377 for ; Wed, 19 Jan 2011 15:36:02 +0100 Original-Received: by fxm12 with SMTP id 12so925942fxm.14 for ; Wed, 19 Jan 2011 06:36:02 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=zn4fpCdJm5EhPn9ljxhrb32gUR3toGxkAdEF4Ps2Dss=; b=V7T/+qulbRcx2OCccOPH0+pqBrgJJsC+NdPlYouc/tx09y3qN2rZTHC/YfSYHpg7E1 MWOdzyczd2KldiI1CiwkR0BKxpVt/7nqmk+ngL+E8Yy3gCUI2Nn0RpVQpjiZ7FTrHq9A 6Bscfe/kPMs+htbrYYS7EVQyNvGVxRme1W/nQ= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=YjZtD8VqtRUMiluTdCJfAkIEBz1VYDR3dbEhluFX/E9EYh5FZAC4EA5Y1P7fq4zYkt l8n7VjTm9fNuubLbZCzG7TljG1bzroeIo1kJSk9UTec/Hhj1dszIvCNApO5fvx/UzrAS dInvkfEGg/Nmp5aaaxgQTMcmb1pCmS0/x0Dbw= Original-Received: by 10.223.103.12 with SMTP id i12mr791854fao.43.1295447742995; Wed, 19 Jan 2011 06:35:42 -0800 (PST) Original-Received: by 10.223.112.19 with HTTP; Wed, 19 Jan 2011 06:35:42 -0800 (PST) In-Reply-To: X-Bayes-Prob: 0.0001 (Score 0, tokens from: @@RPTN) X-CanIt-Geo: ip=209.85.161.41; country=US; region=CA; city=Mountain View; postalcode=94043; latitude=37.4192; longitude=-122.0574; metrocode=807; areacode=650; http://maps.google.com/maps?q=37.4192,-122.0574&z=6 X-CanItPRO-Stream: uu:ntg-context@ntg.nl (inherits from uu:default, base:default) X-Canit-Stats-ID: 0rDVqA2pG - c7267bc730a1 - 20110119 X-Scanned-By: CanIt (www . roaringpenguin . com) on 192.87.102.70 X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.12 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl Xref: news.gmane.org gmane.comp.tex.context:65352 Archived-At: --===============0108630803== Content-Type: multipart/alternative; boundary=20cf3054a5478bdf96049a33ee1f --20cf3054a5478bdf96049a33ee1f Content-Type: text/plain; charset=ISO-8859-1 2011/1/19 luigi scarso > On Wed, Jan 19, 2011 at 3:14 PM, Cecil Westerhof > wrote: > > Already done. What looked the most promissing was pdftohtml. Just > wondering > > if there is a better way. > What y do you want exactly ? > Preserve structure ? formulas ? layout ? > As far as these informations are not embedded (tagged) into the pdf > you have to do (a lot of) manual work . > My contact 'just' wants to translate the document. I already told him that this is easier said than done. But he is adamant. (Notwithstanding that several people already gave up on his quest.) I think structure and layout should be maintained. But I think it will be mostly 'simple' documents with text and some graphics. So I do not expect to have formula trouble. Also google for pdfdraw mupdf > I will do that. -- Cecil Westerhof --20cf3054a5478bdf96049a33ee1f Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
2011/1/19 luigi scarso <luigi.scarso@gmail.com>
On Wed, Jan 19, 2011 at 3:14 PM, Cecil Westerhof <cldwesterhof@gmail.com> wrote:=
> Already done. What looked the most promissing was pdftohtml. Just wond= ering
> if there is a better way.
What y do you want exactly ?
Preserve structure ? formulas ? layout ?
As far as these informations are not embedded (tagged) into the pdf
you have to =A0do (a lot of) manual work .

My cont= act 'just' wants to translate the document. I already told him that= this is easier said than done. But he is adamant. (Notwithstanding that se= veral people already gave up on his quest.) I think structure and layout sh= ould be maintained. But I think it will be mostly 'simple' document= s with text and some graphics. So I do not expect to have formula trouble.<= br> =A0

Also google for pdfdraw mupdf

I will do that.
<= /div>

--
Cecil Westerhof
--20cf3054a5478bdf96049a33ee1f-- --===============0108630803== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : http://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________ --===============0108630803==--