From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/65349 Path: news.gmane.org!not-for-mail From: Cecil Westerhof Newsgroups: gmane.comp.tex.context Subject: Re: Translating PDF-files Date: Wed, 19 Jan 2011 15:14:10 +0100 Message-ID: References: Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 Content-Type: multipart/mixed; boundary="===============1603030346==" X-Trace: dough.gmane.org 1295446473 1314 80.91.229.12 (19 Jan 2011 14:14:33 GMT) X-Complaints-To: usenet@dough.gmane.org NNTP-Posting-Date: Wed, 19 Jan 2011 14:14:33 +0000 (UTC) To: mailing list for ConTeXt users Original-X-From: ntg-context-bounces@ntg.nl Wed Jan 19 15:14:29 2011 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from balder.ntg.nl ([195.12.62.10]) by lo.gmane.org with esmtp (Exim 4.69) (envelope-from ) id 1PfYnr-0005UK-U4 for gctc-ntg-context-518@m.gmane.org; Wed, 19 Jan 2011 15:14:27 +0100 Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 296FACA9C1; Wed, 19 Jan 2011 15:14:27 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id dpmojNAYaehH; Wed, 19 Jan 2011 15:14:24 +0100 (CET) Original-Received: from balder.ntg.nl (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 6FCCCCA9EE; Wed, 19 Jan 2011 15:14:24 +0100 (CET) Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 944CECA9EE for ; Wed, 19 Jan 2011 15:14:23 +0100 (CET) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id ajxXkgmGKVaz for ; Wed, 19 Jan 2011 15:14:12 +0100 (CET) Original-Received: from filter2-til.mf.surf.net (filter2-til.mf.surf.net [194.171.167.218]) by balder.ntg.nl (Postfix) with ESMTP id 90F18CA9C1 for ; Wed, 19 Jan 2011 15:14:12 +0100 (CET) Original-Received: from mail-fx0-f41.google.com (mail-fx0-f41.google.com [209.85.161.41]) by filter2-til.mf.surf.net (8.14.3/8.14.3/Debian-5+lenny1) with ESMTP id p0JEEBGd028056 for ; Wed, 19 Jan 2011 15:14:12 +0100 Original-Received: by fxm12 with SMTP id 12so903471fxm.14 for ; Wed, 19 Jan 2011 06:14:11 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:in-reply-to:references:date :message-id:subject:from:to:content-type; bh=utBrMJM2y+yCZptegAEB6vqeBNPAB3T9CQFIXAcVXGA=; b=JRivIJs3wfmPvV23sds0t7Qi4KlW0zpjx2paxo3Be/w22bTkfoPE20r2sRSNDezhjr MIzo0IPxI86mDgegQYqbFozfwZj16tFq28i8ulqzkTn9wTpY7V4gUY91yOutQHCCBsCg 49ffMptkiS2VGGVgZk/c2LUq6dhppTGuKbhAw= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=P/jTrRdwK9paej+PhpFYV7Jl3qsjeIjXhCkDciO4ONBSlRO4sOPwlU/8YsZoHpGOOb 0E2Ux/6fs3O0kXXr0W+HJStyo78Ef8MJLHI7fPblgVPyiY2EgZOAnrovm678OvsagDH/ CrdNSYX2hDoTxBf7dH66DA0N5p+SJcqldPpe4= Original-Received: by 10.223.74.1 with SMTP id s1mr733319faj.138.1295446451501; Wed, 19 Jan 2011 06:14:11 -0800 (PST) Original-Received: by 10.223.112.19 with HTTP; Wed, 19 Jan 2011 06:14:10 -0800 (PST) In-Reply-To: X-Bayes-Prob: 0.0001 (Score 0, tokens from: @@RPTN) X-CanIt-Geo: ip=209.85.161.41; country=US; region=CA; city=Mountain View; postalcode=94043; latitude=37.4192; longitude=-122.0574; metrocode=807; areacode=650; http://maps.google.com/maps?q=37.4192,-122.0574&z=6 X-CanItPRO-Stream: uu:ntg-context@ntg.nl (inherits from uu:default, base:default) X-Canit-Stats-ID: 0bDVqeb6x - 6d02e5990ede - 20110119 X-Scanned-By: CanIt (www . roaringpenguin . com) on 194.171.167.218 X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.12 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl Xref: news.gmane.org gmane.comp.tex.context:65349 Archived-At: --===============1603030346== Content-Type: multipart/alternative; boundary=20cf30433dfe913ce1049a33a151 --20cf30433dfe913ce1049a33a151 Content-Type: text/plain; charset=ISO-8859-1 2011/1/19 luigi scarso > On Wed, Jan 19, 2011 at 1:58 PM, Cecil Westerhof > wrote: > > Properly not really a ConTeXt question, but maybe nows the answer. > > > > Someone asked me how to convert a PDF to XML and back. The reasons is > that > > he has a PDF in English, but he likes to have it also in Russian. His > idea > > is to convert the PDF file to XML, translate the XML file with > > GoogleTranslate and convert the translated XML file to PDF. He asked me > how > > to do this. Of-course it does not have to be a XML file, if > GoogleTranslate > > can work with a TEX file, there is no reason not to do it. > google for pdftotext > Already done. What looked the most promissing was pdftohtml. Just wondering if there is a better way. -- Cecil Westerhof --20cf30433dfe913ce1049a33a151 Content-Type: text/html; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable
2011/1/19 luigi scarso <luigi.scarso@gmail.com>
On Wed, Jan 19, 2011 at 1:58 PM, Cecil Westerhof <cldwesterhof@gmail.com> wrote:=
> Properly not really a ConTeXt question, but maybe nows the answer.
>
> Someone asked me how to convert a PDF to XML and back. The reasons is = that
> he has a PDF in English, but he likes to have it also in Russian. His = idea
> is to convert the PDF file to XML, translate the XML file with
> GoogleTranslate and convert the translated XML file to PDF. He asked m= e how
> to do this. Of-course it does not have to be a XML file, if GoogleTran= slate
> can work with a TEX file, there is no reason not to do it.
google for pdftotext

Already done. What look= ed the most promissing was pdftohtml. Just wondering if there is a better w= ay.

--
Cecil Westerhof
--20cf30433dfe913ce1049a33a151-- --===============1603030346== Content-Type: text/plain; charset="us-ascii" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Disposition: inline ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : http://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________ --===============1603030346==--