From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/52729 Path: news.gmane.org!not-for-mail From: Oliver Buerschaper Newsgroups: gmane.comp.tex.context Subject: Re: [OT] Edit PDF manually Date: Mon, 7 Sep 2009 18:34:28 +0200 Message-ID: <52DF152A-3938-4965-901F-5AB0F01B4CB9@mpq.mpg.de> References: <387332A8-3062-4CF3-A318-F2E327345BD1@mpq.mpg.de> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: lo.gmane.org Mime-Version: 1.0 (Apple Message framework v936) Content-Type: text/plain; charset="us-ascii"; Format="flowed"; DelSp="yes" Content-Transfer-Encoding: 7bit X-Trace: ger.gmane.org 1252347428 23308 80.91.229.12 (7 Sep 2009 18:17:08 GMT) X-Complaints-To: usenet@ger.gmane.org NNTP-Posting-Date: Mon, 7 Sep 2009 18:17:08 +0000 (UTC) To: mailing list for ConTeXt users Original-X-From: ntg-context-bounces@ntg.nl Mon Sep 07 18:34:50 2009 Return-path: Envelope-to: gctc-ntg-context-518@m.gmane.org Original-Received: from balder.ntg.nl ([195.12.62.10]) by lo.gmane.org with esmtp (Exim 4.50) id 1MkhB1-00068F-RB for gctc-ntg-context-518@m.gmane.org; Mon, 07 Sep 2009 18:34:47 +0200 Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id 33847C9A92; Mon, 7 Sep 2009 18:34:47 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id fHfXHEsq8pHU; Mon, 7 Sep 2009 18:34:43 +0200 (CEST) Original-Received: from balder.ntg.nl (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id B8E62C9A81; Mon, 7 Sep 2009 18:34:43 +0200 (CEST) Original-Received: from localhost (localhost [127.0.0.1]) by balder.ntg.nl (Postfix) with ESMTP id D0C69C9A81 for ; Mon, 7 Sep 2009 18:34:41 +0200 (CEST) X-Virus-Scanned: Debian amavisd-new at balder.ntg.nl Original-Received: from balder.ntg.nl ([127.0.0.1]) by localhost (balder.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id sKpkJEhp45Vb for ; Mon, 7 Sep 2009 18:34:29 +0200 (CEST) Original-Received: from post.rzg.mpg.de (post.rzg.mpg.de [130.183.30.42]) by balder.ntg.nl (Postfix) with ESMTP id 80657C9A7C for ; Mon, 7 Sep 2009 18:34:29 +0200 (CEST) Original-Received: from [130.183.93.58] ([130.183.93.58]) (authenticated bits=0) by post.rzg.mpg.de (8.14.3/8.14.3) with ESMTP id n87GYSDU1266082 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO) for ; Mon, 7 Sep 2009 18:34:29 +0200 In-Reply-To: X-Mailer: Apple Mail (2.936) X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.12 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl Xref: news.gmane.org gmane.comp.tex.context:52729 Hi Luigi, > pdf spec. > www.adobe.com/devnet/acrobat/pdfs/PDF32000_2008.pdf > xpdf sources That sounds like a definite reference ... probably more than I can digest for a start :-( Perhaps there's a simple, example-driven guide for dummies somewhere? Like writing a simple PDF document with more than one page, non- contiguous text blocks and perhaps a hyperlink by hand ... > Under linux pdfedit is experimental > http://pdfedit.petricek.net/en/index.html Installed :-) OK, I've just managed to remove a page from a given PDF file. Beyond that one will probably have to know more about PDF ... for example, do you know how pdfedit can help me identify which object in the raw PDF corresponds to a given blob of text on the page? After selecting the blob I get some info on my selection that eludes me :-( > pypdf is a python module at lowlevel. > http://pybrary.net/pyPdf/ This looks very interesting! > As exercise, you can try to minimic pdffonts in python with pypdf > (pdfs with ttf,otf,type1 etc ) I'm afraid, I don't understand :-( Best, Oliver ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________