From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/23212 Path: news.gmane.org!not-for-mail From: "Patrick Gundlach" Newsgroups: gmane.comp.tex.context Subject: Re: OT: PDF word counts? Date: 2 Nov 2005 18:15:10 +0100 Organization: chaos Message-ID: References: <20051102170927.30074@mail.comp.lancs.ac.uk> Reply-To: mailing list for ConTeXt users NNTP-Posting-Host: main.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Trace: sea.gmane.org 1130951820 20892 80.91.229.2 (2 Nov 2005 17:17:00 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 2 Nov 2005 17:17:00 +0000 (UTC) Original-X-From: ntg-context-bounces@ntg.nl Wed Nov 02 18:16:57 2005 Return-path: Original-Received: from ronja.vet.uu.nl ([131.211.172.88] helo=ronja.ntg.nl) by ciao.gmane.org with esmtp (Exim 4.43) id 1EXMCr-0007Ky-Od for gctc-ntg-context-518@m.gmane.org; Wed, 02 Nov 2005 18:15:25 +0100 Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id C8512127CF; Wed, 2 Nov 2005 18:15:24 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 09660-04-2; Wed, 2 Nov 2005 18:15:20 +0100 (CET) Original-Received: from ronja.vet.uu.nl (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id 7A2DC127C4; Wed, 2 Nov 2005 18:15:20 +0100 (CET) Original-Received: from localhost (localhost [127.0.0.1]) by ronja.ntg.nl (Postfix) with ESMTP id B7F1C127C4 for ; Wed, 2 Nov 2005 18:15:19 +0100 (CET) Original-Received: from ronja.ntg.nl ([127.0.0.1]) by localhost (smtp.ntg.nl [127.0.0.1]) (amavisd-new, port 10024) with LMTP id 09660-04 for ; Wed, 2 Nov 2005 18:15:18 +0100 (CET) Original-Received: from lilly.ping.de (lilly.ping.de [83.97.42.2]) by ronja.ntg.nl (Postfix) with SMTP id C430F1279D for ; Wed, 2 Nov 2005 18:15:18 +0100 (CET) Original-Received: (qmail 14562 invoked from network); 2 Nov 2005 17:15:18 -0000 Original-Received: (ofmipd 83.97.40.204); 2 Nov 2005 17:14:56 -0000 Original-To: ntg-context@ntg.nl X-Lieblings-Musik: the_capricorns In-Reply-To: <20051102170927.30074@mail.comp.lancs.ac.uk> (Adam Lindsay's message of "Wed, 2 Nov 2005 18:09:27 +0100") User-Agent: Gnus/5.11 (Gnus v5.11) Emacs/22.0.50 (darwin) X-Virus-Scanned: amavisd-new at ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.1.5 Precedence: list List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Original-Sender: ntg-context-bounces@ntg.nl Errors-To: ntg-context-bounces@ntg.nl X-Spam-Checker-Version: SpamAssassin 3.0.3 (2005-04-27) on smtp.ntg.nl X-Virus-Scanned: amavisd-new at ntg.nl Xref: news.gmane.org gmane.comp.tex.context:23212 Archived-At: Hi Adam, > Sorry for going a bit off-topic, but does anyone here know of good tools > for doing a word count directly on existing PDFs? what about pdftotext from xpdf?: /opt/xpdf/current/bin/pdftotext lettrine.pdf - | wc -w 1672 It's not that accurate, but perhaps it is sufficient? Patrick -- ConTeXt wiki and more: http://contextgarden.net