From mboxrd@z Thu Jan 1 00:00:00 1970 X-Msuck: nntp://news.gmane.io/gmane.comp.tex.context/15009 Path: main.gmane.org!not-for-mail From: Erik Hetzner Newsgroups: gmane.comp.tex.context Subject: Re: ConTeXt and the blind Date: Thu, 15 Apr 2004 04:06:58 -0700 Sender: ntg-context-admin@ntg.nl Message-ID: <407E6CD2.3070108@ocf.berkeley.edu> References: <4E1ABA54-8E55-11D8-AE3A-00306544E64E@princeton.edu> <20040414172047.36881177@atipa.local> Reply-To: ntg-context@ntg.nl NNTP-Posting-Host: deer.gmane.org Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii; format=flowed Content-Transfer-Encoding: 7bit X-Trace: sea.gmane.org 1081984012 6012 80.91.224.253 (14 Apr 2004 23:06:52 GMT) X-Complaints-To: usenet@sea.gmane.org NNTP-Posting-Date: Wed, 14 Apr 2004 23:06:52 +0000 (UTC) Original-X-From: ntg-context-admin@ntg.nl Thu Apr 15 01:06:42 2004 Return-path: Original-Received: from ref.vet.uu.nl ([131.211.172.13] helo=ref.ntg.nl) by deer.gmane.org with esmtp (Exim 3.35 #1 (Debian)) id 1BDtSs-00006i-00 for ; Thu, 15 Apr 2004 01:06:42 +0200 Original-Received: from ref.ntg.nl (localhost.localdomain [127.0.0.1]) by ref.ntg.nl (Postfix) with ESMTP id AA69710B5F; Thu, 15 Apr 2004 01:03:11 +0200 (MEST) Original-Received: from war.OCF.Berkeley.EDU (war.OCF.Berkeley.EDU [192.58.221.244]) by ref.ntg.nl (Postfix) with ESMTP id 78B1B10B54 for ; Thu, 15 Apr 2004 01:01:26 +0200 (MEST) Original-Received: from ocf.berkeley.edu (adsl-63-204-198-82.dsl.snfc21.pacbell.net [63.204.198.82]) by war.OCF.Berkeley.EDU (8.12.11/8.9.3) with ESMTP id i3EN4m5B013322 for ; Wed, 14 Apr 2004 16:04:49 -0700 (PDT) User-Agent: Mozilla/5.0 (Windows; U; Win98; en-US; rv:1.6b) Gecko/20031205 Thunderbird/0.4 X-Accept-Language: en-us, en Original-To: ntg-context@ntg.nl In-Reply-To: <20040414172047.36881177@atipa.local> Errors-To: ntg-context-admin@ntg.nl X-BeenThere: ntg-context@ntg.nl X-Mailman-Version: 2.0.13 Precedence: bulk List-Help: List-Post: List-Subscribe: , List-Id: mailing list for ConTeXt users List-Unsubscribe: , List-Archive: Xref: main.gmane.org gmane.comp.tex.context:15009 X-Report-Spam: http://spam.gmane.org/gmane.comp.tex.context:15009 Bill McClain wrote: >On Wed, 14 Apr 2004 16:50:04 -0400 >Alan Bowen wrote: > > > >>So, does anyone on the list have ideas about how to produce such files >> >>from the files I currently have in hand or any experience with this >>sort of problem? >> >> > >I have used the pdftotext utility, part of the xpdf package, for similar >tasks. In the case of hyphenated line endings, the word will be >hyphenated and broken across lines just as in the pdf, and that might be >a problem for the reader program. > >-Bill > > From my own experience pdftotext also has trouble handling multicolumn documents. Adobe has an online utility for transforming PDF to html, which can rather easily be turned into text, which worked pretty well for me, breaking columns into something useful instead of mashing all the text together. Regards, Erik Hetzner