9fans - fans of the OS Plan 9 from Bell Labs
* [9fans] pdf2txt
From: cej @ 2004-11-03  9:47 UTC (permalink / raw)
  To: 9fans


hi,

looking into src of 'page', it seems that
pdf docs don't go to GhostScript, but are treated internally.
Would it then be a problem to have something like 'pdf2txt'?

	cat doc.pdf | pdf2ps | ps2a  (or the like)

doesn't work apparently due to pdf2ps.
Or, how do YOU index your pdf docs? Manually??

thanks,

++pac


* Re: [9fans] pdf2txt
From: Russ Cox @ 2004-11-03 12:53 UTC (permalink / raw)
  To: Fans of the OS Plan 9 from Bell Labs

> looking into src of 'page', it seems that
> pdf docs don't go to GhostScript, but are treated internally.

they go to ghostscript.

> Would it then be a problem to have something like 'pdf2txt'?
> 
>         cat doc.pdf | pdf2ps | ps2a  (or the like)
> 
> doesn't work apparently due to pdf2ps.

you could look at the xpdf tools, which include
a pdftotext.
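
a rough sketch of that route, assuming xpdf's pdftotext is installed
and on the path: run it on each file with '-' as the output name so the
text goes to standard output, then record which words occur in which
document. the function names (pdf_to_text, index_text, build_index) and
the word-splitting rule are illustrative assumptions, not part of xpdf
or of page.

```python
import re
import subprocess
from collections import defaultdict


def pdf_to_text(path):
    """Extract text from a PDF by running pdftotext (assumed installed).

    Passing '-' as the output file name makes pdftotext write to stdout.
    """
    result = subprocess.run(
        ["pdftotext", path, "-"],
        capture_output=True,
        check=True,  # raise CalledProcessError if pdftotext fails
    )
    return result.stdout.decode("utf-8", errors="replace")


def index_text(index, name, text):
    """Add every distinct lowercase word in text to index under name."""
    for word in set(re.findall(r"[a-z0-9]+", text.lower())):
        index[word].add(name)


def build_index(paths):
    """Build a word -> set-of-documents map for the given PDF paths."""
    index = defaultdict(set)
    for path in paths:
        index_text(index, path, pdf_to_text(path))
    return index
```

with an index built this way, build_index(["doc.pdf"])["ghostscript"]
would give the set of files that mention the word. the word rule here
only keeps ascii letters and digits, so it would need widening for
non-english documents.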

> Or, how do YOU index your pdf docs? Manually??

i don't.

russ

