* [9fans] pdf2txt
@ 2004-11-03 9:47 cej
2004-11-03 12:53 ` Russ Cox
0 siblings, 1 reply; 2+ messages in thread
From: cej @ 2004-11-03 9:47 UTC (permalink / raw)
To: 9fans
hi,
looking into src of 'page', it seems that
pdf docs don't go to GhostScript, but are treated internally.
Would it then be a problem to have something like 'pdf2txt'?
cat doc.pdf | pdf2ps | ps2a (or the like)
doesn't work apparently due to pdf2ps.
Or, hoe do YOU index your pdf docs? Manually??
thanks,
++pac
---
Odchozí zpráva neobsahuje viry.
Zkontrolováno antivirovým systémem AVG (http://www.grisoft.cz).
Verze: 6.0.786 / Virová báze: 532 - datum vydání: 29. 10. 2004
^ permalink raw reply [flat|nested] 2+ messages in thread
* Re: [9fans] pdf2txt
2004-11-03 9:47 [9fans] pdf2txt cej
@ 2004-11-03 12:53 ` Russ Cox
0 siblings, 0 replies; 2+ messages in thread
From: Russ Cox @ 2004-11-03 12:53 UTC (permalink / raw)
To: Fans of the OS Plan 9 from Bell Labs
> looking into src of 'page', it seems that
> pdf docs don't go to GhostScript, but are treated internally.
they go to ghostscript.
> Would it then be a problem to have something like 'pdf2txt'?
>
> cat doc.pdf | pdf2ps | ps2a (or the like)
>
> doesn't work apparently due to pdf2ps.
you could look at the xpdf tools, which include
a pdftotext.
> Or, hoe do YOU index your pdf docs? Manually??
i don't.
russ
^ permalink raw reply [flat|nested] 2+ messages in thread
end of thread, other threads:[~2004-11-03 12:53 UTC | newest]
Thread overview: 2+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
2004-11-03 9:47 [9fans] pdf2txt cej
2004-11-03 12:53 ` Russ Cox
This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for NNTP newsgroup(s).