From mboxrd@z Thu Jan 1 00:00:00 1970 Message-ID: Date: Fri, 14 May 2004 12:45:48 -0400 From: a@9srv.net To: 9fans@cse.psu.edu Subject: RE: [9fans] text search in PDF? In-Reply-To: MIME-Version: 1.0 Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable Topicbox-Message-UUID: 7b7fb5ec-eacd-11e9-9e20-41e7f4b1d025 // Rare cases when rasterized images of text pages are wrapped // into Postscript or PDF formats are mostly due to a need to // somehow publish scanned documents; or due to faulty conversion // toolchain. i can't comment on the causes, although your assesment seems quite reasonable. i can say, however, that when i looked at the issue a year or two (not more) ago, these cases were certainly not rare. my measurements were quite unscientific, but it looked to be about 50% of the documents i examined (mostly pulled indiscriminatly from the web). *=E2=80 =CE=B1