From mboxrd@z Thu Jan 1 00:00:00 1970 From: brad@heeltoe.com (Brad Parker) Date: Wed, 30 Apr 2008 09:01:51 -0400 Subject: [Unix-jun72] ocr'd e03 In-Reply-To: <20080430102120.GA84492@minnie.tuhs.org> Message-ID: <3018.1209560511@mini> Hi, I'm new to this (just discovered it - way cool!), so as an experiment I opened the scanned pdf and cut and pasted e03-01,02,03,04 into gimp, shrunk them to 3000x3000 and sent them to the tesseract web site. It does an amazing job. A little emacs work and the source looks good. Anyway, I know e03 is assigned to someone else, but they where not in the svn. should I check them in? (I just did it as an experiment, and I don't want to step on anyone;) I'm also curious how we boot strap this. In the end I assume we need a binary image which one of the sims can read. I have 0.5 a mind to write a quick and dirty assembler which outputs a binary file... But I suppose it would be better to use the original as/as2. Can this be run with apout? (I'd be curious to hear how people are doing it). I'm happy to keep plugging through the remaining un-ocr'd pages if no one screams, sending email first of course. -brad