On Friday 12 September 2008 15:26:05 Ruth Bygrave wrote:
> I'm having a go at OCRing a book that fell apart:
This is a bit tangential, but it might be worth having a look at tools like unpaper (http://unpaper.berlios.de/), which tidies up page scans.
What OCR software are you using? I've heard that Tesseract (http://code.google.com/p/tesseract-ocr/) is supposed to be quite good on the Mac, though I've never tried it myself. The installation may be a bit involved, though: http://littlefixes.blogspot.com/2008/06/open-source-ocr-on-mac.html
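If you do end up with both installed, chaining them per page might look roughly like the Python sketch below. It is only a sketch under assumptions: the file names (page-001.pgm, the out directory) are made up, it uses just the basic "unpaper input output" and "tesseract image outputbase" invocations with no extra options, and it assumes your Tesseract build can read the PNM files unpaper writes (some releases are pickier about input formats, so you may need to convert first).

```python
import shutil
import subprocess
from pathlib import Path


def build_commands(scan: Path, workdir: Path) -> list[list[str]]:
    """Return the unpaper and tesseract command lines for one scanned page."""
    # Hypothetical naming scheme: page-001.pgm -> out/page-001-clean.pgm
    cleaned = workdir / f"{scan.stem}-clean{scan.suffix}"
    # tesseract takes an output *base* name and appends .txt itself
    text_base = workdir / scan.stem
    return [
        ["unpaper", str(scan), str(cleaned)],
        ["tesseract", str(cleaned), str(text_base)],
    ]


def ocr_page(scan: Path, workdir: Path) -> Path:
    """Clean up one scan with unpaper, then OCR the result with tesseract."""
    workdir.mkdir(parents=True, exist_ok=True)
    for cmd in build_commands(scan, workdir):
        # Fail early with a readable message if a tool is missing
        if shutil.which(cmd[0]) is None:
            raise RuntimeError(f"{cmd[0]} is not installed or not on PATH")
        subprocess.run(cmd, check=True)
    return workdir / f"{scan.stem}.txt"
```

Calling ocr_page(Path("page-001.pgm"), Path("out")) would then leave the recognised text in out/page-001.txt, assuming both tools run cleanly on that page.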
Cheers, Richard