On Friday 12 September 2008 15:26:05 Ruth Bygrave wrote:
> I'm having a go at OCRing a book that fell apart:

This is a bit tangential, but it might be worth having a look at tools like unpaper <http://unpaper.berlios.de/>, which tidies up page scans.

What OCR software are you using? I've heard that Tesseract <http://code.google.com/p/tesseract-ocr/> is supposed to be quite good on the Mac (I've never tried it myself), though the installation may be a bit involved <http://littlefixes.blogspot.com/2008/06/open-source-ocr-on-mac.html>.

Cheers,
Richard
-- 
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
Richard Lewis
JID: ironchicken@jabber.earth.li
http://www.richard-lewis.me.uk/
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
+-------------------------------------------------------+
|Please avoid sending me Word or PowerPoint attachments.|
|http://www.gnu.org/philosophy/no-word-attachments.html |
+-------------------------------------------------------+