A paperless office workflow
Authored by: dewab on Jan 17, '13 09:00:37AM

I use the ScanSnap S1500M and have tried its "built-in" OCR, as well as Acrobat's and PDFpen's. Because I'm using Hazel+AppleScript to automatically sort and rename a large number of bills and accounts, I rely heavily on identifying unique criteria (URLs, account #'s, etc.) to ensure that I'm moving and renaming each bill correctly. I have found that OCR does an okay job of it, but I occasionally have to re-OCR a document to get the correct information without stray spaces or misidentified characters.

If you look at the raw text of OCRed PDFs (using pdftotext or the like) you'll actually see how hit-or-miss OCR really is.
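
For anyone who wants to check this on their own scans, here is a rough Python sketch (not my actual Hazel+AppleScript setup). It assumes the pdftotext command-line tool is installed and on your PATH; the function names and the sample account number are made up for illustration. It dumps the raw text layer and then looks for an identifier while tolerating stray whitespace that OCR may have inserted between characters. It does not handle misidentified characters (0 vs. O, 1 vs. l), which would still need a re-OCR or fuzzier matching.

```python
import re
import subprocess


def pdf_text(path):
    """Return the raw text layer of a PDF by calling the pdftotext CLI.

    Passing "-" as the output file makes pdftotext write to stdout.
    """
    result = subprocess.run(
        ["pdftotext", path, "-"],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def tolerant_pattern(identifier):
    """Build a regex that matches the identifier even when whitespace
    has been inserted between any of its characters."""
    chars = [re.escape(c) for c in identifier if not c.isspace()]
    return re.compile(r"\s*".join(chars), re.IGNORECASE)


def contains_identifier(text, identifier):
    """True if the identifier appears in text, ignoring stray whitespace."""
    return tolerant_pattern(identifier).search(text) is not None


if __name__ == "__main__":
    # Hypothetical OCR output with a space dropped into the account number.
    sample = "Account No: 1234 5 678-90\nVisit www. example .com/bill"
    print(contains_identifier(sample, "12345678-90"))
    print(contains_identifier(sample, "www.example.com"))
```

To run it against a real scan, call contains_identifier(pdf_text("some-bill.pdf"), "your-account-number"), where the file name and number are placeholders for your own.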
