Home : July 22 2013 Computer News : OCR makes short work of digitizing your docs |
|
OCR makes short work of digitizing your docs |
July 22, 2013
The file cabinet looms large in the office, yet it guards its secrets jealously...even from you. It's time to convert those papers to space-saving, easy-to-find digital documents. For that, you need a scanner to turn them into digital images and an Optical Character Recognition program to convert those images into editable and searchable documents. I took four of the latest OCR programs and a free online OCR service for a test spin. All of them work to varying degrees.
To test the programs, I ran 22 varied and not particularly clean scans of documents—including one hand-written note—through four OCR programs and one free service. I looked for accuracy in text recognition, image extraction, and the ability to recreate them in a Word document. In addition, I processed 264 separate scans from a yearbook for output as a searchable PDF.
Free-OCR
You don't actually need to install OCR software if you need to convert only a couple of small documents. You can use a free service such as Free-OCR (also known as Free-OCR.com) and upload a scan of your document. File size is limited to 2MB and 5000 pixels in any direction, which is about 150 dpi for a standard page. The OCR engine handles 29 languages, including English.
Free-OCR makes you jump through a CAPTCHA hoop, but does it apologetically.
Although you don't have to register or even fork over your email address, the Free-OCR site does make you fill in one of those annoying CAPTCHAs. (Thanks, Web bad guys, for making everyone's life more difficult.) However, those CAPTCHAs serve to remind one just how difficult OCR can be. If humans, with our incredible heuristic abilities, occasionally have problems with these, just think how poor straight-line software perusing a stream of bits must feel.
To read this article in full or to leave a comment, please click here
Link: http://www.pcworld.com/article/2044395/ocr-makes-short-work-of-digitizing-your-docs.html#tk.rss_reviews
|
|
|
|
|