Commercial and Open Source OCR Software
Royalty-free OCR SDK for developers to use in custom applications

Open Source Journal

After testing FineReader, OmniPage, ReadIRIS, SimpleOCR, Aspire, and Tesseract, it is evident that ABBYY FineReader 9 is the best overall value, while ReadIRIS is the best OCR software for under $150.

The main features that differentiate OCR software are:

  • Character recognition accuracy
  • Page layout reconstruction accuracy
  • Support for languages
  • Support for searchable PDF output
  • Speed
  • User interface
  • API / SDK
  • Support / Consulting
  • Stability of the engine when processing large documents
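
Character recognition accuracy, the first criterion above, can be put into numbers by comparing an engine's output against a hand-typed reference transcript. The following is a minimal sketch of one common way to do that, assuming accuracy is defined as one minus the edit distance divided by the reference length. The function names are hypothetical, and this is not necessarily how the products below were compared.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions, and substitutions turning reference into hypothesis."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,         # delete a reference character
                current[j - 1] + 1,      # insert a hypothesis character
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def character_accuracy(reference: str, hypothesis: str) -> float:
    """Fraction of reference characters recognized correctly, clamped at 0."""
    if not reference:
        return 1.0 if not hypothesis else 0.0
    errors = edit_distance(reference, hypothesis)
    return max(0.0, 1.0 - errors / len(reference))


if __name__ == "__main__":
    truth = "Optical character recognition"
    ocr_output = "0ptical charactcr recognition"  # made-up OCR result for illustration
    print(f"accuracy: {character_accuracy(truth, ocr_output):.3f}")
```

Running the same reference pages through each engine and averaging this score gives a rough, repeatable basis for comparison.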

The following are some of the software packages that I tried and compared.

SimpleOCR
SimpleOCR is popular freeware OCR software with hundreds of thousands of users worldwide. It is also available as a royalty-free OCR SDK for developers to use in their custom applications. If you have a scanner and want to avoid retyping your documents, SimpleOCR is a fast, free way to do it. The SimpleOCR freeware is 100% free and not limited in any way. Anyone can use it for free: home users, educational institutions, even corporate users. The freeware application provides acceptable accuracy for those who just need to convert a few pages and can't justify the cost of commercial OCR software. Developers can use the command-line and SDK versions to integrate SimpleOCR with their custom applications.

ABBYY FineReader
FineReader Professional is highly accurate, easy-to-use OCR software with a host of features, including digital camera OCR, intelligent document layouts, image enhancement, barcode recognition, and command-line integration. FineReader 9 is our pick for OCR software because its document layout retention will save you much time in reformatting documents you convert for editing.

IRIS ReadIRIS
Affordable OCR software for business and home users. ReadIRIS Pro provides an extremely accurate recognition rate at a low cost, yet still has some of the advanced features found in higher-priced professional OCR software.

Nuance OmniPage
OmniPage is widely considered the fastest, most accurate, and most fully featured OCR software. OmniPage 17 Professional has a unique new feature that lets you convert any type of document to searchable PDF or Word. However, OmniPage does not have a downloadable demo, and Nuance does not provide free technical support after the first call. For these reasons we recommend the ABBYY and IRIS products instead.

OmniPage is an optical character recognition application available from Nuance Communications. ScanSoft acquired Nuance Communications and took over its name in October 2005. OmniPage converts images, such as scanned paper documents and PDF files, into file formats used by applications such as Microsoft Word, Excel, and Adobe Acrobat, or into HTML files. OmniPage competes with ExperVision (TypeReader), Readiris, and ABBYY FineReader, as well as free software such as GOCR and Tesseract.

http://code.google.com/p/tesseract-ocr
In computer software, Tesseract is a free optical character recognition engine. It was originally developed as proprietary software at Hewlett-Packard between 1985 and 1995. After ten years without any development taking place, Hewlett-Packard and UNLV released it as open source in 2005. Tesseract is currently developed by Google and released under the Apache License, Version 2.0.
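
Because Tesseract is a command-line engine, it is easy to drive from a script. The sketch below assumes the `tesseract` executable is installed and on the PATH, and that it is invoked as `tesseract <image> <output base> -l <language>`, which writes the recognized text to `<output base>.txt`. The helper names `build_tesseract_command` and `run_ocr` are hypothetical wrappers, not part of Tesseract itself.

```python
import shutil
import subprocess
from pathlib import Path


def build_tesseract_command(image_path: str, output_base: str, language: str = "eng") -> list:
    """Build the argument list: tesseract <image> <output base> -l <language>."""
    return ["tesseract", image_path, output_base, "-l", language]


def run_ocr(image_path: str, language: str = "eng") -> str:
    """Run Tesseract on one image and return the recognized text."""
    if shutil.which("tesseract") is None:
        raise RuntimeError("tesseract executable not found on PATH")
    # Tesseract appends ".txt" to the output base name on its own.
    output_base = str(Path(image_path).with_suffix(""))
    subprocess.run(build_tesseract_command(image_path, output_base, language), check=True)
    return Path(output_base + ".txt").read_text(encoding="utf-8")


if __name__ == "__main__":
    # "scan.tif" is a placeholder file name for illustration.
    print(build_tesseract_command("scan.tif", "scan"))
```

Keeping the command construction separate from the subprocess call makes it possible to check the arguments without having Tesseract installed.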

http://jmagick.wiki.sourceforge.net
JMagick is an open source Java interface to ImageMagick. It is implemented as a Java Native Interface (JNI) layer over the ImageMagick API. JMagick does not attempt to make the ImageMagick API object-oriented; it is merely a thin interface layer. JMagick currently implements only a subset of the ImageMagick APIs. Should you require features that JMagick does not implement, please join the mailing list and make a request. JMagick is licensed under the LGPL (GNU Lesser General Public License).

http://www.expervision.com
The award-winning TypeReader converts scanned documents into electronic files at a speed of 8,000 pages per hour with maximum reliability. Desktop 7.0 adds the flexibility to handle color and grayscale images, with duplex scanning support, and processes documents in English, French, German, Italian, Portuguese, Spanish, Dutch, Danish, Swedish, Norwegian, Finnish, Polish, Hungarian and Polynesian. It employs an unparalleled recognition technology to support 2618 fonts. Users can choose to output to various formats including PDF, MS Word, Excel, Lotus 1-2-3, HTML, etc.

http://www.edocfile.com
Tiff to Text is designed to perform optical character recognition (OCR) in a batch process. The program uses the OCR engine from Nuance (owners of OmniPage, formerly ScanSoft) that is included with Microsoft Office Document Imaging (MODI).
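
Batch OCR of this kind boils down to a loop over a folder of TIFF images that writes one text file per image. The sketch below shows that general pattern only. It is not how Tiff to Text is implemented, it does not call MODI, and `batch_ocr` and its `recognize` parameter are hypothetical names, with the actual OCR engine supplied by the caller.

```python
from pathlib import Path
from typing import Callable, List


def batch_ocr(input_dir: str, recognize: Callable[[Path], str]) -> List[Path]:
    """OCR every .tif/.tiff file in input_dir, writing a .txt file next to each.

    recognize is any function that takes an image path and returns its text.
    Returns the list of text files written, in sorted order.
    """
    written = []
    for image in sorted(Path(input_dir).iterdir()):
        if image.suffix.lower() not in {".tif", ".tiff"}:
            continue  # skip anything that is not a TIFF image
        target = image.with_suffix(".txt")
        target.write_text(recognize(image), encoding="utf-8")
        written.append(target)
    return written


if __name__ == "__main__":
    # Stand-in recognizer for illustration; a real one would call an OCR engine.
    results = batch_ocr(".", lambda image: f"text of {image.name}")
    print([path.name for path in results])
```

Passing the recognizer in as a function keeps the loop independent of any particular engine, so the same batch code works with whichever product is chosen.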

http://www.simpleocr.com/OCR_Software_Guide.asp


About Suresh Krishna Madhuvarsu
Suresh Krishna is in Oracle's Utilities division, based in San Francisco. His primary focus is on frameworks and tools. He currently blogs at http://sureshkrishna.wordpress.com.
