Sunday, November 29, 2009

What is OCR? (Optical Character Recognition)

OCR Definition

OCR software or Optical Character Recognition Software is a function of certain software applications that provides the means to convert images, or portions of images to text.  Scanned documents are almost always create as non-text image formats, such as TIFF, PDF, JPG, etc.  The process of basic OCR makes them searchable, and thus more useful when you require the ability to search the contents of scanned documents.  The core system uses a combination of pattern recognition and artifical intelligence to interpret the images, and create the  most accurate output.  Many of the more popular engines provide the ability to output not only to text, but word processor formts, HTML, PDF, etc.

No comments:

Post a Comment