TessOCR

1.06 10 Jun 2012

Free (GNU) OCR software based on Tessaract.

3

Developer website: IITOYOSAN-SYORIN

TessOCR is free OCR tool using tesseract, ImageMagick and Xpdf as a framework with JVM. TessOCR is released and distributed under the Apache License, Version 2.0.

Features:

  • Supported language: Japanese, English, French and so on. Additional support for character recognition dictionary.
  • Layout recognition: Detects horizontal-writing and vertical-writing automatically. Recognizes only content of tabular.
  • Recognizable format of image data: JPEG,PNG,GIF,BMP, TIFF and PDF.
  • Recognizable image dimensions: There is no particular limitation.
  • Recognizable character size: (Under the investigation)
  • Elimination of noise in the image: Manual control.
  • Correction of the inclination of the image: Manual control.
  • Crop the image: Manual control. Spread pages can be specified.
  • Convert to the grayscaled image by threshold: Manual control.
  • Training the character recognition dictionary: Semi-automatic control. You can edit the box.
  • Text Editing : You can input the text and edit it, and save it. You can search the text and replace with another string.

TessOCR uses internally tesseract, ImageMagick and Xpdf to process the image. However, tesseract, ImageMagick and/or Xpdf do not include as a framework of TessOCR. If tesseract, ImageMagick and/or Xpdf is already installed in your environment, TessOCR will link to it. If tesseract, ImageMagick and/or Xpdf have not been installed yet, that thing will notify to you. You have to install tesseract, ImageMagick and/or Xpdf using MacPorts.

What's New

Version 1.06:
  • Bug fix in globalization.

Ratings

Overall
(3)
Current Version (1.x)
(3)

Details

Downloads
2,489
Version Downloads
1,971
Type
Business / Word Processing
License
Free
Date
10 Jun 2012
Platform
OS X / Intel 32
Price
Free