OCRKit
OCRKit 2.5
Your rating: Now say why...

(1) 4

Convert PDF or image graphics into searchable PDF.   Demo ($69.00)
Add to my Watch List
Email me when discounted
OCRKit uses OCR (Optical Character Recognition) technology to recognize the text in the graphic, which is particularly useful for PDFs received via e-mail, created by DTP, office applications, or with a scanner or copier. Create searchable PDFs, RTF, HTML, or Unicode plain text simply by dragging a document on the OCRKit application or Dock icon.

Searchable Text. OCRKit is a simple and streamlined Mac application, that features the advanced Optical Character Recognition technology, allowing you to convert scanned or printed documents into searchable and
What's New
Version 2.5:
  • Up to 8-times higher speed conversion on latest Mac
  • Improved auto-rotation for high-resolution documents
  • Highly Compressed PDF - file size reduction
  • Improved recognition accuracy and stability
  • Improved text positioning accuracy in PDF
  • Many other improvements
Requirements
Intel, OS X 10.5 or later



MacUpdate - OCRKit




  • Prizmo
    +8

  • ABBYY FineRead...
    +7

  • PDFpen
    +5

  • PDFpenPro
    +5

  • VelOCRaptor
    +1

  • Readiris Pro
    +1
OCRKit User Discussion (Write a Review)
ver. 2.x:
(1)
Your rating: Now say why...
Overall:
(11)

sort: smiles | time
burypromote

+124

Aeschylus reviewed on 22 Feb 2014
OCR Kit does as good a job as any OCR product on the Mac. Compared to using Adobe Acrobat on a PC, its accuracy is similar, although its features are less. Comparing prices, there's no comparison. While it may take a few minutes to learn OCR kit, there is little to do for single document conversion and it is very reliable. I like it a lot, and it is the only reason I can do OCR on my Mac. Thanks for a nice product.
[Version 2.4.1]

1 Reply

burypromote

+4
Meitnik replied on 13 Apr 2014
Can you or someone tell how good is it with german. Can it learn new words as it OCRs? Have a user dict to edit? Thanks
burypromote
+2

+84

Ryde reviewed on 20 Feb 2013
I've purchased quite a few licenses for OCR applications in the last couple of months, and all of these products have something going for them, but OCRKit stands out for at least two reasons:

a) OCRKit can be invoked in an automated, i.e. "batch", fashion.
b) OCRKit is able to preserve CCITT Group 4 compression in scanned images, at least in some cases. This finally gives me output files which have the exact same image quality as the original and are only marginally bigger due to the added text content. Kudos!

OCRKit is not perfectly straightforward to use IMHO and could definitely use a better documentation, though.

Regarding the recognition accuracy: I have been able to obtain very good results with OCRKit provided that the quality of the scanned image is high. In other cases the recognized text will contain errors, as with any other OCR software. OCRKit is said to be able to utilize a dictionary for better results, but it seems to me that this does not yet work in all cases.

So overall I am quite impressed with OCRKit, but improvements are of cause welcome. I hope the developers continue to bring their app forward, it has a lot of potential.

Please also see my comments added to the review of "brianvon" right below.
[Version 1.13]


burypromote
-1

-1

brianvon reviewed on 16 Feb 2013
Bloatware, takes my 1 MB file and at MINIMUM file size expands it to nearly 10 MB. Under higher quality settings it bloats the pdf 33x larger.

Spell correction does not work well. It took an easily readable word "confirmation" --> "confirmatioIL"

And that was with dictionary turned on!

try again
[Version 1.13]

1 Reply

burypromote
+3

+84
Ryde replied on 20 Feb 2013
Unfortunately, your findings do apply to most OCR available on the Mac, not just OCRKit (which does actually better than many competitors, see below). So your half-star rating is a too bit harsh IMO.

The problem is that most Mac applications which deal with PDFs are implemented using Apples PDFKit framework which has some limitations. In particular, when writing out PDFs, it can only write a subset of the image codecs defined by the PDF standard. So if you got PDFs which use a sophisticated special purpose (i.e. high-quality and high-compression) image codec like JBIG2 or CCITT Group 4, your average PDFKit based OCR application which only supports Deflate and JPEG as its compression methods has a problem. Either will it create files which are much bigger than the original (Deflate keeps the image quality as is, but is too simplistic to compress well) or which got degraded graphics (JPEG trades image quality for smaller file sizes). To avoid these drawbacks, the software developers would either need to go extra miles and partly bypass PDFKit or use a different PDF handling library altogether. Both requires effort, knowledge and maybe additional licensing costs, so most companies don't seem to bother. The developers of OCRKit are a notable exception, because at least in some cases OCRKit is actually able to transfer the images 1:1.

Regarding the recognition accuracy: I have been able to obtain very good results with OCRKit provided that the quality of the scanned image is high. But I've also had some cases like you mentioned, i.e. errors which I thought the dictionary function should have caught easily.

So overall I am quite impressed with OCRKit, but improvements are of cause welcome. I hope the developers continue to bring their app forward, it has a lot of potential.
burypromote
+1

+124

Aeschylus reviewed on 16 Jan 2013
I downloaded the demo and ran it on some very difficult documents - poorly made scans, documents with lots of wild fonts, etc. It performed relatively well. I ran it on a PDF i created by removing the text from commercially produced PDF (I exported it to .png, then exported again to PDF in Preview) and it was flawless. It was surprisingly quick on my 5 year old Mac Mini. The verdict - well worth the price. When I saw the PDFs it produces, they are SO much nicer than the CRAZY MS Word docs produced by Adobe Acrobat (they consist of zillions of text boxes and other strange items, not straightforward text and pictures). All in all, what does more for anywhere near this price? Acrobat has a few tricks this doesn't do (for example, editing PDFs, but with great difficulty), but OCRKit is so much nicer to use.
[Version 1.13]

1 Reply

burypromote

+124
Aeschylus replied on 16 Jan 2013
I forgot to say, of course I bought it.
burypromote
+2

+10

davert reviewed on 03 Dec 2012
The program works at about the same speed as Acrobat but is far easier to use. I was hoping it would not affect the images within my scanned PDF files, but it appears to have degraded them more than Acrobat -- not a huge deal for some purposes, but it makes me wonder why I can't make a PDF file searchable without changing the images in it.
[Version 1.12]

4 Replies

burypromote
Jsm17600 replied on 16 Jan 2013
Why doesn't OCRKit start by creating a copy of the file and then working with that? Of course, the user could create a copy and put the copy into OCRKit...
burypromote

+1
Cartoonasaur replied on 16 Jan 2013
I make copies regardless of the program for the simple reason that OCR increases the size gigantically. And sometimes I just want the darn PDF for my iPhone or iPad and I really don't want a huge file eating up my precious space. So making copies is just a good standard practice, especially since the file at that point is so tiny relative to the final OCR'd output...
burypromote

+10
davert replied on 16 Jan 2013
@ExactCode -- thanks. I look forward to seeing the update -- generally if I want a PDF to be searchable-ized, it's all images.
burypromote
+1

+84
Ryde replied on 16 Jan 2013
@ExactCODE Even if this feature will only work for the simple case of scanned, image-only PDF documents, it would really be great if this updated version you are working on could finally keep CCITT Group 4 compressed images 1:1. I have a big lot of scanned documents lying around waiting to be OCR'd, but there is still no OCR product on the Mac which gets this important detail right (aside from Adobe Acrobat).

Background Information: CCITT T.6 aka Group 4 compression is a very space efficient lossless compression algorithm which was originally developed for facsimile transmission of black-and-white images over telephone lines and data networks. It is optimized for documents with black letters on white background, where it typically yields output files which are multiple orders of magnitude smaller than the results of any other image compression algorithm. CCITT Group 4 has been part of the PDF standard right from the start and so it is still the best choice when storing scanned bi-level images in a TIFF or PDF file.

However, because almost all OCR solutions available on OS X use Apple's PDF Kit framework, and because PDF Kit can only read this format, but does not write it, the resulting files are either much bigger than the original or suffer from a significant loss of image quality as the OCR apps throw away image detail to save space. :-(((
burypromote

-1

jefferis reviewed on 31 Aug 2012
I just tried the 14 day demo, but the program is acting weird. When it scanned a pdf document, all it did was produce really strange text characters and showed no OCR results:

’1ì*—Á Ö-bîe+ PýæàfŽ§V~ÎM=nTø+ÐxÎX¤äˆ]°äVoŠ¥ð”–´·V³]ÎKmÏݯðšQqãûë˜Üì‘$8ÆG&¸ÿØGâIÞgË3ýÓҋ?z¬åÝ}ðüeoÎcVÞ~@;fº¥‘„B}k–ø}‰ü±cˁ“ƒÖº¥LX;R;òø§Oqç̉†ä‰A‘Ãô °ÎNp)ŎуÅ{°C©8µ$k¿°@;.!‡p®Ñò!Âdç©=ªYHS—S´ÓbîzQp
³‘õ¦H‘ýì¬l
[Version 1.12]

1 Reply

burypromote
+1

+14
Marilou53 replied on 16 Jan 2013
Prizmo 2 can perform OCR on screenshots too. And it works! You should give it a try.
burypromote

Yohan199 reviewed on 15 Aug 2012
This app works great! It is super fast when scanning big documents. It also allows for text recognition within the images of my file which is a great feature!
[Version 1.10.4]


burypromote
+1

+3

Garyrob reviewed on 15 Feb 2012
So far, I'm very pleased. I had been using PDFPen, but it repeatedly crashes or freezes. In my limited experience with OCRKit it has performed flawlessly (on a 22-page document that consistently made PDFPen crash).
[Version 1.9.4]


burypromote
-2

-2

Brian Duffy reviewed on 20 Jan 2012
Ah...I didn't need to give my email and this is not a demo.

I just bought a Mac running Lion and of course my older, expensive, Omni Page OCR software was no longer supported due to Apple asinine decision to drop Rosetta emulation. So, I had to scramble to find a quick and cheap alternative and wow, this did the job! For Free! I also downloaded OCRTools from the app store for $4.99 and than worked well also, maybe just sightly better.
[Version 1.9.3]

2 Replies

burypromote

+680
sjk replied on 03 Feb 2012
By free, do you mean the 14-day trial period?
burypromote

+23
Rpmurray replied on 28 Apr 2012
due to Omni Page's asinine decision not to update old technology.

Fixed it for you.
burypromote
-1

+41
Adrian commented on 15 Nov 2010
What? You cannot download the demo without giving them your email address? Lame.

No thanks, there're plenty of other choices.
[Version 1.5]

1 Reply

burypromote
+2

+3
tissot replied on 12 Dec 2011
"there're plenty of other choices" : I really doubt it. If there's one category lacking in Mac OS software, it's professional OCR tools.
There are currently no troubleshooting comments. If you are experiencing a problem with this app, please post a comment.


Cigire rated on 16 Jan 2013

[Version 1.13]



+14

Marilou53 rated on 16 Jan 2013

[Version 1.13]



-1

ParisP rated on 27 Apr 2012

[Version 1.9.6]


Downloads:22,875
Version Downloads:183
Type:Business : Word Processing
License:Demo
Date:13 Apr 2014
Platform:Intel 64 / Intel 32 / OS X
Price: $69.00
Overall (Version 2.x):
Features:
Ease of Use:
Value:
Stability:
Displaying 1-10 of 12
1 2 >
Displaying 1-3 of 3
-
-
-
Please login or create a new
MacUpdate Member account
to use this feature
Watch Lists are available to
MacUpdate Desktop Members
Upgrade Now
Install with MacUpdate Desktop.
Save time moving files & cleaning
up space wasting archives.
OCRKit uses OCR (Optical Character Recognition) technology to recognize the text in the graphic, which is particularly useful for PDFs received via e-mail, created by DTP, office applications, or with a scanner or copier. Create searchable PDFs, RTF, HTML, or Unicode plain text simply by dragging a document on the OCRKit application or Dock icon.

Searchable Text. OCRKit is a simple and streamlined Mac application, that features the advanced Optical Character Recognition technology, allowing you to convert scanned or printed documents into searchable and editable text. It can be a great help for everyone, home users, corporate users at work and educational institutions.

You can use the copy and paste tools on the document, instead of retyping everything and correct it. It increases the efficiency and effectiveness of office workflow. OCRKit is fast and accurate, ensuring the document's content remains intact while saving time as well.

OCR Engine
  • Text recognition accuracy
  • Instant content search ability
  • PDF compression
  • High-speed conversion
  • Stability and performance
The OCR engine recognizes the following languages: Bulgarian, Croatian, Czech, Danish, Dutch, English, Estonian, Finnish, French, German, Hungarian, Italian, Latvian, Lithuanian, Polish, Portuguese, Rumanian, Russian, Serbian, Slovenian, Spanish, Swedish, Turkish, Ukrainian, and Norwegian.

Automatic rotation. The automatic rotation determines the orientation of each scanned sheet automatically, therefore avoiding the need to manually pre-sort a stack before you scan. As the automatic rotation works independently from the OCR mechanism, the feature also helps to improve OCR results.

Multi-Page Documents. There are no limits to the number of pages of the PDF or TIFF documents. There are also no limits to the number of pages that can be processed each day. Usage is very simple: just drag a document into the app's Dock icon, or open it with the OCRKit application (Open With > ).

AppleScript Support. OCRKit includes AppleScript support, so professional users can use it for programmed batch conversion.


- -