PDF OCR X Community Edition
Your rating: Now say why...

(4) 2.125

Convert PDFs into text documents.   Free
Add to my Watch List
Email me when discounted
PDF OCR X is a simple drag-and-drop utility that converts your PDFs into text or searchable PDF documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF even if that text is contained in an image. This is particularly useful for dealing with PDFs that were created via a Scan-to-PDF function in a scanner or photo copier. Supports over 60 languages including English, French, German, Dutch, Spanish, Portuguese, Basque, Vietnamese, and Italian currently. The OCR engine is based on Tesseract.

Note: The free
What's New
Version 2.0.8:
  • Fixed issue with handling of some PDFs with rotation.
Requirements
Intel, OS X 10.6.8 or later



MacUpdate - PDF OCR X Community Edition




  • ABBYY FineRead...
    +8

  • OCRKit
    +4

  • Prizmo
    +4

  • PDFpenPro
    +3

  • Fopydo
    +1

  • PDF Editor
    +1
PDF OCR X Communit... User Discussion (Write a Review)
ver. 2.x:
(4)
Your rating: Now say why...
Overall:
(27)

sort: smiles | time
burypromote

+6

BadBadger reviewed on 09 Feb 2014
This is crap. Try to use a language other than English - making lot of mistakes. I think that probably the majority of free online OCR works better. I chose the first result from Google: www.onlineocr.net - there was not one wrong letter.
[Version 2.0.8]


burypromote

+115

Appledogx reviewed on 18 Nov 2013
Tried it on a very simple document in Spanish. The recognition was so bad, it would be faster to retype it from scratch than make all the corrections.
[Version 2.0.8]

1 Reply

burypromote
+1
Web Lite Solutions Corp. (developer) replied on 20 Nov 2013
With OCR, accuracy depends on a number of factors, but the most important is the quality/resolution of the input image. If the input image is good quality (>150dpi), PDF OCR X generally gets over 99% accuracy. If the resolution is high and you're still getting bad results, contact support as there may be some other issues at play that can be resolved.
burypromote
+3

+188
Prince_Isaac commented on 15 Nov 2013
From the Note in the above description: "The free community edition is limited to PDFs of 1 page or less."

Nice one, Dev, nice one. You can't have less than a one-page PDF. It's like trying to get "one DVD or less" into your slot.
[Version 2.0.8]


burypromote
-5

+398

Derekcurrie reviewed on 24 Oct 2013
PDF OCR X requires the installation of a potentially dangerous, security hole ridden version of Java. I'd steer clear of it if you can and use one of the many alternatives.
[Version 2.0.1]

8 Replies

burypromote
+2
Web Lite Solutions Corp. (developer) replied on 24 Oct 2013
This is not correct. Version 2.0.1 does not require you to have Java installed on your system at all. It is a self-contained application.
burypromote
-3

+398
Derekcurrie replied on 24 Oct 2013
Then when I ran it, the application insisted upon, and did, install its own Java directly into my System because why? I had to scour it all out again and drag a replaced file out of another installation of OS X Mavericks 10.9.0 to return my System folder to normal.
burypromote
+1
Web Lite Solutions Corp. (developer) replied on 24 Oct 2013
What do you mean by "insisted upon"? Did it prompt you in any way? What files are you referring to (exactly) that you say were installed in your system?

I repeat. PDF OCR X 2.x (which is the version you are commenting on) does *not* install Java in your system. It is a self contained application that does not depend on any outside libraries other than standard OS X frameworks that are already present in the OS. It does install language files for the OCR inside its designated application support directory, and it will add some preferences files (to store user preferences) at the designated locations, but none of this is contrary to OS X application guidelines, and none of it is a security risk to your system.
burypromote

+398
Derekcurrie replied on 27 Oct 2013
Details about the situation: If you've installed OS X Mavericks 10.9.0, a full installation of Java is not provided. I can provide details about the few files Apple does provide, upon request. Running PDF OCR X triggers OS X Software Update to pop up a box asking if you would like to install Java 6 into your system. If you agree, then a full installation of Java 6 is installed. Without this installation of Java 6, PDF OCR X (at least at version 2.0.1, I have not and will not test further versions) will not run.

I don't know if this is also the case with OS X 10.8.5 after the recent Apple Java update as I installed 10.9.0 on top of it. I know this does NOT occur with 10.7.5 after the recent Apple Java update.

The problem with Java 6 is that lousy Oracle has declared it EOL, or End Of Life and is no longer supporting it. It is known to have several security holes, which is why Apple does not provide it with OS X Mavericks 10.9.0. It is possible Apple themselves, via their open source project of JRE (Java Runtime Engine) patched the numerous security holes. But I have no indication that this is the case. Perhaps a Java security expert knows further details.

Meanwhile, to Web Lite Solutions Corp., you have not represented all of the facts of the matter. I have nothing against your software. But I despise Oracle for what they have done to Java, which is to turn it into the single most dangerous software anyone can install on their Mac. The Internet plug-in is the worst danger, well documented to be able to allow bots to infect Macs, depending upon the security hole involved. But because of the numerous security holes reported to exist in Java 6, I cannot recommend anyone install it on their Mac even to run applications. Trojan horse malware being perpetrated against Mac users continues. It is easy for such malware to take advantage of reported existing Java 6 security holes and bot a Mac, never a good thing.

I recommend Mac users avoid Java whenever it is not required to run software for Mac. I hope Web Lite Solutions Corp can move away from Java and use a safer programming language and system for their software. That's the sad state of Oracle's rendition of Java at this time. I'm sorry to be the messenger of bad news. But I'd rather take the undeserved hits for reporting the problem than leave everyone ignorant of the problem.

So hit me some more. I'm glad to be of service to those who appreciate the warning. So there.
burypromote
+2
Web Lite Solutions Corp. (developer) replied on 27 Oct 2013
The only explanation I can think of is that you were using version 1.9.36. One of the primary goals of 2.0 was to eliminate the dependency on Apple's Java 6 because it is deprecated. The App store version of PDF OCR X has been running inside the sandbox since version 1.9.32 (i.e. it doesn't depend on Apple's Java), but the non-appstore version still depended on Java. Since we knew that Java 6 was deprecated and that eventually (whether in Maverick or in a future version) you wouldn't be able to use it at all, we made it a key requirement in 2.0 to not depend on any deprecated APIs including Java 6.

We keep clean copies of Mavericks, Mountain Lion, Lion, and Snow Leopard for testing and am 100% certain that 2.0.x does not require Java. None of our test installs have Java installed, and the app works fine on them. Again, it was a *key* requirement.

Check the version number on PDF OCR X that you tried (i.e. get info), and I'm sure you'll see that it was 1.9.36. Not 2.0.1.
burypromote

+188
Prince_Isaac replied on 15 Nov 2013
@Derek... I'm interested in your points above. I have no idea if I have the Oracle Java that you're describing on my 10.9 system (which, like yours, was installed over 10.8.5). Can you point me to the directory where I could check to see if java is installed? Or describe other means of discovering it?

Thanks.
burypromote

+99
MacUpdate-Jess replied on 16 Nov 2013
@Prince_Isaac…

I'm glad you asked about Derekcurrie's claims. I was going to post this comment a couple of weeks ago, but refrained. You've given me a perfect opening. :-)

- Right-click (Ctrl-click) the application and select "Show Package Contents" from the contextual menu.
(The following is easiest if you switch your Finder window to column view [Cmd-3]….)
- Click on the Contents directory.
- Click on Java, and you'll find all the necessary JAR files for the application.
- Click on the PlugIns directory, and you'll see a subdirectory containing JDK 1.7. If you follow that trail, you'll find many JAR and dylib files.
burypromote
Web Lite Solutions Corp. (developer) replied on 16 Nov 2013
@MacUpdate-Jess, @Prince_Isaac ...

There is a JRE bundled inside the application, but this is very different than requiring Java to be installed on the system. There are no security issues with Java when used in this way. Any security issues you may have heard about with Java are related to the Java Browser Plugin (i.e. when run inside the web browser). PDF OCR X bundles its own JRE inside the application bundle so that your system doesn't need to have Java installed.

The claims that @Derekcurrie made was that PDF OCR X somehow installs Java in your System folder. This claim is false. PDF OCR X does not install Java anywhere outside the application bundle itself, which is no different than any other library that any other application may have bundled with it.
burypromote
+1

+1

DonAndrews8059 reviewed on 09 Sep 2013
It does exactly what it says it does, does it well and without problems. You do need to have a good quality document to start with, but this is the same with all OCR software I have used.
[Version 1.9.35]


burypromote
+1

+1

mwgeorguihotmailcom4975 reviewed on 29 Jul 2013
I was in a hurry to get 2 scanned sheets converted to text and PDF OCR X helped me out. Both files were converted in less than 30 secs. Thank you for this awesome program! Easily 5/5!
[Version 1.9.35]


burypromote

+1

cravenshavengmailcom9273 reviewed on 05 Jul 2013
Although this product was not what I needed at the time, it was very easy to use and understand.

The customer service from Steve Hannah from this company was stellar!!

I definitely will watch for their products.
[Version 1.9.35]


burypromote

+1

JoeAnucha reviewed on 07 Feb 2013
it just works.
[Version 1.9.35]


burypromote

+25
Tommy-Montag commented on 01 Feb 2013
Also requires Java.
[Version 1.9.35]


burypromote
+1

+10

Reddog99 reviewed on 29 May 2012
Seems to function about as good as any other OCR program I've used. The output quality is highly dependent on the input quality, which is to be expected. If you have fuzzy characters from a scan, then don't expect miracles, it won't perform too good. I was pleasantly surprised to discover that the program will also do OCR on .jpg images of text. Just drop the .jpg onto the icon and it will be automatically converted into a pdf before the OCR operation. Obviously, the program has limitations, but you'd be hard pressed to beat FREE.
[Version 1.9.25]


burypromote

-7
Gabriel2575 had trouble on 06 Apr 2010
It is strange I did download it as it said that the platform is PPC/Intel and when I open it it allways say that PDF OCR 1.7 works only for Intel system
I wonder what I do wrong
gglatendresse@shaw.ca


Date: 06 Apr 2010
Platform: PPC/Intel
[Version 1.7]

1 Reply

burypromote
+1
Shannah replied on 07 Apr 2010
Currently it is Intel only. I'm not sure why MacUpdate doesn't update this.
burypromote

+1
Jhersco had trouble on 15 Mar 2010
v1.6 won't quit. Requires force quitting.
[Version 1.6]


burypromote

+147
Gregm had trouble on 25 Feb 2010
PPC Only

per the developer:

" Unfortunately the current version only runs on Intel based macs. We are working towards removing some dependencies to make it work on PowerPC, but for now, unfortunately it's Intel only."
[Version 1.4.1]



SchwarzL6795 rated on 06 Nov 2013

[Version 2.0.7]



+14

Birdy53 rated on 16 Jan 2013

[Version 1.9.32]



cyberclaus rated on 06 Sep 2012

[Version 1.9.32]



+5

Bmonno rated on 06 Jul 2012

[Version 1.9.25]



+62

Morgan_Alex rated on 25 Mar 2012

[Version 1.9.24]



+15

Quiiick rated on 24 Mar 2012

[Version 1.9.24]



bkir016 rated on 19 Jan 2012

[Version 1.9.19]



+6

Wikno rated on 07 Sep 2011

[Version 1.9.14]



Johan Jarmander rated on 10 Feb 2011

[Version 1.9.10]



-5

Rahn rated on 20 Dec 2010

[Version 1.9.9]


Downloads:80,168
Version Downloads:9,777
Type:Business : Word Processing
License:Free
Date:15 Nov 2013
Platform:Intel 64 / Intel 32 / OS X
Price:Free0.00
Overall (Version 2.x):
Features:
Ease of Use:
Value:
Stability:
Displaying 1-10 of 48
1 2 3 4 5 >
Displaying 1-3 of 3
Displaying 1-10 of 11
1 2 >
-
-
-
Please login or create a new
MacUpdate Member account
to use this feature
Watch Lists are available to
MacUpdate Desktop Members
Upgrade Now
Install with MacUpdate Desktop.
Save time moving files & cleaning
up space wasting archives.
PDF OCR X is a simple drag-and-drop utility that converts your PDFs into text or searchable PDF documents. It uses advanced OCR (optical character recognition) technology to extract the text of the PDF even if that text is contained in an image. This is particularly useful for dealing with PDFs that were created via a Scan-to-PDF function in a scanner or photo copier. Supports over 60 languages including English, French, German, Dutch, Spanish, Portuguese, Basque, Vietnamese, and Italian currently. The OCR engine is based on Tesseract.

Note: The free community edition is limited to PDFs of 1 page or less. The Enterprise version ($29.99) has no limit on the size of the PDF.


- -