Arch free download for Mac


Version 1.9.2

Open source extension of Apache Nutch.

Arch overview

Arch is an open-source extension of Apache Nutch (a popular, highly scalable, general-purpose search engine) for intranet search. Not happy with your corporate search engine? That's not surprising; very few people are. To the best of our knowledge, no intranet search engine works as well as Google's global Web search does. There is a fundamental reason for this: the algorithms that Google and similar engines use on the global Web do not work nearly as well on intranets, which lack the statistical data those algorithms rely on. Arch (finally!) solves this problem.

It uses a novel method that delivers high-precision search results. Don't believe it? Blind test evaluation tools are included. You can deploy Arch and compare its performance to your current search engine and/or Google (on the public part of your site) using a blind test methodology. In addition to excellent search quality, Arch has many features critical for corporate environments:

  • Document level security -- users can find only documents that they are authorized to see
  • Inexpensive index updates -- Arch is able to keep indexes up to date and avoid regular complete site recrawling
  • 24/7 availability -- there is always a working index available, even if a crawl fails
  • Support for simultaneous indexing and search of multiple web sites, with the ability to search and administer any site separately, if needed. Adding and removing web sites dynamically is easy
  • An automatically generated site directory
  • Low cost support once deployed
  • Dual interface (PHP and Java) for easy deployment and customization
  • Faceted search "out of the box"
  • An extensive and extensible set of parsers for parsing a variety of file formats: HTML, PHP, PDF, MS Office, Open Office, etc.
  • A modular, plugin-based architecture that can be easily customized and extended
  • The source code is included
  • High performance and scalability -- Arch can run on computer clusters to index very large data sets

What’s new in version 1.9.2

Updated on Sep 06 2016

Version 1.9.2:

Note: Now requires OS X 10.7.3 or later running on a 64-bit Intel processor

  • The PHP interface used to put junk content in the query field on results pages when an advanced query was submitted; it now leaves this field empty for advanced queries
  • Shortened the name field (1K instead of 2K) in site DB tables; the longer field produced an index key that was too long for some MySQL configurations
  • Moved to a new version numbering scheme that aligns with the Apache Nutch version numbering scheme
  • Fixed a bug that caused problems with enforcing access permissions
  • Fixed bugs found in 1.9b
  • Added post-parsing pruning
  • Changed the order in which parsers are applied; Tika is now applied first

File size: 23.9 MB

App requirements

  • Intel 64
  • OS X 10.7.3 or later
  • Java 1.7 or later
  • Apache Ant and Ant-options packages
  • Apache Ivy
  • Minimum of 2 GB RAM