OpenPipeline
OpenPipeline is new open source software for crawling, parsing, analyzing and routing documents. It ties together otherwise incomplete solutions for enterprise search and document processing. OpenPipeline provides a common architecture for connectors to data sources, file filters, text analyzers and modules to distribute documents across a network. It includes a job scheduler and a full UI with a point-and-click interface.
Proprietary alternative(s):
Velocity Search Platform
Proprietary alternative(s):
FAST Enterprise Search Platform
Open Source alternative(s):
Apache Solr
Open Source alternative(s):
Nutch
Saas alternative(s):
SearchBlox Cloud Edition