This pages briefly documents some of the open source software which
has been used in developing the bioRegent framework.
Apache Maven™: Apache Maven is a software project management and comprehension tool. Based on the concept of a project object model (POM), Maven can manage a project's build, reporting and documentation from a central piece of information.
Apache Nutch™: Apache Nutch is an open
source web-search software project. Stemming from Apache Lucene, it now builds on
Apache Solr adding web-specifics, such as a crawler, a link-graph database and parsing
support handled by Apache Tika for HTML and and array other document formats.
Apache Nutch can run on a single machine, but gains a lot of its strength from running in a .
Apache Solr™: Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene&trade project. Its major features include powerful full-text search, hit highlighting, faceted search, near real-time indexing, dynamic clustering, database integration, rich document (e.g., Word, PDF) handling, and geospatial search. Solr is highly reliable, scalable and fault tolerant, providing distributed indexing, replication and load-balanced querying, automated failover and recovery, centralized configuration and more. Solr powers the search and navigation features of many of the world's largest internet sites.
Apache Tika™: The Apache Tik toolkit detects and extracts metadata and structured text content from various documents using existing parser libraries.
Apache Tomcat™: Apache Tomcat is an open source software implementation of the Java Servlet and JavaServer Pages technologies. The Java Servlet and JavaServer Pages specifications are developed under the Java Community Process.