Jump to content

Apache OODT

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by Chrismattmann (talk | contribs) at 22:22, 27 June 2016 (-- improve page with more history - WIP). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.
OODT
Developer(s)Apache Software Foundation
Stable release
1.0 / June 27, 2016; 8 years ago (2016-06-27)
Written inJava
Operating systemCross-platform
TypeSearch and index API
LicenseApache License 2.0
Websiteoodt.apache.org

The Apache Object Oriented Data Technology (OODT) is an open source data management system framework that is managed by the Apache Software Foundation. OODT was originally developed at NASA Jet Propulsion Laboratory to support capturing, processing and sharing of data for NASA's scientific archives.

History

The project started out as an internal NASA Jet Propulsion Laboratory project incepted by Daniel J. Crichton, Sean Kelly and Steve Hughes. The early focus of the effort was on information integration and search using XML as described in Crichton et al.'s paper in the CODATA meeting in 2000[1].

After deploying OODT to the Planetary Data System and to the National Cancer Institute EDRN or Early Detection Research Network project, OODT in 2005 moved into the era of large scale data processing and management via NASA's Orbiting Carbon Observatory (OCO) project. OODT's role on OCO was to usher in a new data management processing framework that instead of 10s of jobs per day and 10s of gigabytes of data would handle 10,000 jobs per day and 100s of _terabytes_ of data. This required an overhaul of OODT to support these new requirements. Dr. Chris Mattmann at NASA JPL led a team of 3-4 developers between 2005-2009 and completely re-engineered OODT to support these new requirements.

Influenced by the emerging efforts in Apache Nutch and Hadoop which Mattmann participated in, OODT was given an overhaul making it more amenable towards Apache Software Foundation like projects. In addition, Mattmann had a close relationship with Dr. Justin Erenkrantz, who as the Apache Software Foundation President at the time, and the idea to bring OODT to the Apache Software Foundation emerged. In 2009, Mattmann and his team received approval from NASA and from JPL to bring OODT to Apache making it the first NASA project to be stewarded by the foundation. Seven years later, the project has released a version 1.0.

Features

OODT focuses on two canonical use cases: Large Data Processing and on Information Integration. Both were described in Mattmann's ICSE 2006

OODT is written in the Java, and through its REST API <> used in languages.<ref>

Notable uses

Mention Dawn project. Mention Tweets for OCO-2 processing. Mention JPSS GRAVITE.

Mention OODT use bioinformatics at knowledgeent.

References

  1. ^ Crichton, Daniel; Hughes, John; Hyon, Jason; Kelly, Sean (2000). "Science Search and Retrieval using XML". The Second National Conference on Scientific and Technical Data, US National Committee for CODATA, National Research Council.