Hadoop Introduction
The Apache Hadoop project develops open-source software for reliable, scalable, distributed computing, including:
- Hadoop Core, our flagship sub-project, provides a distributed filesystem (HDFS) and support for the MapReduce distributed computing metaphor.
- HBase builds on Hadoop Core to provide a scalable, distributed database.
- Pig is a high-level data-flow language and execution framework for parallel computation. It is built on top of Hadoop Core.
- ZooKeeper is a highly available and reliable coordination system. Distributed applications use ZooKeeper to store and mediate updates for critical shared state.
- Hive is a data warehouse infrastructure built on Hadoop Core that provides data summarization, adhoc querying and analysis of datasets.
Hadoop Quick Download
- Hadoop Core 0.19.1
- HBase 0.19.1
- Hive SVN: http://svn.apache.org/repos/asf/hadoop/hive/trunk
- Pig 0.1.1
- ZoopKeeper 3.1.1
Documentation and Mail list
|