Hadoop provides scalable data storage using the Hadoop Distributed File System (HDFS) and fast parallel data processing on a fault-tolerant cluster of computers.

If you are a system administrator responsible for setting up or configuring a Hadoop cluster to use with Datameer, this topic provides some links you may find useful.

If you are setting up a new Hadoop system for use with Datameer, see System Requirements for the hardware and software requirements.

Getting started with Hadoop

To learn about Hadoop, you can go directly to the source at: http://hadoop.apache.org/

If you need to learn more about the HDFS architecture, see: http://hadoop.apache.org/common/docs/current/hdfs_design.html

Here are some additional links where you can learn more about Hadoop:

Additionally, see Monitoring Hadoop and Datameer to learn about monitoring Hadoop, and see Hadoop Cluster Configuration Tips to learn how to optimize Hadoop for use with Datameer.
