Skip to main content

Hadoop

Hadoop Versions​

  1. Hadoop 1.0: Includes Common, HDFS, MapReduce
  2. Hadoop 2.0: Includes Common, HDFS, MapReduce, YARN. Hadoop 1.0 and 2.0 are incompatible. Since Hadoop 2.7, includes Ozone. Since Hadoop 2.10, includes Submarine.
  3. Hadoop 3.0: Includes Common, HDFS, MapReduce, YARN, and Ozone module. Latest Hadoop 3.0 includes Submarine.

Hadoop Modules​

  1. Hadoop Common: Supports other modules
  2. HDFS (Hadoop Distributed File System): Distributed storage
  3. Hadoop YARN: Task scheduling and resource management
  4. Hadoop MapReduce: Distributed computing framework based on YARN
  5. Hadoop Ozone: Object storage
  6. Hadoop Submarine: Machine Learning engine

Hadoop Installation​

Standalone and Pseudo-Distributed​

  • Hadoop Pseudo-Distributed Installation.txt
  • Hadoop Fully Distributed Setup.txt

Differences between Hadoop 2.0 and 3.0​

Agreement
The code part of this work is licensed under Apache License 2.0 . You may freely modify and redistribute the code, and use it for commercial purposes, provided that you comply with the license. However, you are required to:
  • Attribution: Retain the original author's signature and code source information in the original and derivative code.
  • Preserve License: Retain the Apache 2.0 license file in the original and derivative code.
The documentation part of this work is licensed under Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License . You may freely share, including copying and distributing this work in any medium or format, and freely adapt, remix, transform, and build upon the material. However, you are required to:
  • Attribution: Give appropriate credit, provide a link to the license, and indicate if changes were made.
  • NonCommercial: You may not use the material for commercial purposes. For commercial use, please contact the author.
  • ShareAlike: If you remix, transform, or build upon the material, you must distribute your contributions under the same license as the original.