Let's take a moment to explore the different versions of Hadoop. The following are the best-known versions:
  • 0.20 -- A widely used release series. The 0.20 code line eventually became the Hadoop 1.0 release.
  • 0.23 - Feb 2012 -- A branch created to add new features. This branch eventually became Hadoop 2.0.
  • 1.0 -- Current production version of Hadoop
  • 2.0 -- Current development version of Hadoop


19.1. Hadoop version 1.0

This is currently the production version of Hadoop. It has been in wide use for a while and has been proven in the field. The following distributions are based on Hadoop 1.0:
  • Cloudera's CDH3 (Cloudera's Distribution of Hadoop) series
  • Hortonworks' HDP1 (Hortonworks Data Platform) series

19.2. Hadoop version 2.0

This is a development branch of Hadoop. It has been under development for a while and brings significant enhancements over Hadoop 1. Hadoop 2 has the following new features:
  • HDFS failover
  • Federated NameNode
  • MapReduce version 2 (MRv2), also known as YARN

The following distributions currently bundle Hadoop 2:
  • Cloudera's CDH4 (Cloudera's Distribution of Hadoop) series
  • Hortonworks' HDP2 (Hortonworks Data Platform) series

19.3. History of Hadoop
