The main differences between Hadoop 1.0 and Hadoop 2.0 are as follows.
Hadoop 2.0 brings a major change: it introduces YARN as a new layer in the Hadoop ecosystem.
- YARN was developed primarily to take over part of the work done by the MapReduce framework in Hadoop 1.0. Cluster resource management is now handled by YARN instead of MapReduce.
- YARN splits the two major responsibilities of the overburdened JobTracker (resource management and job scheduling/monitoring) into two separate daemons: a global ResourceManager and a per-application ApplicationMaster.
- There are no more fixed MapReduce slots. YARN provides a central resource manager, so multiple applications can now run on Hadoop at the same time, all sharing a common pool of cluster resources.
- YARN allows non-MapReduce distributed applications (applications that do not follow the map-reduce paradigm, such as SQL query, real-time, and streaming applications) to run on a Hadoop cluster, which was not possible in Hadoop 1.0.
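
The move from fixed slots to a shared resource pool can be illustrated with a small toy model. This is only a sketch and does not use any Hadoop API: the class names `SlotNode` and `ContainerNode`, the slot counts, and the memory sizes are all hypothetical values chosen for illustration, and real YARN scheduling also accounts for CPU, queues, and data locality.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class SlotNode:
    """Toy Hadoop 1.0-style node: capacity is pre-split into fixed slots."""
    map_slots: int
    reduce_slots: int

    def runnable(self, map_tasks: int, reduce_tasks: int) -> int:
        # A map task can only occupy a map slot, and a reduce task only a
        # reduce slot, so idle slots of one kind cannot help the other kind.
        return min(map_tasks, self.map_slots) + min(reduce_tasks, self.reduce_slots)


@dataclass
class ContainerNode:
    """Toy YARN-style node: one memory pool, carved into containers on demand."""
    memory_mb: int

    def runnable(self, requests_mb: List[int]) -> int:
        free = self.memory_mb
        started = 0
        for request in requests_mb:
            # Any application type may ask for a container; the only
            # question in this model is whether enough memory is left.
            if request <= free:
                free -= request
                started += 1
        return started


# Hypothetical node: 2 map slots + 2 reduce slots versus an 8 GB shared pool.
slot_node = SlotNode(map_slots=2, reduce_slots=2)
yarn_node = ContainerNode(memory_mb=8192)

# Four map tasks and no reduce tasks: the reduce slots sit idle.
print(slot_node.runnable(map_tasks=4, reduce_tasks=0))  # 2

# Four 2 GB container requests fit in the shared pool together.
print(yarn_node.runnable([2048, 2048, 2048, 2048]))     # 4
```

In this model the slot-based node runs only two of the four map tasks because its reduce slots cannot be reused, while the container-based node runs all four from the same pool.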
This course covers the installation of both Hadoop 1.0 and Hadoop 2.0, together with running MapReduce applications on them.