YARN is the new resource management and execution framework of Apache Hadoop. This talk provides an architectural overview of YARN and its key components. It makes a case for migrating to YARN, particularly from MapReduce v1, for improved scalability and cluster utilization as well as multi-tenancy. The talk also goes through a few operational aspects of running YARN and concludes with an update on key features currently under development.
Karthik Kambatla is a Software Engineer at Cloudera, Apache Hadoop Committer, and a PhD student. He works primarily on scheduling and resource management in the Hadoop ecosystem.
Sponsored by Shopzilla