Hadoop - Looking to the Future, YARN: Past, Present and Future

Name: Hadoop - Looking to the Future, YARN: Past, Present and Future
Start: 2015-04-17T19:00:00+02:00
End: 2015-04-17T22:00:00+02:00
Location: Prezi House of Ideas

Hosted by Tamas N. and 2 others

Budapest Data Science Meetup

Details

This will be an English-language event, co-organized with the Big Data Meetup Budapest (https://www.meetup.com/Big-Data-Meetup-Budapest).

Hadoop - Looking to the Future (Arun C Murthy/Hortonworks)

The Apache Hadoop ecosystem began as just HDFS & MapReduce nearly 10 years ago in 2006.

Very much like the Ship of Theseus ( http://en.wikipedia.org/wiki/Ship_of_Theseus ), Hadoop has undergone an incredible amount of transformation, from multi-purpose YARN to interactive SQL with Hive/Tez to machine learning with Spark.

Much more lies ahead: whether you want sub-second SQL with Hive, effective use of SSDs and memory in HDFS, or metadata-driven security policies managed in Ranger, the Hadoop ecosystem in the Apache Software Foundation continues to evolve to meet new challenges and use cases.

Arun C Murthy has been involved with Apache Hadoop since the beginning of the project, nearly 10 years now. In the beginning he led MapReduce, went on to create YARN, and then drove Tez & the Stinger effort to get to interactive & sub-second Hive. Recently he has been very involved in the Metadata and Governance efforts. In between he founded Hortonworks, the first publicly traded Hadoop distribution company.

YARN: Past, Present and Future (Vinod Kumar Vavilapalli/Hortonworks)

Apache Hadoop YARN is a distributed, multi-tenant and fault-tolerant resource-management platform.

In this talk, we’ll first cover how YARN stands out today as an enterprise data processing platform and how YARN has been deployed and utilized in real production clusters.

Then, we’ll move on to recent efforts and a few forward-looking features that further YARN as a first-class data operating system: rolling upgrades, support for long-lived services like HBase & Storm, workload scheduling features like node labels and preemption, a timeline service for application monitoring and metrics, and resource scheduling & isolation on CPU, disks and network.

Vinod Kumar Vavilapalli is the Hadoop YARN and MapReduce guy at Hortonworks. He is a long-term Hadoop contributor at Apache, a Hadoop committer and a member of the Apache Hadoop PMC. He has a Bachelor's degree in Computer Science and Engineering from the Indian Institute of Technology Roorkee. He has been working on Hadoop for more than 6 years and he still has fun doing it. Straight out of college, he joined the Hadoop team at Yahoo! Bangalore, where he worked on HadoopOnDemand, Hadoop-0.20, CapacityScheduler, and Hadoop security, before Hortonworks happened. He is passionate about using computers to change the world for the better, bit by bit. He is reachable on Twitter at @tshooter.
