Detailed agenda and summaries to follow. General agenda:
- 6:00 - 6:30 - Socialize over food and beer(s)
- 6:30 - 7:00 - Building Data Pipelines on Hadoop
- 7:00 - 7:30 - Using Standard File-Based Applications and SQL-Based Tools with Hadoop
- 7:30 - 8:00 - Overview of Oozie Qualification Process
Building Data Pipelines on Hadoop
This talk will review the components required to build large scale data pipelines on Hadoop. The talk will draw on the experience of building large scale data pipelines at Yahoo.
Presenter: Sameer Raheja, Yahoo!
Using Standard File-Based Applications and SQL-Based Tools with Hadoop
MapR makes Hadoop a more open platform by supporting industry-standard interfaces, including NFS and ODBC. The NFS interface enables users to leverage standard file-based applications, and makes it easier to get data into and out of the cluster, while the ODBC interface enables users to leverage standard BI tools and query builders. This talk covers the motivation for supporting industry-standard interfaces as well as several real-world use cases. In addition, this talk explains the technical details behind these capabilities and how they actually work.
Presenter: Tomer Shiran, MapR
Overview of Oozie Qualification Process
The talk will cover the Oozie QE practice and process in Yahoo!, the types of tests that QE perform before release, and the roadmap.
Presenter: Michelle Chiang, Yahoo!
Yahoo Campus Map:
Location on Wikimapia: