May 2011 HUG Agenda:
- 6:00 - 6:30 - Socialize over food and beer(s)
- 6:30 - 7:00 - Oozie 3.0
- 7:00 - 7:30 - Analyzing Hadoop Source Code with Hadoop
- 7:30 - 7:40 - Big Data Camp before Hadoop Summit
- 7:40 - 7:55 - Hadoop Summit 2011 - Track Agendas
Oozie 3.0: Oozie, a Hadoop workflow scheduling system, currently provides two levels of abstractions for Hadoop-based application development. Oozie workflow management layer allows users to specify job dependency in a directed acyclic graph (DAG), which can be executed by Oozie server accordingly. Secondly, users can schedule any workflow based on time frequency or/and data dependency using Oozie coordinator layer. Oozie 3.0 introduces a new abstraction called bundle to batch a set of coordinator applications. This feature is critical to large-scale data processing. In addition, Oozie 3.0 includes enhancements to the stability and scalability of Oozie servers that will benefit all users.
Presenter: Mohammad Islam, Yahoo!
Analyzing Hadoop Source Code with Hadoop: We analyzed the Hadoop source code and its development over time and found some interesting and fun facts we want to share with the community. This talk will illustrate text and related analytics with Hadoop on Hadoop to reveal the true hidden secrets of the elephant.
Presenter: Stefan Groschupf, Datameer
Big Data Camp before Hadoop Summit: BigDataCamp is an unconference for users of Hadoop and related technologies to exchange ideas in a loosely distributed format. Led by CloudCamp's Dave Nielsen, attendees are encouraged to share thoughts in open discussions with pre-defined and majority-vote topics, including best practices in application development and advanced analytics.
Presenter: Dave Nielsen, BigDataCamp
Track Agenda Hadoop Summit 2011: Want to find out if your abstract made it to the list of presentations selected for the Summit? This is your opportunity. Come find what the track agenda looks like for the Summit!
Presenter: Avik Dey, Yahoo!
Yahoo Campus Map:
Location on Wikimapia: