addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwchatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-crosscrosseditemptyheartfacebookfolderfullheartglobegmailgoogleimagesinstagramlinklocation-pinmagnifying-glassmailminusmoremuplabelShape 3 + Rectangle 1outlookpersonplusprice-ribbonImported LayersImported LayersImported Layersshieldstartrashtriangle-downtriangle-uptwitteruseryahoo

Bay Area Hadoop User Group (HUG) May Meetup

Detailed agenda and summaries to follow. General agenda:

  • 6:00 - 6:30 - Socialize over food and beer(s)
  • 6:30 - 7:00 - The Changing Big Data Landscape - empowering the business user with analytics-driven insight
  • 7:00 - 7:30 - Oozie: Towards a Scalable Workflow Scheduling System for Hadoop

The Changing Big Data Landscape - empowering the business user with analytics-driven insight

The exponential growth of structured and unstructured data has overwhelmed traditional BI solutions. Data analysts, managers and executives want to be able to easily correlate the new unstructured data with legacy data sitting on tape or in platters to gain complete insights into customer behavior, business and IT operations without having to worry about the economics.
This session will discuss:
- the evolution of Big Data and the challenges it presents to business users
- the role Hadoop and NoSQL technologies play today
- the challenges that result from the growth of this data and possible solutions.

Presenter: Matthew Schumpert, Datameer

Oozie: Towards a Scalable Workflow Scheduling System for Hadoop

During the past three years Oozie has become the de-facto workflow scheduling system for Hadoop. Oozie has proven itself as a scalable, secure and multi-tenant service. Oozie stably processes more than 45% of the jobs run across more than 25 Hadoop clusters in Yahoo. At the same time adoption in other enterprises has increased substantially since Oozie was contributed to the Apache community. We attribute these achievements to design decisions that was selected to be presented at a workshop during the ACM/SIGMOD conference. This presentation covers the key architectural design choices described in the paper. Operational metrics will be used to illustrate production experience at Yahoo, and we will also include a quick tutorial.

Presenter: Mohammad Islam and Virag Kothari, Yahoo!

Yahoo Campus Map:

Detail map

Location on Wikimapia:[masked]&lon=[masked]&z=18&l=0&m=b&search=yahoo

Join or login to comment.

  • A former member
    A former member

    Anybody has the link to the video of this meeting?

    August 16, 2012

  • Nivas

    Caught only the last few minutes of the 1st presentation. So can't judge that.
    The 2nd presentation felt very monotonous, hard to understand/follow & as a result uninteresting (failed to grab my attention, even tho i tried). Feels like the quality of the presentations could be a lot better, more professional, more focus on higher level use-cases (vs details of installation, etc).

    May 17, 2012

  • Serge M.

    Sound setup is much better this time around. Thank you Yahoo!

    May 17, 2012

  • Sachin N.

    interested in the video as well

    May 16, 2012

  • A former member
    A former member

    I can video record for the group. I will check with organizers and see if we can get the OK from the presenters.

    1 · May 14, 2012

Our Sponsors

  • Yahoo

    Free admission, Space, Pizza and Beer

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy