addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupshelp-with-circleimageimagesinstagramlinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1ShapeoutlookpersonJoin Group on CardStartprice-ribbonShapeShapeShapeShapeImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruserwarningyahoo

Bay Area Hadoop User Group (HUG) May 2011 Meetup

May 2011 HUG Agenda:

  • 6:00 - 6:30 - Socialize over food and beer(s)
  • 6:30 - 7:00 - Oozie 3.0
  • 7:00 - 7:30 - Analyzing Hadoop Source Code with Hadoop
  • 7:30 - 7:40 - Big Data Camp before Hadoop Summit
  • 7:40 - 7:55 - Hadoop Summit 2011 - Track Agendas

Oozie 3.0: Oozie, a Hadoop workflow scheduling system, currently provides two levels of abstractions for Hadoop-based application development.  Oozie workflow management layer allows users to specify job dependency in a directed acyclic graph (DAG), which can be executed by Oozie server accordingly. Secondly, users can schedule any workflow based on time frequency or/and data dependency using Oozie coordinator layer. Oozie 3.0 introduces a new abstraction called bundle to batch a set of coordinator applications. This feature is critical to large-scale data processing. In addition, Oozie 3.0 includes enhancements to the stability and scalability of Oozie servers that will benefit all users.

Presenter: Mohammad Islam, Yahoo!

Analyzing Hadoop Source Code with Hadoop: We analyzed the Hadoop source code and its development over time and found some interesting and fun facts we want to share with the community.  This talk will illustrate text and related analytics with Hadoop on Hadoop to reveal the true hidden secrets of the elephant.

Presenter: Stefan Groschupf, Datameer

Big Data Camp before Hadoop Summit: BigDataCamp is an unconference for users of Hadoop and related technologies to exchange ideas in a loosely distributed format. Led by CloudCamp's Dave Nielsen, attendees are encouraged to share thoughts in open discussions with pre-defined and majority-vote topics, including best practices in application development and advanced analytics.

Presenter: Dave Nielsen, BigDataCamp

Track Agenda Hadoop Summit 2011: Want to find out if your abstract made it to the list of presentations selected for the Summit? This is your opportunity. Come find what the track agenda looks like for the Summit!

Presenter: Avik Dey, Yahoo!

Yahoo Campus Map:

Detail map

Location on Wikimapia:[masked]&lon=[masked]&z=18&l=0&m=b&search=yahoo

Join or login to comment.

  • A former member
    A former member

    Very good meeting. A bit of chatting in the back of the cafeteria, but the presentations went well. Would like to hear more about Oozie 3.0 and the types of jobs run through (haven't considered jobs running for a year, for example). Datameer presentation was clever and engaging and it would have been nice to see more of the process of using their product, even if that isn't traditionally done.

    May 19, 2011

  • Patrick N.

    Good presentation although noisy at time. Great meeting place.

    May 19, 2011

Our Sponsors

  • Yahoo

    Free admission, Space, Pizza and Beer

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy