Bay Area Hadoop User Group (HUG) Monthly Meetup


  • 6:00 - 6:30 - Socialize over food and beer(s)
  • 6:30 - 7:00 - Giraffa File System to Grow Hadoop Bigger
  • 7:00 - 7:30 - Apache Drill for Interactive Analysis
  • 7:30 - 8:00 - Elastic, Multi-tenant, Highly Available Hadoop on Demand


Session I: Giraffa File System to Grow Hadoop Bigger (6:30 - 7:00 PM)

HDFS scalability and availability is limited by the single namespace server design. Giraffa is an experimental file system, which uses HBase to maintain the file system namespace in a distributed way and serves data directly from HDFS DataNodes. Giraffa is intended to provide higher scalabilty, availability, and maintain very large namespaces. The presentation will explain the Giraffa architecture, the motivation, will address its main challenges, and give an update on the status of the project.

Presenter: Konstantin Shvachko (PhD), Founder, AltoScale


Session II: Apache Drill for Interactive Analysis (7:00 - 7:30 PM)

Apache Drill is a new open source Apache Incubator project for interactive analysis of large-scale datasets, inspired by Google's Dremel. It enables users to query terabytes of data in seconds. Apache Drill supports a broad range of data formats, including Protocol Buffers, Avro and JSON, and leverages Hadoop and HBase as data sources. Drill's primary query language, DrQL, is compatible with Google BigQuery. In this talk we provide an overview of the Drill project, including its design goals and architecture.

Presenter: Jason Frantz, Software Architect, MapR Technologies


Session III: Elastic, Multi-tenant, Highly Available Hadoop on Demand (7:30 - 8:00 PM)

Serengeti is an open-source project, initiated by VMware, to enable the rapid deployment of Hadoop clusters in virtual environments. While Hadoop clusters are typically run on physical machines, Serengeti aims to bridge Hadoop and virtualization, and bring the classic benefits of virtualization to the Hadoop user. Leveraging virtual machines, Serengeti-deployed clusters can be simply operated, configured for HA protection, and made elastic through the decoupling of Hadoop compute and data layers. In this talk, we explore each of these aspects of running Hadoop on a virtual platform.

Presenter: Kevin Leong, Product Manager, VMware 


Yahoo Campus Map:

Detail map


Location on Wikimapia:[masked]&lon=[masked]&z=18&l=0&m=b&search=yahoo


Join or login to comment.

  • Dave

    Save an Extra 20% with Discount Code MEETUP: Jan 28 Global Big Data Conference Santa Clara Convention Center?
    35+ Speakers, 20+ very high quality Sessions Register: http://globalbigdataconference....­

    Agenda: http://globalbigdataconference....­

    January 26, 2013

  • j m.

    Will there be a live feed at this meetup?

    September 18, 2012

    • Yahoo! HUG O.

      Hi, the slides and videos are posted at the usual location

      October 3, 2012

    • Yahoo! HUG O.

      Slides at and video at I checked and they have been there.

      October 3, 2012

  • Long P.

    Siri'ous about Hadoop! Pls connect to me for opportunity details. (lphungatappledotcom)

    September 24, 2012

  • Nael M.

    Can you post the slides for all three presentations?

    September 19, 2012

    • Yahoo! HUG O.

      Slides are on, the usual place for all HUG meetup slides.

      September 23, 2012

  • Corrinne K.

    Mostly good and interesting material. Microphone speakers have a lot of echo so its often hard to understand.

    September 20, 2012

  • Sanjay S.

    Nice meetup. Would be nice to connect with attendees

    September 20, 2012

  • A former member
    A former member

    I've posted a copy of my slides at­

    September 20, 2012

  • Deb B.


    September 19, 2012

  • Patrick N.

    Excellent. The topics are relevant to real-world problems.

    September 19, 2012

  • Prashanth K.

    I am waitlisted, should I still come or no point?

    September 19, 2012

    • Sreeni

      I guess you can swing in

      September 19, 2012

    • Prashanth K.

      Thank you. Looks like I am in.

      September 19, 2012

  • A former member
    A former member

    I won't be able to go, I can give away my space... Sorry late meetings

    September 19, 2012

  • Haidar H.

    It will be interesting if someone can address the issue of caching , since Hadoop on demand is being presented.

    September 18, 2012

    • Yahoo! HUG O.

      You can ask this during the Q&A time. Kevin may be able to address it.

      September 19, 2012

  • Jeremy T.

    Should be interesting!

    September 19, 2012

Our Sponsors

  • Yahoo! Inc.

    Meeting space, pizza and drinks are sponsored by the Yahoo! Hadoop team.

People in this
Meetup are also in:

Create your own Meetup Group

Get started Learn more

Meetup has allowed me to meet people I wouldn't have met naturally - they're totally different than me.

Allison, started Women's Adventure Travel

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy