Bay Area Hadoop User Group (HUG) Monthly Meetup

Agenda: 

  • 6:00 - 6:30 - Socialize over food and beer(s)
  • 6:30 - 7:00 - Giraffa File System to Grow Hadoop Bigger
  • 7:00 - 7:30 - Apache Drill for Interactive Analysis
  • 7:30 - 8:00 - Elastic, Multi-tenant, Highly Available Hadoop on Demand

 

Session I: Giraffa File System to Grow Hadoop Bigger (6:30 - 7:00 PM)

HDFS scalability and availability are limited by its single namespace server design. Giraffa is an experimental file system that uses HBase to maintain the file system namespace in a distributed way and serves data directly from HDFS DataNodes. Giraffa is intended to provide higher scalability and availability and to support very large namespaces. The presentation will explain the Giraffa architecture and its motivation, address its main challenges, and give an update on the status of the project.

Presenter: Konstantin Shvachko (PhD), Founder, AltoScale

 

Session II: Apache Drill for Interactive Analysis (7:00 - 7:30 PM)

Apache Drill is a new open source Apache Incubator project for interactive analysis of large-scale datasets, inspired by Google's Dremel. It enables users to query terabytes of data in seconds. Apache Drill supports a broad range of data formats, including Protocol Buffers, Avro and JSON, and leverages Hadoop and HBase as data sources. Drill's primary query language, DrQL, is compatible with Google BigQuery. In this talk we provide an overview of the Drill project, including its design goals and architecture.

Presenter: Jason Frantz, Software Architect, MapR Technologies

  

Session III: Elastic, Multi-tenant, Highly Available Hadoop on Demand (7:30 - 8:00 PM)

Serengeti is an open-source project, initiated by VMware, to enable the rapid deployment of Hadoop clusters in virtual environments. While Hadoop clusters are typically run on physical machines, Serengeti aims to bridge Hadoop and virtualization, and bring the classic benefits of virtualization to the Hadoop user. Leveraging virtual machines, Serengeti-deployed clusters can be simply operated, configured for HA protection, and made elastic through the decoupling of Hadoop compute and data layers. In this talk, we explore each of these aspects of running Hadoop on a virtual platform.

Presenter: Kevin Leong, Product Manager, VMware 

 

Yahoo Campus Map:

Detail map

 

Location on Wikimapia:

http://www.wikimapia.org/#lat=37.4181633&lon=-[masked]&z=18&l=0&m=b&search=yahoo

 


  • Dave

    Save an Extra 20% with Discount Code MEETUP: Jan 28 Global Big Data Conference Santa Clara Convention Center?
    35+ Speakers, 20+ very high quality Sessions Register: http://globalbigdataconference....

    Agenda: http://globalbigdataconference....

    January 26, 2013

  • j mock

    Will there be a live feed at this meetup?

    September 18, 2012

  • Long Phung

    Siri'ous about Hadoop! Pls connect to me for opportunity details. (lphungatappledotcom)

    September 24, 2012

  • Nael Mohammad

    Can you post the slides for all three presentations?

    September 19, 2012

    • Yahoo! HUG Organizer

      Slides are on slideshare.net/ydn, the usual place for all HUG meetup slides.

      September 23, 2012

  • Corrinne Kahler

    Mostly good and interesting material. Microphone speakers have a lot of echo so it's often hard to understand.

    September 20, 2012

  • Sanjay Shroff

    Nice meetup. Would be nice to connect with attendees
    linkedin.com/in/shroffsanjay
    [masked]

    September 20, 2012

  • A former member

    I've posted a copy of my slides at

    http://www.slideshare.net/jason...

    September 20, 2012

  • Deb Biswas

    Good!

    September 19, 2012

  • Patrick Nicolas

    Excellent. The topics are relevant to real-world problems.

    September 19, 2012

  • Prashanth Kokati

    I am waitlisted, should I still come or no point?

    September 19, 2012

  • A former member

    I won't be able to go, I can give away my space... Sorry late meetings

    September 19, 2012

  • Haidar Hadi

    It will be interesting if someone can address the issue of caching, since Hadoop on demand is being presented.

    September 18, 2012

    • Yahoo! HUG Organizer

      You can ask this during the Q&A time. Kevin may be able to address it.

      September 19, 2012

  • Jeremy Taylor

    Should be interesting!

    September 19, 2012

Our Sponsors

  • Yahoo! Inc.

    Meeting space, pizza and drinks are sponsored by the Yahoo! Hadoop team.
