June 2013 San Francisco Hadoop Meetup

Hosted by San Francisco Hadoop Users

Public group

This is a past event

281 people went

Location image of event venue


The June 2013 Hadoop meetup will be held Wednesday, June 12, from 6:00pm to 8:00pm. This meetup will be hosted by AdRoll at 972 Mission St, San Francisco, CA. Please sign in at the reception desk.

This meetup is the night before HBaseCon! In keeping with this theme, we plan to host two HBase-oriented talks. This meetup will not have an unconference portion.

The following talks will be presented:

"Real-World Use Cases of the Hadoop/HBase Synergy"
Derek Nelson, AdRoll

Combining the batch, heavy lifting Hadoop powerhouse with the low-latency, random access capabilities of HBase can substantially empower your data infrastructure. In this talk, we discuss how several specific AdRoll services thrive on the Hadoop/HBase synergy. Namely, pairing Hadoop with HBase allows us to dynamically aggregate uniques in near real time, efficiently track conversions, and drive our recommendations platform--all in ways that scale horizontally with explosive growth in data volume.

"Under the Hood: How KijiSchema extends HBase"
Christophe Taton, WibiData

HBase provides powerful and generic interfaces, but with great power comes great responsibility. The genericity and extent of HBase interfaces can be tricky. In particular, early designs require great care to avoid various pitfalls that may later affect the scalability and evolvability of the application. While designing such applications, we learned do's and don'ts were captured in an open-source framework named KijiSchema. KijiSchema combines several sensible and cautious design choices, with a safe, efficient, flexible and evolvable core for encoding cells and table layouts. In this talk, we will cover several design tradeoffs and internal details that we made in KijiSchema. Attendees will learn how KijiSchema can help them build powerful HBase-backed applications.