Skip to content

Details

This meetup focuses on Scalability and technologies to enable handling large amounts of data: Hadoop, HBase, distributed NoSQL databases, and more!

There's not only a focus on technology, but also everything surrounding it including operations, management, business use cases, and more.

We've had great success in the past, and are growing quickly! Previous guests were from Twitter, LinkedIn, Amazon, Cloudant, Microsoft, 10gen/MongoDB, and more.

This month's guests:

Cloudera Impala

The Cloudera Impala project is for the first time making scalable parallel database technology, which is the underpinning of Google's Dremel as well as that of commercial analytic DBMSs, available to the Hadoop community. With Impala, the Hadoop community now has an open-sourced codebase that allows users to issue low-latency queries to data stored in HDFS and Apache HBase using familiar SQL operators.

Justin Erickson's talk will start out with an overview of Impala from the user's perspective, followed by a presentation of Impala's architecture and implementation, and will conclude with a comparison of Impala with Apache Hive, commercial MapReduce alternatives, and traditional data warehouse infrastructure.

Russell Jurney (https://twitter.com/rjurney) (author of Hortonworks Agile Data (http://shop.oreilly.com/product/0636920025054.do)) can't make it this month...

John Nestor - Persist Software OStore is a new NoSQL database and continuous Map-Reduce system currently under development written in Scala and Akka. An overview of the architecture will be followed by a demo.

Our format is flexible: We usually have speakers who talk for ~30 minutes each and then do Q+A, plus discussion.

There'll be beer afterwards, of course!

Doors open 30 minutes ahead of show-time. Please show up at least 15 minutes early out of respect for our first speaker.

Related topics

You may also like