SPEAKER: Marcel Kornacker - Cloudera
We are very excited that the thirteenth distinguished speaker in this series will be Marcel Kornacker. Doors open at 6:30pm, talk begins at 7:00pm. Drinks and light food provided.
TITLE: Cloudera Impala: Real-Time Queries in Apache Hadoop
ABSTRACT: The Cloudera Impala project is for the first time making scalable parallel database technology, which is the underpinning of Google's Dremel as well as that of commercial analytic DBMSs, available to the Hadoop community. With Impala, the Hadoop community now has an open-sourced codebase that allows users to issue low-latency queries to data stored in HDFS and Apache HBase using familiar SQL operators.
This talk will start out with an overview of Impala from the user's perspective, followed by a presentation of Impala's architecture and implementation, and will conclude with a comparison of Impala with Apache Hive, commercial MapReduce alternatives, and traditional data warehouse infrastructure.
Bio: Marcel Kornacker is a tech lead at Cloudera for new products and creator of the Cloudera Impala project. He graduated in 2000 with a PhD in databases from UC Berkeley, followed by engineering jobs at a few database-related startup companies. Marcel joined Google in 2003, where he worked on several ads serving and storage infrastructure projects. His last engagement was as the tech lead for the distributed query engine component of Google's F1 project.
More background on Marcel: