An Introduction to Impala – Low Latency Queries for Apache Hadoop

Hosted By
Matthew R. and Pitt F.

Details

For the first time, the Cloudera Impala project brings scalable parallel database technology, the foundation of Google's Dremel and of commercial analytic DBMSs, to the Hadoop community.

With Impala, the Hadoop community now has an open-source codebase that lets users run low-latency queries against data stored in HDFS and Apache HBase using familiar SQL operators.

Bio:
Matt Harris is a Systems Engineer at Cloudera, where he helps organizations understand and adopt Hadoop. Prior to Cloudera, Matt was a Systems Engineer at Composite Software and a SCADA Engineer at Peoples Energy. Matt has an MS in Computer Science from DePaul University and a BS in Mechanical Engineering from Purdue University.

(I'm super excited about this talk! - matthew)

Big Data Madison