Impala - Straight from the Antelope's Mouth


Details
Cloudera Impala is an open source massively parallel processing (MPP) SQL query engine for Apache Hadoop.
The presentation will cover:
a) How Impala decreases latency to query data on Hadoop - Use case and the need for Impala
b) How Impala differs from other options (Dremel, Drill, others) - Key differences in approaches
c) How Impala provides an MPP style execution on Hadoop - Technical overview of Impala architecture
d) What happens to MR jobs? Does impala reduce the need for MR programming?
e) Performance statistics of Impala on large sets of data
f) Demo of Impala and future road map
Shravan (Sean) Pabba is a Systems Engineer at Cloudera. He works with Cloudera customers and prospects in helping them architect and build applications using Cloudera Hadoop Distribution. Before Cloudera Sean worked as a Solutions Architect at various companies including GigaSpaces and IBM, where he was involved in architecture, design and development of distributed and mainframe applications.
Schedule:
6:00 - Food, open discussion
6:30 - presentation
8:00 - beverages and networking at Nodding Head

Impala - Straight from the Antelope's Mouth