Skip to content

Impala - Straight from the Antelope's Mouth

Photo of Michael Reichner
Hosted By
Michael R.
Impala - Straight from the Antelope's Mouth

Details

Cloudera Impala is an open source massively parallel processing (MPP) SQL query engine for Apache Hadoop.

The presentation will cover:

a) How Impala decreases latency to query data on Hadoop - Use case and the need for Impala

b) How Impala differs from other options (Dremel, Drill, others) - Key differences in approaches

c) How Impala provides an MPP style execution on Hadoop - Technical overview of Impala architecture

d) What happens to MR jobs? Does impala reduce the need for MR programming?

e) Performance statistics of Impala on large sets of data

f) Demo of Impala and future road map

Shravan (Sean) Pabba is a Systems Engineer at Cloudera. He works with Cloudera customers and prospects in helping them architect and build applications using Cloudera Hadoop Distribution. Before Cloudera Sean worked as a Solutions Architect at various companies including GigaSpaces and IBM, where he was involved in architecture, design and development of distributed and mainframe applications.

Schedule:

6:00 - Food, open discussion

6:30 - presentation

8:00 - beverages and networking at Nodding Head

Photo of PhillyDB group
PhillyDB
See more events
Municipal Services Building
1401 JFK Blvd · Philadelphia, PA