Apache Impala // IoT Data Analytics - Hype or Truly Transformative

Hosted By
Eric C. and 3 others
Details

Apache Impala:

Apache Impala (Incubating) raises the bar for SQL query performance on Apache Hadoop while retaining a familiar user experience. With Impala, you can run real-time SQL queries, including SELECT, JOIN, and aggregate functions, against data stored in HDFS, Apache Kudu, or Apache HBase. In this session, Ryan will take a deep dive into Impala internals, including architecture, query planning, and query execution.

Speaker Bio

Ryan Wieber is a Systems Engineer at Cloudera. He has over 15 years of experience as a Data Warehouse Architect, ETL Architect, and Database Administrator, and has worked at companies such as Oracle, Best Buy, Thomson Reuters, and Boston Scientific. Ryan has a bachelor's degree in Computer Science and Law Enforcement from Minnesota State University and has completed master's coursework on spatial database systems and clustered database systems at the University of Minnesota.

IoT Data Analytics - Hype or Truly Transformative:

The Internet of Things has the potential to be one of the biggest technological revolutions of recent times, enabling businesses to work smarter, faster, and more profitably. However, IoT data represents a different paradigm for enterprises: the inherent characteristics of data generated by IoT and connected devices challenge traditional data management approaches and methodologies. Key characteristics of IoT data include diverse data structures and schemas that vary by source; large volumes of intermittent data streams, predominantly time-series data from varied sources; and multi-modal data acquisition through streaming, batch, and request-response methods. In this session, Dave will discuss the data management value chain and logical architecture for IoT.

Speaker Bio

Dave Shuman is an Industry Leader for Manufacturing & IoT at Cloudera, the leading Big Data vendor. He advises customers globally as they introduce Big Data solutions and adopt enterprise-wide Big Data delivery capabilities for supply chain, logistics, manufacturing, and sensor-driven use cases. Prior to joining Cloudera, Dave built and ran Vision Chain, an innovative data warehousing & insights start-up, serving as Chief Operations Officer, VP of Field, and VP of Product over his 11-year tenure. Dave has worked broadly on data engineering and advanced analytics capabilities that leverage existing and emerging sources of data, and on the iterative cycles that enable rapid prototyping and development. Previously, Dave developed ecommerce applications and business processes at the dawn of the ecommerce era with enews.com (a Barnes and Noble company), managing software development, operations, and retail analytics. Dave has an extensive background in the Hadoop ecosystem, business intelligence applications, database architecture, logical and physical database design, and data warehousing. He holds an M.B.A. with a concentration in Information Systems from Temple University and a B.A. from Earlham College.

Food and drink are generously provided by Cloudera.

BAM-Big Data, Advanced Analytics, Machine Learning