Skip to content

Getting Jiggy with Change Data Capture and Slowly Changing Dimensions

Photo of Brett Weninger
Hosted By
Brett W.
Getting Jiggy with Change Data Capture and Slowly Changing Dimensions

Details

We are thrilled to have to have Venkat Ankam speak on Change Data Capture and Slowly Changing Dimensions! A static view of technology leaves you stuck. Come learn how to manage the ever changing dimensions of the underlying sensor data!

Agenda

• 6:00 – 6:30 - Socialize over food and drink

• 6:30 – 6:45 - Announcements, Upcoming Events

• 6:45 – 7:15 - Confessions

• 7:15 – 8:30 - Venkat Ankam - Hadoop Architect, CenturyLink

• 8:30 – ??? - Continued socializing

About the Presentation
Big Data is a revolution that will transform how we live, work and think. Due to the lowering of cost of storage, explosion of Data, the time is ripe for the use of analytical techniques that could potentially yield huge productivity benefits, efficiencies and customized services for the consumers. The emergence of new data sources, increased volume of data and increased costs has led many organizations to a startling conclusion: a single enterprise data warehousing platform can no longer handle the growing breadth and depth of analytical workloads. Being purpose-built for big data analytics, Hadoop is now becoming a strategic addition to the data warehousing environment, where it is able to fulfill several roles. Hadoop's role in data warehousing is evolving rapidly in use cases like archiving data to hadoop, staging data to hadoop, offloading data to hadoop and replacing existing data warehousing systems. But, the biggest challenge in building Hadoop based data warehousing systems is how to implement the Change Data Capture (CDC) and Slowly Changing Dimensions (SCD).

In this presentation, Venkat will show you the techniques used in Change Data Capture on Hadoop using Sqoop and Hive to identify the inserts, updates and deletes. Also, how to apply inserts, deletes and updates on Hive Tables to maintain different TYPEs of SCDs. Presentation will be based on simple use case to show case these features on CDH5 cluster. He will also talk about various tools available and new features in upcoming releases of Hadoop that will enable easy implementation of CDC and SCDs on Hadoop.

About the Presenter

Venkat Ankam is a Hadoop Architect at CenturyLink. He has 16 years IT experience and has been working with Hadoop Technologies for the last 3 years. Venkat enjoys contributing and sharing knowledge to the community and is also the founder of HUG Hyderabad.

Photo of Boulder/Denver BigData Meetup group
Boulder/Denver BigData Meetup
See more events
1310 College Avenue, Boulder, CO · Boulder, CO