Data Exploration and Dashboard Creation in a Cloud Based Data Warehouse


Details
In this meetup, we’ll walk through loading synthetic data into a virtual warehouse, running ad hoc queries to identify trends in that data, and creating a dashboard to present the results with a built-in data visualization tool. We’ll put ourselves in the shoes of a data scientist investigating a recent downturn in company productivity, following a process similar to one you might see at any data-driven company. Along the way, you’ll see Apache Spark used to ingest and transform data, Apache Impala and Apache Hue used to query that data on easily created virtual compute resources, and a new built-in data visualization tool used to wrap it all up and present the results.
Agenda (all times in EST)
06:00 Welcome and Announcements
06:05 Introductions and Agenda
06:10 Nicolas Pelaez: Data Exploration and Dashboard Creation in a Cloud Based Data Warehouse
07:00 Demo: Exploring Data and Creating a Dashboard in the Cloud
07:15 Q&A
07:20 Raffle of door prizes (must participate to win)
07:25 Preview of upcoming Meetups, concluding remarks
For a preview of the content we'll be covering, see the following resources:
Video:
https://youtu.be/msLhk6jPkxk
Cloudera Users Page:
https://www.cloudera.com/users.html
Come join us to see the process we’ve created; we hope it inspires some new ways of thinking!
To do our part to help flatten the curve of COVID-19, we are holding this meetup as an online event. The URL for joining the videoconference will be posted on this page no later than 48 hours before the event's start time.
