Skip to content

Creating an Automated Spark Development Environment with Analysis and Logging

Photo of Future of Data
Hosted By
Future of D. and Mark S.
Creating an Automated Spark Development Environment with Analysis and Logging

Details

In this meetup, we’re going to put ourselves in the shoes of a Data Engineer tasked by the State of Texas to create processes that extract, transform, and load publicly accessible financial data from the Paycheck Protection Program. This data can then be used in order to create reports that state elected officials can use to make decisions. We’ll also go over how you can leverage CLI tools to create a system in which you can continuously deploy your newly developed Spark jobs through some automation scripts, saving huge amounts of development and deployment time. At the end we’ll even take a look at the finished result of our reports!

Come join us to see this example of the new Data Engineering Experience in action and hopefully inspire similar solutions of your own!

For a preview of the content we'll be covering, we've got the following resources:

Video:
https://youtu.be/RA8UIgfBpno

Blog:
https://blog.cloudera.com/using-cloudera-data-engineering-to-analyze-the-paycheck-protection-program-data/

Tutorial:
https://www.cloudera.com/tutorials/cdp-using-cli-api-to-automate-access-to-cloudera-data-engineering.html?utm_source=mktg-community&utm_medium=meetup

Cloudera Users Page:
https://www.cloudera.com/users.html

Due to the ongoing situation of the pandemic, this will be an online event.

Update Mon Oct 5 6:22 PM CDT 2020
The online event link can now be found to the right under "online event".

Photo of Future of Data: Austin group
Future of Data: Austin
See more events