Automated Testing in the Modern Data Warehouse with Milk Bar
Details
Dataiku will be hosting two talks focused on building repeatable, tested models to improve your data science practices!
Tentative Schedule:
6:30pm: Pizza + Beer
7:00pm: Scaling up with Docker and Kubernetes in Dataiku by Jed Dougherty, Lead Data Scientist at Dataiku
7:30pm: Automated Testing in the Modern Data Warehouse by Josh Temple, Senior Data Engineer at Milk Bar
Abstracts:
Scaling up with Docker and Kubernetes in Dataiku by Jed Dougherty, Lead Data Scientist at Dataiku:
Containers have changed the way people develop and deploy projects across the programming universe. That’s as true with Data Science as any other subfield. Jed Dougherty, Lead Data Scientist at Dataiku, will be your guide to two of the most popular of these technologies, Docker and Kubernetes. We’ll look at why they’re useful for Data Science and how they can be used from directly within Dataiku.
Automated Testing in the Modern Data Warehouse by Josh Temple, Senior Data Engineer at Milk Bar:
In today's business world, data is the currency of decision-making. Data drives daily decisions across the entire organization, from automated bots to the C-Suite. Yet many data professionals are not properly trained to test their code or their data in a comprehensive and modern way. While data quality is of utmost importance, testing data is hard, and many organizations don't have serious testing frameworks in place. Software engineering tools like continuous integration/deployment, staging environments, and unit tests are not as common in the data world. This talk will cover practical ways you can implement automated testing to improve the quality of your data and increase the confidence of decision-making across your organization.
Bios:
Jed leads Dataiku's Data Science team in North America. He works with a wide variety of Fortune 500 clients and specializes in helping large companies spin up and organize Data Science teams. Before coming to Dataiku he worked on event detection, spam filtering, and survival analysis in the fields of breaking news, social media, and child welfare. He earned his masters at Columbia University in its QMSS program.
Josh Temple leads data analytics and engineering for Milk Bar, an award-winning bakery led by chef Christina Tosi. He is building the first business intelligence stack at Milk Bar using Airflow, BigQuery, dbt, and Looker and is passionate about improving data quality through automation. Josh has a degree in chemical engineering from Johns Hopkins University and is a self-taught data engineer. He is also a major ice cream nerd and co-founded Tiny Home Creamery, an ice cream company based in NYC, with his wife Meera. In his free time, Josh enjoys cooking and playing the piano.
If you can't make it to the event, Josh also wrote an article on the topic! https://medium.com/@josh.temple/automated-testing-in-the-modern-data-warehouse-d5a251a866af

