πŸ’›πŸ’šπŸ’™ Productionizing the Machine Learning Lifecycle with Delta Lake πŸŽ€βœˆ

PyData Seattle
PyData Seattle
Public group
Location image of event venue

Details

Delta Lake using the Pyhon APIs and how to tie it all together with PySpark/TF/sklearn.

πŸ’šπŸ’™πŸ’– PyData Talk Night πŸ’–πŸ’šπŸ’™

Schedule:
6:00 - 6:30 Mix and mingle.πŸŒŸπŸ’• Many thanks to our host Databricks πŸ§‘πŸ’›πŸ§‘πŸ’›πŸš€ https://databricks.com/
Venue host - Flatiron 🌟
https://flatironschool.com
6:30 - 6:35 Announcements
6:35 - 7:20 @DennyLee and Spencer - Databricks!
7:20 - 7:30 QA
7:30 - 8:00 Networking

Presented by NumFOCUS open source better science https://www.numfocus.org/

Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. Delta Lake runs on top of your existing data lake and is fully compatible with Apache Spark APIs.

GDPR Compliance with Databricks Delta Lake

πŸ’™πŸŒΊ πŸŽ‰ Spencer McGhin from Databricks will be doing a live talk and demo on Databricks Delta Lake for PySpark, and how it can bring superior reliability and performance to your data lake. Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. Delta Lake runs on top of your existing data lake and is fully compatible with Apache Spark APIs.

Join this session to learn more about the recent developments on the PySpark APIs for this exciting new technology and take part in a live demonstration of their application in a GDPR compliance use case, utilizing Spark Structured Streaming on IoT data.

πŸ’šπŸ’™Thank you for your support to @NumFOCUS, your participation help us to bring awareness to NumFOCUS a 501(c)(3) nonprofit that supports and promotes world-class, innovative, open source scientific computing projects for Data Science, including: Pandas, Numpy, Sympy, IPython, Jupyter, Matplotlib and Julia.

❀️ Become a NumFOCUS Member! ❀️
Help sustain the open source data stack by becoming a NumFOCUS member! https://numfocus.org/

NumFOCUS envisions an inclusive scientific and research community that utilizes actively supported open source software to make impactful discoveries for a better world.