Building Reliable Data Lakes - Technical Workshop on Delta Lake


Details
5.30-6.00pm: Networking & Food
6.00-8.00pm: Workshop by Databricks
Presenter: Soham Bhatt, Solution Architect, Databricks, Inc.
Workshop Agenda:
Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. Delta Lake runs on top of your existing data lake and is fully compatible with Apache Spark APIs.
Delta Lake sits on top of Apache Spark. Together, the storage format and the compute layer help simplify building big data pipelines and increase their overall efficiency.
This workshop will provide a primer on Delta Lake and show how to use it to ensure consistency with ACID transactions. Using public lending data as an example, it will walk through a notebook that showcases the capabilities and features of Delta Lake.
More info on Delta:
https://databricks.com/product/databricks-delta
