Details

Intro: The challenges in managing a data lake for structured and unstructured data.

Achieving manageability:
1. The components of the architecture and their roles
   Open table formats
   Catalogs
   Data version control systems
2. How it all fits together
   Example using Databricks technologies
   Example using Apache Iceberg
   Example using AWS technologies
3. Discussion

Language: English

About the lecturer: Einat Orr is the CEO and co-founder of Treeverse, the company behind lakeFS, an open-source platform that delivers a Git-like experience to data lakes built on object storage. She received her PhD in Mathematics from Tel Aviv University, in the field of optimization in graph theory. Einat previously led several engineering organizations, most recently as CTO at SimilarWeb.

Related topics

Amazon Web Services
Cloud Computing
Big Data
Data Storage
