Virtual meetup: Building a Reliable Data Lake & What's New in Spark 3.0

Hosted By
Ron S.

Details

Hope you all are doing well and are energized to learn something new.

There is no time like the present to get back into all things Spark! Join us for a virtual meetup where we will dive deep into topics ranging from building a reliable data lake to what's new in Spark 3.0.

We will discuss the following:

  • How to Build a Reliable Data Lake by Tal Sharon, Staff Big Data Engineer, Intuit (https://www.linkedin.com/in/tal-sharon-89a87156/)
    At Intuit, a global financial platform company with products including TurboTax, QuickBooks, and Mint, we have a data lake built from Parquet tables. We needed to support updates at scale, especially when handling streams of change requests to the data lake, so we compared two leading solutions: Delta Lake from Databricks and Apache Hudi from Uber. Tal will explain why ACID properties are the way to go when working with Spark, and how Delta Lake and Hudi support them to enable building a reliable data lake.

  • What’s New in Spark 3? by Daniel Haviv, Solutions Architect, Databricks (https://www.linkedin.com/in/danielhaviv/)
    Spark 3 is almost out. Daniel will share the latest on what’s coming in this highly anticipated release.

--------------------------------------
What will you learn?

  • Why you need ACID when working with Spark

  • How Delta Lake and Hudi work

  • What’s new in Spark 3.0

Spark is back: more talks, more features, more key learnings! We promise it will spark your interest!

See you on May 24!

Israel Spark Meetup