We will dive into the internals of Parquet files to understand how the format enables fast and scalable data retrieval.
Next, we'll see how the Delta format enhances Parquet with features like a transaction log.
Does adopting Delta mean we need to perform regular maintenance? And how does it compare to indexing in a classic database?
I'll answer these questions and more in this demo-heavy session.