TBDEG - The Who, What, and Why of Data Lake Table Formats by Alex Merced!

Hosted by Joe Blankenship

Details

This meetup is a monthly chat for our community to discuss the latest and greatest in data engineering. We'll cover interesting topics, techniques, and tools through general open discussions and focused presentations.

This month, Alex Merced will present "The Who, What, and Why of Data Lake Table Formats," an exploration of data lake table formats and their impact on business analytics.

Data lake table formats are a critical component of modern data analytics. They provide a way to organize and manage data in a data lake, and they offer several benefits for business analytics, including:

  • Scalability: They can scale to handle large amounts of data.
  • Performance: They can improve query performance on large datasets.
  • Durability: They can help keep data durable and recoverable.
  • Auditability: They can help keep data auditable and compliant.

This presentation will explore the who, what, and why of data lake table formats. We will discuss the different data lake table formats, such as Apache Iceberg, Apache Hudi, and Delta Lake. We will also discuss the benefits of using data lake table formats for business analytics.

By the end of this presentation, you will better understand data lake table formats and how they can be used to improve business analytics.

Key takeaways:

  • Data lake table formats are a critical component of modern data analytics.
  • They offer a number of benefits for business analytics, including scalability, performance, durability, and auditability.
  • There are a variety of data lake table formats available, including Apache Iceberg, Apache Hudi, and Delta Lake.

Speaker Bio:

Alex Merced is a developer advocate for Dremio, a developer, and a seasoned instructor who has worked with companies including GenEd Systems, Crossfield Digital, CampusGuard, and General Assembly.

Alex is a co-author of the O'Reilly book "Apache Iceberg: The Definitive Guide" and has spoken at events including Data Day Texas, OSA Con, P99Conf, and Data Council.

Alex shares his knowledge of technology across several platforms. His tech content can be found in blogs, videos, and his podcasts, Datanation and Web Dev 101.

Alex has also contributed to the JavaScript and Python communities by developing a range of libraries, including SencilloDB, CoquitoJS, and dremio-simple-query.

twitter: amdatalakehouse
threads: alexmercedcoder
mastodon: @alexmerced@data-folks.masto.host
linkedin: /in/alexmerced

Looking forward to seeing you all soon!

Tampa Bay Data Engineering Group