Skip to content

Details

Please join us for this important webinar discussion of the upcoming Apache Spark™ 4.0.0 release.

The upcoming Apache Spark™ 4.0.0 has many new features for data practitioners, analysts, and machine learning practitioners. To cover all the features comprehensively in this webinar, we’ll have to hold you captive for a few hours, which is impossible given time constraints. In our subsequent webinar sessions, we will extensively cover Spark 4.0.0 salient features.

For this session, we’ll provide an overview of:

  • New feature functionalities
  • New Extensions for DataSources
  • Custom Python and SQL functions and procedures

Please join us and learn more about the upcoming Apache Spark™ 4.0.0 🤝

📅 Date: April 22, 2025
⏰ Time: 9:30 AM - 10:30 AM PST
📍 Location: online

RSVP HERE 👉 https://lu.ma/4p1ulj49

Agenda:
Talk 1: Upcoming Apache Spark™ 4.0.0 Release
Abstract: This session will cover new features and enhancements in the upcoming Apache Spark™ 4.0 release. For this session, I’ll do an overview of the following features:

  • Spark Connect: The future of Spark extensibility
  • ANSI Mode: For better ANSI SQL compatibility
  • Variant data types for semi-structured data
  • String collation support
  • Python UDTF functions
  • SQL and UDTF functions
  • PySpark UDF Unified Profiler

Speakers:
Daniel Tenedorio is a Senior Staff Engineer at Databricks

Apache Spark
Open Source

Members are also interested in