Big Data - Apache Spark 3.0 in 45 minutes + book giveaway


Details
We've taken the last month off preparing for All Things Open in October, but we're thrilled to be back with a meetup looking at a macro topic - big data analytics/processing, but in a quasi micro way..
Big data analytics/processing is increasingly important today and the Apache Spark framework/engine remains popular with organizations of all sizes, from small to enterprise level, as well as with major cloud vendors. This talk will look specifically at the newest version 3.0.
While Spark 3 is a big topic, our speaker will deliver a 45 minute overview intended to hit the high points. In addition, we'll do 10-15 minutes of Q&A to answer attendee questions.
Finally, we'll give signed copies of the speaker's book - Spark in Action (2e) https://www.manning.com/books/spark-in-action-second-edition - to 4 lucky winners. We'll ship the signed books right away.
Topic: Spark 3 in 45 minutes
Abstract:
Apache Spark v3 is a new milestone for the Big Data framework. In this session, you will (re)discover what Spark is, learn about the new features in its third major version, and go through a complete end-to-end project.
I like to call Spark an Analytics Operating Systems. It is offering far more than just a framework or a library. I will explain why. Spark v3 is the latest major evolution. It was released mid-June 2020 and adds impressive new features. After looking at them from a high level, I will detail a few of my favorites.
Finally, as we all like code (well, at least I do), I will demonstrate a complete data & AI pipeline looking at Covid-19 data.
Key takeaways: Spark as an Analytics OS, Spark v3 highlights, building data/AI pipelines/models with Spark.
Audience: software engineers, data engineers, architects, data scientists.
Speaker: Jean-Georges Perrin, Senior Solutions Architect, Advance Auto Parts and noted author and speaker
Bio:
Jean-Georges Perrin "jgp” is a senior solutions architect working for Advance Auto Parts and the author of Spark in Action, 2nd edition (Manning). He is passionate about software engineering and all things data, small and big data. His latest endeavors bring to more and more data engineering, data governance, and, his favorite theme, the industrialization of data science. He is proud to have been the first in France to be recognized as an IBM Champion and to have been awarded the honor for his 12th consecutive year. Jean-Georges shares his more than 25 years of experience in the IT industry as a presenter and participant at conferences and through publishing articles in print and online media. His blog is visible at http://jgp.net. When he is not immersed in IT, which he loves, he enjoys exploring his adopted region of North Carolina with his kids.
Twitter: https://twitter.com/jgperrin

Sponsors
Big Data - Apache Spark 3.0 in 45 minutes + book giveaway