Details

Hey there!
We're continuing the season with the guys from Habla Computing. They are coming from Madrid to teach an introductory course on Spark, and will stop by the meetup to delight us with some of their wisdom :)

See you on 30th September at 19:00 at the LifullConnect (Trovit) offices.

Please also check out the course: they are offering a discount code and raffling off a full ticket just for the community!

Abstract:
Apache Spark is now well established and has become one of the most popular big data technologies, with a rich ecosystem of libraries and extensions that make it a complete tool for data engineering. But is everything perfect in Spark? Are there design flaws in the Spark APIs and execution model? In this talk, we'll discuss Spark's strengths (unified programming model, optimizations, caching...) and weaknesses (unexpected side effects, composition limitations, inconsistencies in the caching API...) and how they affect development.

About the speaker:
Mikel San Vicente is a senior data engineer at Habla Computing. He has worked with Scala and big data for more than 5 years in fields such as finance, marketing, retail, telcos...
