Data Council Berlin Meetup #3: Spark Edition


Details
Hi everyone,
Spark is one of the technologies that many people consider for their data processing / ETL needs. In this edition of the Data Council Berlin Meetup, we want to delve deeply into the pros and cons of data pipelining with Spark.
Two of the companies in Berlin that use Spark at a significant scale are GetYourGuide and HelloFresh and we are very happy to have David Mariassy, Thiago Rigo and Rodrigo Peternella reporting first hand about the challenges of applying Spark to their data problems.
PyData Berlin veteran Matti Lyra will add an additional perspective by sharing his experiences on building data pipelines with Dask.
We want the evening to be opinionated. If you can share your experiences with Spark, then please reach out to Daniel or me for a talk or just come there and join the discussion.
Doors open 18:30h, there will be food at drinks.
Speakers:
https://www.linkedin.com/in/david-mariassy-4a881a5b/
https://www.linkedin.com/in/thiago-rigo-479a2042/
https://www.linkedin.com/in/rpeternella/
https://www.linkedin.com/in/mattilyra/

Data Council Berlin Meetup #3: Spark Edition