Skip to content

Data Council Berlin Meetup #3: Spark Edition

Photo of Martin Loetzsch
Hosted By
Martin L. and 2 others
Data Council Berlin Meetup #3: Spark Edition

Details

Hi everyone,

Spark is one of the technologies that many people consider for their data processing / ETL needs. In this edition of the Data Council Berlin Meetup, we want to delve deeply into the pros and cons of data pipelining with Spark.

Two of the companies in Berlin that use Spark at a significant scale are GetYourGuide and HelloFresh and we are very happy to have David Mariassy, Thiago Rigo and Rodrigo Peternella reporting first hand about the challenges of applying Spark to their data problems.

PyData Berlin veteran Matti Lyra will add an additional perspective by sharing his experiences on building data pipelines with Dask.

We want the evening to be opinionated. If you can share your experiences with Spark, then please reach out to Daniel or me for a talk or just come there and join the discussion.

Doors open 18:30h, there will be food at drinks.

Speakers:
https://www.linkedin.com/in/david-mariassy-4a881a5b/
https://www.linkedin.com/in/thiago-rigo-479a2042/
https://www.linkedin.com/in/rpeternella/
https://www.linkedin.com/in/mattilyra/

Photo of Data Council Berlin Data Engineering & Science group
Data Council Berlin Data Engineering & Science
See more events