What we're about

The focus of this Meetup group is to provide free monthly community data events.

About us

The Data Science Festival is a global data community. We aim to connect the data world and foster the sharing of knowledge, inspiration and ideas.

The global network is dedicated to free education through grassroots technical events. We will cover the latest topics that matter most to data scientists and data engineers. There will be no demos that you could learn from a book or video; instead, our speakers will discuss real-world problems, what works, what doesn’t work, and why they’ve implemented the solutions they have. The talks will generate lively discussion and debate, offering real-world takeaways to help you in your job.

Who is the Data Science Festival for?

• Data engineers, analysts, scientists, and other practitioners

• Academics, founders, researchers, authors

• R, Python and other software engineers who work with data or want to learn

• Data visualisation developers and designers

• Non-technical team leads, executives, and other decision makers from data-centric startups and large companies looking to utilise open source tools

How can I get involved?

Visit the DSF site to get involved! ( http://www.datasciencefestival.com/ )

We are actively looking for community-minded individuals to help build and grow our group. Please feel free to get in touch if you would like to:

• Host an event

• Sponsor an event

• Present a session

• Volunteer to help organise the festival

Upcoming events (2)

Facebook Presents Women Create: Making Career Defining Products

(External registration needed, please read description)

Join us for a collaborative, online virtual event, hosted by the Facebook Data & Analytics team in partnership with Data Science Festival, bringing together Data Scientists & Engineers from across EMEA to connect, share and learn from others in the industry. We are excited to feature an inspiring line-up of women working in Data & Analytics across our EMEA Facebook teams & apps (Facebook Reality Labs, WhatsApp, Central Integrity, Facebook Messenger), who have each created their own product focused technical/strategic lightning talks and interactive speaker sessions to share with you. Together, we will explore how they navigated complex product challenges and created solutions that power decision making when working with one of the richest data sets in the world to make career defining products.

Start your summer evening hearing from the people behind the products, with 4 technical/strategy talk showcases and live audience Q&A hosted by Director of Data Idols, David Loughlan.

During the event, you'll have the opportunity to attend 1 of 5 bespoke speaker sessions designed to give you face-to-face, behind-the-scenes access to Facebook Data Engineers & Data Scientists, who will help demystify their role in the Product ecosystem and share how they navigated new problem spaces and took the lead in directing their own career experience.

Please read the lightning talks & breakout speaker session abstracts below to learn more about what each talk will cover. You will be asked to select the breakout session that most interests you during the registration process.

This is an external event and will have a separate registration process.



You will receive your joining links only after completing that registration, which is also where you select your break-out room. Please note that this event will not be recorded, so be sure to attend live.


5:00pm - 5:10pm Start of Event Part 1:

Welcome and Introduction: overview of the event by David Loughlan, Director, Data Idols

5:10pm - 6:00pm Product Lightning Talks:

Lightning talk 1: “How data scientist drives product decisions at FB” - Chloe Goh

Lightning talk 2: "Connected to connect: Data Engineering collaboration upon the launch of Facebook Shops" - Aneta Peryga

Lightning talk 3: "Measuring the Value of Ads Measurement for Advertisers" - Maisie Lynton

Lightning talk 4: “Overview of Roles within Analytics at Facebook” - Waad Aljaradt

6:00pm - 6:30pm - Live Audience Q&A Hosted By Data Science Festival

6:30pm End of Event Part 1

6:40pm - 7:10pm Start of Event Part 2:

Behind the Scenes Product Speaker Sessions

Six speakers, five Product speaker sessions, designed to give you face-to-face, behind-the-scenes access to Data Engineers & Data Scientists, who will help demystify their role at Facebook and share how they navigated new problem spaces and took the lead in directing their own career experience across a variety of Product teams.

Breakout 1: "Data Engineering in Facebook Reality Labs" - Nasia Ntalla, Data Engineer, Facebook Reality Labs

Breakout 2: "Growing Your Career in Analytics" - Zineb Amrani, Data Science Manager, Facebook Ads, & Kate Vang, Data Science Manager, Central Integrity

Breakout 3: "IG Lite at Facebook Tel Aviv" - Dan-ya Shwartz, Director, Product Growth & Selena Treister, Product Growth Analyst, Lite interfaces

Breakout 4: "Navigating Unstructured and New Problem Space: The Role of Data Scientists" - Shadi Janansefat, Data Scientist, App & Messaging

Breakout 5: "Fighting Misinformation on WhatsApp" - Qiaohong Wang, Data Science Manager, WhatsApp

7:10pm - 7:25pm - Speaker Session Q&A

7:30pm - End of Event Part 2

Spark optimisation: building an efficient Lakehouse with Databricks.

Join DSF in June for our Women in Data Talks. Speakers from this sector share their stories, projects, joys, trials, and tribulations over the course of June 2021. Come and listen to these amazing companies and ask questions to learn more about this growing industry.

Ticket Allocation Process:
Registering here guarantees you a ticket for the Data Science Festival Virtual Event on June 24th, 2021. Please be sure to add this session to your schedule in order to receive the joining URL links.

Registration Link: https://womenindata.datasciencefestival.com/talks/women-in-data-sandbox-session-4-databricks/

Summary: Spark optimisation: building an efficient Lakehouse. Apache Spark is a unified analytics engine for large-scale data processing. A lakehouse is a new, open architecture that combines the best elements of data lakes and data warehouses. Lakehouses are enabled by a new open and standardized system design: implementing similar data structures and data management features to those in a data warehouse, directly on the kind of low-cost storage used for data lakes. In this talk we’ll cover how to use Spark in the most efficient way: how writing optimised Spark jobs can reduce run time and costs, building a strong and future-proof foundation for the lakehouse. We’ll discuss topics such as partitioning of data, choosing the optimal Spark configuration, and the main pitfalls to avoid.

Speaker: Oleksandra Bovkun, Solutions Architect at Databricks

Bio: After obtaining her master’s degree in applied mathematics, Oleksandra started as a researcher in the R&D department of an energy company. Once she finished her research, she continued her career as a software developer, database architect, and data engineer. Spark was always one of the core technologies she worked with, including a large-scale Spark optimisation project and implementing a data platform with Spark on Kubernetes. Since joining Databricks, she has helped customers implement Data and AI projects and enabled them to run Spark applications more efficiently.
