Apache Beam meetup 9
Details
After over a year of video conferencing, staying at home and virtual conferences, it's time to get together again and have the first Beam meetup in real life in a long time.
The meetup will take place on Tuesday September 21nd at the Arabesque AI offices (Fifth Floor, Jamestown Wharf, 32 Jamestown Rd, London NW1 7BY).
=======
Agenda
18:15 - Welcome
18:40 - Kick-off
18:45 - 1st talk: Mark Kelly (Director of Security Solutions at Fastly)
19:15 - 2nd talk: Peter Marshall (Technology Evangelist at Imply)
19:45 - 3rd talk: Tatiana Al-Chueyr Martins (Principal Data Engineer at BBC)
20:15 - pizza, drinks and networking
=====
Talks
1st talk
For the first talk, we welcome Mark Kelly, director of security solutions engineering at Fastly (https://www.linkedin.com/in/marklkelly76/) who will talk about their journey with Apache Beam.
Abstract:
Attack detection at scale: processing real-time security events with Apache Beam and BigQuery.
In this talk, we’ll take you through the lessons learned and performance challenges overcome on our journey from PoC to a production-ready pipeline for ingesting 1M+ web application firewall events/sec.
2nd talk
For the second talk, we welcome Peter Marshall, Technology Evangelist at Imply, who will be teaching us about the synergies between Apache Beam and Apache Druid! (https://www.linkedin.com/in/petermarshallio/).
Abstract:
Analytics-in-motion is here. In this talk, we lay out a temperature-based way of categorising different analytics technologies. And we uncover how the Apache Druid database is able to deliver on instant data visibility, ad-hoc query, and high query concurrency when used alongside Apache Beam for the ultimate in hot analytics experience.
3rd talk
Tatiana Al-Chueyr Martins, Principal Data Engineer at BBC will be talking about their journey with Apache Beam and Google Cloud Dataflow and how they scaled machine learning to millions of users.
Abstract:
Apache Beam is a critical technology in delivering millions of personalised recommendations to the BBC audience daily. The journey to adopt the technology, however, wasn’t the smoothest. The objective of this talk is to save others time and money. This talk will discuss:
- Why Beam?
- First pipeline which allowed us to go from a machine learning prototype to production
- Issues faced with the first approach
- Solutions embraced to handle problems
- Current pipeline design and cost gains This talk will focus on using the Python SDK and the Dataflow runner.
==================
Who should attend
Everyone interested in Data Engineering, Data Science and Machine Learning, who wants to learn about one of the newer and exciting Apache projects focused on batch & stream processing of data. We try to cover both business value as well as digging deeper technically.
=========
Sponsors
Thanks to Arabesque AI (https://www.arabesque.com) for sponsoring the recordings, food & drinks on this event!
