Apache Beam Meetup in Seattle
Details
This is the 1st Apache Beam meetup in Seattle! We will be presenting the use cases, really cool features that are coming to Beam and many other things that are happening in the world of data processing!
We welcome you at the Google (https://cloud.google.com) office in Seattle!
Agenda
18:00 - Registrations, speed networking, pizza and drinks.
18:30 - kick-off
18:40 - Making Beam Schemas Portable by Brian Hulette (Google)
19:10 - Apache Beam @ Brightcove - A case study
19:40 - ZetaSQL as a SQL dialect in BeamSQL by Rui Wang (Google)
20:10 - Networking
Talks
1st talk
Abstract:
Apache Beam’s Java SDK has had a Schema API for some time now. It enables Java users to use their own types in concise pipelines built with relational operators like Select, Filter, and Join. Sadly it has been a convenience offered only to Java users, until now.
In this talk, I will discuss our efforts to make Beam Schemas portable, and demonstrate some of the new capabilities it will enable in the Python SDK, including the ability to use Java’s SqlTransform in Python pipelines.
Speaker Bio:
Brian is a software engineer at Google and an active contributor to Apache Beam focusing on making schemas portable. Prior to joining Google, he worked on a wide array of projects, ranging from distributed software-defined radio systems to high-performance data visualization tools built with Apache Arrow in Javascript. He occasionally writes short things on twitter @BrianHulette and longer things on http://theneuralbit.com.
2nd talk
Abstract:
This will be a case study of how Brightcove unified our high-scale streaming & batch processing pipelines, using Beam and Cloud DataFlow, to reduce complexity & duplication, while increasing scale and performance.
Speakers: Rick Pike, Travis Hume, Iryna Varganovska from Brightcove
3rd talk
Abstract:
There are different SQL dialects adopted by different systems. It would be more user-friendly to allow users use what they are more familiar if BeamSQL can support more than one SQL dialect. Support ZetaSQL by BeamSQL is an attempt on this direction.
Speaker Bio:
Rui is an Apache Beam committer and he is interested in unified batch and streaming SQL.
Who should attend
Everyone interested in batch and stream data processing, who wants to learn about one of the newer and exciting Apache projects. We will try to have talks covering both use cases, and deep technical dives.
=========
Sponsors
Thanks to Google (https://cloud.google.com) for providing the space and sponsoring the meetup.
