Details

This is the 1st Apache Beam meetup in Seattle! We will present use cases, some really cool features coming to Beam, and other developments in the world of data processing!

We welcome you at the Google (https://cloud.google.com) office in Seattle!

Agenda

18:00 - Registrations, speed networking, pizza and drinks.

18:30 - Kick-off

18:40 - Making Beam Schemas Portable by Brian Hulette (Google)

19:10 - Apache Beam @ Brightcove - A case study

19:40 - ZetaSQL as a SQL dialect in BeamSQL by Rui Wang (Google)

20:10 - Networking

Talks

1st talk

Abstract:
Apache Beam’s Java SDK has had a Schema API for some time now. It enables Java users to use their own types in concise pipelines built with relational operators like Select, Filter, and Join. Sadly, it has been a convenience offered only to Java users, until now.
In this talk, I will discuss our efforts to make Beam Schemas portable, and demonstrate some of the new capabilities it will enable in the Python SDK, including the ability to use Java’s SqlTransform in Python pipelines.

Speaker Bio:
Brian is a software engineer at Google and an active contributor to Apache Beam focusing on making schemas portable. Prior to joining Google, he worked on a wide array of projects, ranging from distributed software-defined radio systems to high-performance data visualization tools built with Apache Arrow in JavaScript. He occasionally writes short things on Twitter @BrianHulette and longer things on http://theneuralbit.com.

2nd talk

Abstract:
This will be a case study of how Brightcove unified our high-scale streaming and batch processing pipelines, using Beam and Cloud Dataflow, to reduce complexity and duplication while increasing scale and performance.

Speakers: Rick Pike, Travis Hume, Iryna Varganovska from Brightcove

3rd talk

Abstract:
Different systems adopt different SQL dialects. BeamSQL would be more user-friendly if it supported more than one dialect, so that users could work in the one they are most familiar with. Supporting ZetaSQL in BeamSQL is an attempt in this direction.

Speaker Bio:
Rui is an Apache Beam committer and he is interested in unified batch and streaming SQL.

Who should attend

Everyone interested in batch and stream data processing who wants to learn about one of the newer, more exciting Apache projects. We will try to have talks covering both use cases and deep technical dives.

=========
Sponsors

Thanks to Google (https://cloud.google.com) for providing the space and sponsoring the meetup.
