Skip to content

Berlin Open Source Data Infrastructure Meetup - November 2023

Photo of Team Aiven
Hosted By
Team A.
Berlin Open Source Data Infrastructure Meetup - November 2023

Details

Are you interested in learning more about open-source data technologies? ✅
Do you want to network with local and international tech professionals in a fun, relaxed environment? ✅

Then join us on November 30, 2023, at Betahaus Berlin, for an evening full of inspiring conversations and exciting talks by Yingjun Wu at Risingwave Labs and Gunnar Morling at Decodable.

Agenda:

  • 18:00 - 18:30 Welcome: Networking & snacks
  • 18:30 - 18:35 Kickoff: Welcome Aiven
  • 18:35 - 19:00 On the Journey of Redefining Stream Processing: What We Learned from Building RisingWave? - Yingjun Wu, RisingWave
  • Abstract: RisingWave is an open-source streaming database designed from scratch for the cloud. It implemented a Snowflake-style storage-compute separation architecture to reduce performance cost, and provides users with a PostgreSQL-like experience for stream processing. Over the last three years, RisingWave has evolved from a one-person project to a rapidly-growing product deployed by nearly 100 enterprises and startups. But the journey of building RisingWave is full of challenges. In this talk, I'd like to share with you lessons we've gained from four dimensions: 1) the decoupled compute-storage architecture, 2) the balances between stream processing and OLAP, 3) the Rust ecosystem, and 4) the product positioning. I will dive deep into technical details and then share with you my views on the future of stream processing.
  • Speaker: Yingjun Wu is the founder of RisingWave Labs (https://www.risingwave.com/), a database company developing RisingWave, a distributed SQL database for stream processing. Before running the company, Yingjun was a software engineer at the Redshift team, Amazon Web Services, and a researcher at the Database group, IBM Almaden Research Center. Yingjun received his PhD degree from National University of Singapore, and was a visiting PhD at Carnegie Mellon University. He has been working in the field of stream processing and database systems for over a decade.
  • 19:00 - 19:30 From Postgres to OpenSearch in No Time - Gunnar Morling, Decodable
  • Abstract: You've been tasked with implementing a data streaming pipeline for propagating data changes from your operational Postgres database to a search index in OpenSearch. Data views in OS should be denormalized for fast querying, and of course there should be no noticeable impact on the production database.

In this session we'll discuss how to build this data pipeline using two popular open-source projects: Debezium for log-based change data capture (CDC) and Apache Flink for stream processing. Join us for this talk and learn about:

* Setting up change data streams with Debezium
* Efficiently building nested data structures from 1:n joins
* Deployment options: Kafka Connect vs. Flink CDC

  • Speaker: Gunnar Morling is a Software Engineer and open-source enthusiast by heart, currently working at Decodable on stream processing based on Apache Flink. In his prior role as a software engineer at Red Hat, he led the Debezium project, a distributed platform for change data capture. He is a Java Champion and has founded multiple open source projects such as JfrUnit, kcctl, and MapStruct. Gunnar is an avid blogger (morling.dev) and has spoken at various conferences like QCon, Java One, and Devoxx. He lives in Hamburg, Germany.
  • 19:30 ~ 21:00 Food & networking

* Please note that this is an alcohol-free event. Light bites will be provided.

* By attending this event, you agree to abide by our community code of conduct.

COVID-19 safety measures

Event will be indoors
The event host is instituting the above safety measures for this event. Meetup is not responsible for ensuring, and will not independently verify, that these precautions are followed.
Photo of Berlin Open Source Data Infrastructure Meetup group
Berlin Open Source Data Infrastructure Meetup
See more events