This time, Sascha Dittmann will give a talk on building a robust data pipeline with open-source software.
Thanks to bluehands GmbH for hosting us.
Agenda
6:30 PM: Doors open
6:45 PM: Orga Intro
7:00 PM: Talk by Sascha Dittmann: Build a Robust Data Pipeline with Open Source Software
Data quality has become a much-discussed topic in data engineering and data science. It’s become clear that data validation is crucial to ensuring the reliability of the data products and insights produced by an organisation’s data pipelines.
But can you do it completely with open source software?
Apache Airflow and dbt (data build tool) are among the prominent open source tools in the data engineering ecosystem. While dbt offers some data testing capabilities, another open source tool, Great Expectations, adds dedicated data validation to the pipeline and can provide further layers of robustness.
Join expert Sascha Dittmann to explore the "dAG stack" and learn how to combine the functions of these three open source tools to build, test, validate, document, and orchestrate an entire pipeline, end to end, from scratch.
7:45 PM: Networking
---
Hosted By
Martina Kraus, Organizer
Christian Liebel, Organizer
Tanja Ulianova, Organizer
Christian Janz, Organizer
Ayden Mohammadi, Organizer
Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-karlsruhe-presents-gdg-karlsruhe-april/