What we're about
Upcoming events (1)
Please find the details to join this fun and informative meetup below.
Find information about upcoming meetups and tons of content from past Kafka Meetups all over the world:
Agenda (time below is GMT+2):
6:00pm-6:05pm: Online networking (optional)
6:05pm-6:50pm: Bayer Document Stream Pipelines, Astrid Rheinländer, Computational Scientist, Bayer Pharmaceuticals and Victor Künstler, Software Engineer, bakdata
Joining our slack space is not instant, so ensure that you are in, in time for the event, follow the steps within this link before the day of the event if you can! cnfl.io/slack
In addition, be sure to check out all the great content here,
Astrid Rheinländer, Computational Scientist, Bayer Pharmaceuticals
Victor Künstler, Software Engineer, bakdata
Bayer Document Stream Pipelines
Bayer continuously analyzes millions of text-rich data (e.g., scientific literature, clinical trials, patents, news, etc.) to support insight generation along the R&D value chain. We selected Apache Kafka® as the primary layer to implement a variety of document streams flowing through several text processing, integration, and enrichment steps. In this talk, we highlight the strategic importance of this project, provide an end-to-end technical overview and demonstrate central components of our solution.
We also look at concrete challenges, which come with processing text-rich data at a very large scale and discuss respective solutions, such as large document processing, error handling, semantic data integration, and enrichments using NLP. We will demo how users create new document processing pipelines, and how we can easily keep track of the many Kafka pipelines running on Kubernetes.
Astrid Rheinländer is a computational scientist at Bayer Pharmaceuticals, R&D Data Sciences working on Big Data Analytics for the Life Sciences. Previously, she was a member of the Stratosphere Research Group, which laid the foundations for the development of Apache Flink. She holds a Ph.D. in Computer Science from Humboldt-Universität zu Berlin.
Victor Künstler is a software engineer at bakdata, working on data streaming applications and Big Data solutions. He is interested in building scalable distributed systems and machine learning solutions using open source and cloud technologies.
Online Meetup Etiquette:
•Please unmute yourself when you have a question.
•Please hold your questions until the end of the presentation or use the zoomchat!
•Please arrive on time as zoom meetings can become locked for many reasons (though if you get locked out a recording will be available, but you may have to wait a little while for it!)
If you would like to speak or host our next event please let us know! [masked]