What we're about

New to Apache Kafka®? Start with these free resources: https://cnfl.io/learn-ak-mu

This is an open community - if you want to present, host or contribute in other ways follow this link (http://cnfl.io/get-involved-mu (https://www.confluent.io/community/get-involved/)) - first time speakers welcome!

This meetup is for your fellow event streaming enthusiasts!

The topics discussed at our events are all about event streaming, including Confluent Platform, Confluent Cloud, Apache Kafka®, Kafka Connect, streaming data pipelines, ksqlDB, Kafka Streams as well as stream processing, Security, Microservices and a lot more!!

Code of conduct: https://cnfl.io/code-of-conduct-welcome

Beyond this group, we also have the following resources to help you learn and develop your skills! See them here:

*The Meetup Hub*

Find recordings of previous meetups around the world and see upcoming dates for many more at the Meetup Hub

https://cnfl.io/meetup-hub-desc

*Ask The Community:*

-Forum;

This is a place for all the community to ask the tough questions, share knowledge and win badges :D http://cnfl.io/forum-desc

-Slack;

Join tens of thousands of community members in this community cross-collaboration tool, exchanging thousands of messages every month:

cnfl.io/slack

*Confluent Community Catalysts*

Nominate the next Community Catalysts (MVPs) and find out more here:

https://cnfl.io/nominate-desc

*Confluent Training and Certification discounts!*

Learn Apache Kafka® and become Confluent Certified (with 20% off your certification exam with the code MU2021CERT): https://cnfl.io/train-cert

--

Also here’s a gift: Get $200 worth of free Confluent Cloud usage every month for your first 3 months; (that could be $600 worth, without spending a single penny) (Ts & Cs apply) http://cnfl.io/mu-try-cloud

If you’re already a user, you can get an extra $60 on top with the code: CC60COMM

Head to http://cnfl.io/get-involved-mu if you have any questions, ideas, concerns or if you want to contribute in some way!

Apache Kafka®, Kafka, and associated open source project names are trademarks of the Apache Software Foundation. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided here or at any of our Meetups.

Upcoming events (1)

Document Stream Pipelines with Apache Kafka®

Online event

Hello Streamers!

Please find the details to join this fun and informative meetup below.

Find information about upcoming meetups and tons of content from past Kafka Meetups all over the world:
cnfl.io/meetup-hub
-----
Agenda (time below is GMT+2):
6:00pm-6:05pm: Online networking (optional)

6:05pm-6:50pm: Bayer Document Stream Pipelines, Astrid Rheinländer, Computational Scientist, Bayer Pharmaceuticals and Victor Künstler, Software Engineer, bakdata

6:50pm-7:00pm: Q&A

Joining our slack space is not instant, so ensure that you are in, in time for the event, follow the steps within this link before the day of the event if you can! cnfl.io/slack

In addition, be sure to check out all the great content here,
cnfl.io/streaming-audio-podcast-mu
-----
Speakers:
Astrid Rheinländer, Computational Scientist, Bayer Pharmaceuticals
Victor Künstler, Software Engineer, bakdata

Title:
Bayer Document Stream Pipelines

Abstract:
Bayer continuously analyzes millions of text-rich data (e.g., scientific literature, clinical trials, patents, news, etc.) to support insight generation along the R&D value chain. We selected Apache Kafka® as the primary layer to implement a variety of document streams flowing through several text processing, integration, and enrichment steps. In this talk, we highlight the strategic importance of this project, provide an end-to-end technical overview and demonstrate central components of our solution.
We also look at concrete challenges, which come with processing text-rich data at a very large scale and discuss respective solutions, such as large document processing, error handling, semantic data integration, and enrichments using NLP. We will demo how users create new document processing pipelines, and how we can easily keep track of the many Kafka pipelines running on Kubernetes.

Bios:
Astrid Rheinländer is a computational scientist at Bayer Pharmaceuticals, R&D Data Sciences working on Big Data Analytics for the Life Sciences. Previously, she was a member of the Stratosphere Research Group, which laid the foundations for the development of Apache Flink. She holds a Ph.D. in Computer Science from Humboldt-Universität zu Berlin.
Victor Künstler is a software engineer at bakdata, working on data streaming applications and Big Data solutions. He is interested in building scalable distributed systems and machine learning solutions using open source and cloud technologies.

-----
Online Meetup Etiquette:
•Please unmute yourself when you have a question.
•Please hold your questions until the end of the presentation or use the zoomchat!
•Please arrive on time as zoom meetings can become locked for many reasons (though if you get locked out a recording will be available, but you may have to wait a little while for it!)

----
If you would like to speak or host our next event please let us know! [masked]

Photos (15)

Find us also at