Apache Kafka® at scale / Real-Time Analytics With Apache Pinot® and Apache Kafka


Details
Hello streamers! Please join us for an IN-PERSON Apache Kafka® meetup on Tuesday, May 9th from 5:30pm hosted by our friends at Optum!
📍Venue:
Optum
11000 Optum Circle
Eden Prairie, MN 55344
*Use main entrance. Please bring government issued id.
Please note: Registrations will be closed on Monday May 8th, one day prior to the meetup.
***
🗓 Agenda:
- 5:30pm: Doors open
- 5:30pm - 6:00pm: Snack, drinks and networking
- 6:00pm - 6:45pm: What they don’t tell you about running Kafka at scale, Ryan Belgrave, Sr. Principal Engineer, Optum
- 6:45pm - 7:30pm: Real-Time Analytics With Apache Pinot and Apache Kafka, Tim Berglund, VP of Developer Relations, StarTree
- 7:30pm-8:00pm: Additional Q&A & Networking
***
💡 Speaker One:
Ryan Belgrave, Sr. Principal Engineer, Optum
Title of Talk:
What they don’t tell you about running Kafka at scale
Abstract:
We all have read the Apache & Confluent Kafka documentation on how to run Kafka clusters, how to manage HA and DR, how to configure topics, configure brokers, etc.… We have also read the blog posts about the major outages, issues, and learnings various companies have had with Kafka. This talk isn’t about the documentation and blog posts. Instead, it is about the things that are not documented, the gotchas, the rules that you can break and the rules you must always follow, the things that can save you from having an outage when a client misbehaves. Let’s talk about the things you really must know to run 500+ Kafka clusters with over 8PB worth of data in every major cloud provider and in your own datacenter.
Bio:
Ryan Belgrave is a Sr. Principal Engineer and has been working in the Distributed Data Platforms space at Optum since 2018. Before he joined Optum he worked at Target on their Public Cloud team building their cloud application platform for running [Target.com](http://target.com/). Ryan specializes in all things containers, Kubernetes and Cloud and has a Home Lab running various CNCF software. While Ryan has only been officially working in the industry since 2016, he has been learning and working with all the various Linux and Cloud technologies since 2006.
***
💡 Speaker Two:
Tim Berglund, VP of Developer Relations, StarTree
Title of Talk:
Real-Time Analytics With Apache Pinot and Apache Kafka
Abstract:
Apache Pinot is a real- time, distributed, analytical data store which is widely used in the industry today for internal as well as user facing analytical use cases. The columnar data format and variety of rich indexing strategies makes it a perfect fit for running highly concurrent queries on multi-dimensional data within milliseconds. It has out of the box support for Apache Kafka, HDFS, S3, Presto and so on and can seamlessly integrate with any big data stack.In this talk, we will go over the basics of Apache Pinot and understand what makes it so fast. We will look at a simple use case for ingesting Kafka data, how Pinot is optimized to ingest from Kafka in particular, and demonstrate how to query this data using a convenient SQL interface.
Bio:
Tim is a teacher, author, and technology leader with StarTree, where he serves as the VP of Developer Relations. He is a regular speaker at conferences and a presence on YouTube explaining complex technology topics in an accessible way. He tweets as @tlberglund, blogs every few years at http://timberglund.com, and lives in Littleton, CO, USA. He has three grown children and two grandchildren, with a third on the way.
https://startree.ai/
***
DISCLAIMER
BY ATTENDING THIS EVENT IN PERSON, you acknowledge that risk includes possible exposure to and illness from infectious diseases including COVID-19, and accept responsibility for this, if it occurs.
NOTE: We are unable to cater for any attendees under the age of 21.
***
If you would like to speak or host our next event please let us know! community@confluent.io
COVID-19 safety measures

Apache Kafka® at scale / Real-Time Analytics With Apache Pinot® and Apache Kafka