Seattle Apache Kafka Meetup


Details
Greetings!
We have another session of the Seattle Apache Kafka Meetup coming up. Please RSVP, and also forward to anyone who may be interested.
Date: Apr 18, 2019
Location: Microsoft City Center Plaza, conference room 2110 (Spruce).
5:30 pm: Doors Open
- 6:00 to 6:30 pm
Speaker: Noor Abani, Negin Raoof
Noor Abani is a Software Engineer at Microsoft. She works on Siphon, a near real-time distributed data bus based on Apache Kafka on Azure. Prior to joining Microsoft, Noor graduated with a PhD degree in Computer Science from UCLA in 2018.
Negin Raoof is a Software Engineer in the AI Training Service team at Microsoft. She works on Siphon, a near real-time distributed data bus based, where she has gained extensive experience running Kafka in production.
Title: " Processing trillions of events per day with Apache Kafka on Azure "
Abstract: In this talk, we share our experience and learnings from running one of world’s largest Kafka deployments. Besides underlying infrastructure considerations, we discuss several tunable Kafka broker and client configurations that affect message throughput, latency and durability. After running hundreds of experiments, we have standardized the Kafka configurations required to achieve maximum utilization for various production use cases. We demonstrate how to tune a Kafka cluster to get the best possible performance for anyone planning to run a production Kafka cluster.
- 6:30 to 7:00 pm
Speaker: Kai Waehner, Confluent
Kai Waehner works as Technology Evangelist at Confluent. Kai’s main area of expertise lies within the fields of Big Data Analytics, Machine Learning / Deep Learning, Cloud / Hybrid Architectures, Messaging, Integration, Microservices, Stream Processing, Internet of Things and Blockchain. He is regular speaker at international conferences such as JavaOne, O’Reilly Software Architecture or ApacheCon, writes articles for professional journals, and shares his experiences with new technologies on his blog (www.kai-waehner.de/blog).
Title: " How to Leverage the Apache Kafka Ecosystem to Productionize Machine Learning"
Abstract: This talk shows how to productionize Machine Learning models in mission-critical and scalable real time applications by leveraging Apache Kafka as event streaming platform. The talk discusses the relation between Machine Learning frameworks such as TensorFlow, DeepLearning4J or H2O and the Apache Kafka ecosystem. The talk also discusses hybrid Kafka architectures combining model training in the public cloud and deployment of the model at the edge..
- 7:00 to 7:30 pm
Speaker: Justin Long, Skymind
Justin Long is a Deep Learning Engineer at Skymind and was also the founder of the first AI for dating, Bernie. Long’s career has taken him through ad-tech, dating, telecoms, and even energy.
Title: " Kafka for backpressure and aggregation for complex analysis and anomaly detection"
Abstract: The amount of log data generated a single company can easily exceed 1 TB per day. Intelligent analysis for fraud and intrusion detection is difficult when bad actors develop increasingly sophisticated behavior to avoid detection. In this talk Justin Long from Skymind will show the basics of using Kafka for backpressure and aggregation for complex analysis and anomaly detection.
- 7:30 to 8:00 pm
Speaker: Paul Davidson, Salesforce
Paul Davidson is a Principal Engineer at Salesforce, working on a team responsible for operating Kafka reliably at scale.
Title: Mirus: reliable, high performance replication for Apache Kafka
Abstract: At Salesforce we manage high-volume Apache Kafka clusters in a growing number of data centers around the globe. Until recently we relied on Apache Kafka's Mirror Maker for cross-data center replication but, as the volume and variety of data increased, we needed a new solution to maintain a high standard of service reliability. In this talk we will describe Mirus, our open-source data replication tool based on Kafka Connect.

Sponsors
Seattle Apache Kafka Meetup