BDAM 11/08: Big Data Security, Apache Pulsar and more!


Details
Shoutout to Streamlio for kindly sponsoring this meetup!
Streamlio will also be giving away an Amazon Echo Dot! Enter the raffle on the day of the event for a chance to win.
AGENDA
6:00 - 6:30 - Socialize over food and beverages
6:30 - 8:00 - Talks
TALKS
Talk #1: Foundations for Securing Big Data Applications, by Yaojie Feng from Cask
Talk #2: Multi-tenant and Geo-replication Messaging with Apache Pulsar, by Matteo Merli and Sijie Guo from Streamlio
ABSTRACTS
Talk #1: Foundations for Securing Big Data Applications, by Yaojie Feng from Cask
With the ever increasing amount of data being processed, big data security breaches can mean big problems, which is why effective security controls have become a hard requirement for managing data in particular in large environments. In this talk, Yaojie Feng will introduce the foundations of securing big data applications. He will discuss the importance of authentication, authorization, impersonation and other security controls for big data initiatives, and demonstrate how they complement the existing security model in Hadoop and Spark from data prep to production.
Talk #2: Multi-tenant and Geo-replication Messaging with Apache Pulsar, by Matteo Merli and Sijie Guo from Streamlio
In this session presented by Matteo Merli and Sijie Guo, learn how Yahoo developed a messaging system from the ground-up to support their enterprise requirements of multi-tenancy and geo-replication to support mission-critical services like Yahoo Mail, Finance, Sports, and Gemini ad network.
SPEAKER BIOS
• Yaojie Feng is a Software Engineer at Cask where he is building software to simplify data application development. He is also an open source contributor for Apache Twill. Yaojie has a Masters degree in Computer Science from University of Illinois at Urbana-Champaign.
• Matteo Merli is a software engineer at Streamlio, where he works on messaging and storage technologies. Previously, he spent several years building database replication systems and multi-tenant messaging platforms at Yahoo. Matteo was the architect and lead developer for Pulsar and is a PMC member of Apache BookKeeper.
• Sijie Guo is the cofounder of Streamlio, a company focused on building a next-generation real-time data stack. Previously, he was the tech lead for the messaging group at Twitter, where he co-created Apache DistributedLog, and worked on push notification infrastructure at Yahoo. He is the PMC chair of Apache BookKeeper.
ARRIVAL AND PARKING
Cask HQ is a few minutes walk from the California Avenue Caltrain Station.
Also, Cask HQ has its own parking lot, but it will certainly not accommodate all guests. Please use parking lots available nearby:
https://secure.meetupstatic.com/photos/event/5/b/2/f/600_438983343.jpeg

BDAM 11/08: Big Data Security, Apache Pulsar and more!