Join us for the inaugural Druid LA meetup! You can register on this meetup page or at https://www.eventbrite.com/e/ideas-druidio-la-data-science-meet-up-tickets-50101621298
Apache Druid (incubating) is a high-performance analytics data store for event-driven data.
Druid is primarily used to store, query, and analyze large event streams. Examples of event streams include user-generated data such as clickstreams, application-generated data such as performance metrics, and machine-generated data such as network flows and server metrics. Druid is optimized for sub-second queries that slice and dice, drill down, search, filter, and aggregate this data. Druid is commonly used to power interactive applications where performance, concurrency, and uptime are important.
To learn more about Druid, please visit: http://druid.io/
*** Notes ***
We will be co-hosting this meetup with the International Data Engineering and Science Association (https://www.ideassn.org/).
IDEAS builds a hub for Artificial Intelligence, Blockchain, Data Engineering, and Data Science by providing robust resources and connecting real-world expertise from business leaders, professionals, and promising students. The vision is to foster the AI, data engineering, and data science ecosystems and broaden the adoption of their underlying technologies, thus accelerating the innovations data can bring to society. IDEAS empowers and nurtures community growth by offering online resources, conferences, the latest industry trends, and data-related job opportunities.
*** Presentations ***
Talk 1: The rise of operational analytic data stores
Operational analytic data stores are an emerging class of databases that merge ideas from log search systems (Elastic, Splunk, etc.) and traditional analytic databases (Vertica, Teradata, etc.). Popular open source projects in this class include Apache Druid (incubating), ClickHouse (from Yandex), Pinot (from LinkedIn), Palo (from Baidu), and more. We will discuss the motivation behind these databases and cover in detail the history, architecture, and future of Druid.
Speaker: Fangjin Yang
Fangjin is a co-author of the open source Druid project and a co-founder of Imply, a San Francisco-based technology company. Fangjin previously held senior engineering positions at Metamarkets (now a part of Snap) and Cisco. He holds a BASc in Electrical Engineering and an MASc in Computer Engineering from the University of Waterloo, Canada.
Talk 2: A peek into big data analytics systems at Snap
Snap’s programmatic advertising platform drives over $2 million in revenue per day. Additionally, Snap’s ~188 million daily active users communicate through many billions of engaging interactions per day. Charles will give a high-level overview of some of the ways Druid is used inside Snap to gain insights from this massive data stream. Topics to be covered include getting data into Druid, managing a large-scale Druid deployment, and various ways of getting effective insights.
Speaker: Charles Allen
Charles Allen received his Ph.D. in Electrical Engineering from Purdue University in 2010. He has developed various solutions in the big data space for companies such as Acxiom, Metamarkets, and Snap. He has been heavily involved in Druid.io since about 2014 and currently works at Snap in Santa Monica, California.
6:30 - 7:00 pm: Check in, settle, and networking
7:00 - 7:05 pm: Intros
7:05 - 7:30 pm: Talk #1
7:30 - 7:55 pm: Talk #2
7:55 - 8:00 pm: Wrap up
If you are in the LA area and interested in presenting at one of the meetups, please contact the organizer.
Space is limited, so please RSVP!