Bangalore Apache Druid - Sub-second Slice and Dice your Data!
Details
Have you heard the term "Data River" before? If not, this meetup should be a great introduction to Data Rivers. Apache Druid is the newest tool in your analytics toolkit. It is a distributed, column-oriented OLAP database with native SQL support. It delivers sub-second ad-hoc queries against both streaming and batch (Hadoop) data. Come and learn about Data Rivers and how you can use Apache Druid to build your own. Benjamin Hopp, Imply solution architect, will be speaking about Druid and Imply. We will also have a presentation from our generous hosts at [27]7.ai: Anand Sinha, a Sr. Architect, will speak about how they are implementing Druid today.
3:30 – 3:45 - Socialize over food and drinks
3:45 – 4:00 - Welcome, opening remarks and announcements
4:00 – 4:45 - Benjamin Hopp, Imply Solution Architect
4:45 – 5:30 - Anand Sinha, [27]7.ai Sr. Data Architect
5:30 – 6:00 - Networking
Talk 1: "Setting the stage for fast analytics with Druid"
Speaker/Bio: Benjamin Hopp has been involved in architecting big data and streaming data solutions for companies of all sizes. Currently, he is a Solutions Architect with Imply, where he helps organizations deploy and manage Apache Druid solutions. Previously, he worked as a Senior Systems Architect with Hortonworks, specializing in streaming data use cases using HDF and Apache NiFi.
Abstract:
Druid is an emerging standard in the data infrastructure world, designed for high-performance slice-and-dice analytics (“OLAP”-style) on large data sets. This talk is for you if you’re interested in learning more about pushing Druid’s analytical performance to the limit. Perhaps you’re already running Druid and are looking to speed up your deployment, or perhaps you aren’t familiar with Druid and are interested in learning the basics. Some of the tips in this talk are Druid-specific, but many of them will apply to any operational analytics technology stack.
The most important contributor to a fast analytical setup is getting the data model right. The talk will center around various choices you can make to prepare your data to get the best possible query performance.
Talk 2: "Agents Interaction metrics and multidimensional performance reports"
Speaker/Bio: Anand Sinha
Abstract: TBD

