Skip to content

Bay Area Druid Meetup @ Pinterest

Photo of Gian
Hosted By
Gian and Vadim O.
Bay Area Druid Meetup @ Pinterest

Details

*** Notes ***

Please register with your full name and email for NDA purposes. IDs will be checked at the front door.

------------------------------------------------------------------------------------------------------

*** Presentations ***

Talk 1:
Pinterest: Our Journey to Operationalizing Druid at Scale

Speaker(s): Filip Jaros + colleagues

Abstract:
We will present the unique challenges of migrating our ads metrics store backed by HBase into a new stack on top of Druid. First, we will discuss why we chose Druid, highlighting the features which were impossible to implement with the old solution and which unlocked opportunities for us to bring deeper analytical capabilities to our ads platform. Next, we will take you through the long journey it took to change our access patterns and productionize Druid in our ecosystem:
Development of our special ingestion process on top of Apache Spark
Namespacing of segments to allow ingestions from multiple pipelines
Metadata service to short-circuit querying non-existent data
The choice of Native vs SQL querying
Effort of tuning access patterns from HBase-friendly to Druid-friendly.

Bio:
Filip Jaros is a software engineer at Pinterest. Starting his career in VoIP technologies, he has worked in the ads space for the last five years focusing on developing scalable ETL pipelines, designing database schemas and data retrieval services. Outside of the office, he is passionate about learning foreign languages, computer game design philosophy and nutrition science.

------------------------------------------------------------------------------------------------------

Talk 2:
Rethinking Druid's user experience

Speaker: Vadim Ogievetsky

Abstract:
Apache Druid has always been a fast, powerful, and scalable, but it has never been "user friendly" from a UX perspective. This talk will examine how the Druid UX is being redesigned from the ground up. This will make Druid straightforward to get started with, load data into, and manage at scale.

Bio: Vadim Ogievetsky, co-founder of Imply, cares about two things: making huge datasets accessible to everyone, and highly complex distributed systems easier to manage. Previously Vadim led the Application team at Metamarkets (acquired by Snap). He holds an MS in Computer Science from Stanford and a BA in Mathematics and Computer Science from Oxford.

------------------------------------------------------------------------------------------------------

Talk 3:
Druid Roadmap Discussion

Speaker: Gian Merlino

Abstract:
We will talk about Druid news, including details about the latest roadmap and releases.

Bio: Gian is a co-founder of Imply, a San Francisco based technology company, and a committer on Apache Druid. Previously, Gian led the data ingestion team at Metamarkets (now a part of Snapchat) and held senior engineering positions at Yahoo. He holds a BS in Computer Science from Caltech.

------------------------------------------------------------------------------------------------------

*** Schedule ***

6:00 - 6:25 -- People shuffle in, get food and beverage and talk
6:25 - 6:30 -- "Hi, Welcome to Druid Meetup Group" talk and introduction
6:30 - 7:30 -- Talk 1 + Q&A
7:30 - 8:00 -- Talk 2 + Q&A
8:00 - 8:15 -- Druid roadmap discussion
8:15 - 9:00 -- Networking, exit.

Photo of Apache Druid® San Francisco group
Apache Druid® San Francisco
See more events