Apache Druid Bay Area Meetup @ Unity Technologies



Please note that the date of this meetup has changed. Unfortunately, because of scheduling conflicts, we have had to postpone the meetup until April.

*** Presentations ***

Talk 1: High cardinality aggregations using Spark and Druid

Unity's monetization business generates billions of in-game events in a multi-sided marketplace, which creates complexity, slowness, and overhead for reporting. To work around these issues, Unity deploys an ingestion and aggregation pipeline based on Kafka, Spark, and Druid. In this talk, we'll discuss how Unity has built a high-cardinality data cube that allows business, product, and engineering groups to evaluate and act on the same data source.

Speaker: Mehdi Asefi, Unity Technologies
Mehdi Asefi completed his MS and PhD in Electrical and Computer Engineering at the University of Waterloo, Canada. He has worked on a range of problems in big data, machine learning, and data science at startups, midsize companies, and large companies. Prior to joining Unity Technologies, Mehdi was part of the personalization team at Yahoo, building machine learning pipelines for the Yahoo home page. Currently, Mehdi is the technical lead for the ads monetization data team, designing and building pipelines that feed data science model training and business reports.


Talk 2: Druid Ecosystem at Yahoo

Flurry Analytics enables you to measure and analyze activity across your app portfolio to answer your hardest questions and optimize your app experience. On a typical day, over 100B events stream into the system, and over 1M companies use Flurry. In this session, we will talk about how we have developed an ecosystem with Druid at its core, along with Kafka, Airflow, Superset, and Hive. We will also discuss ingesting data into Druid, querying your data, and monitoring and tuning Druid to work best for you.

Speaker: Niketh Sabbineni, Oath
Niketh is a Principal Engineer at Yahoo/Oath. In his current role, he works with Flurry and helps build the infrastructure and applications required to support analytics at petabyte scale. Niketh was previously the CTO at Bookpad, which was acquired by Yahoo in 2014. He holds a BTech in Computer Science from IIT.


Talk 3: Druid Roadmap Discussion

Gian will talk about Druid news, including details about the latest roadmap and releases.

Speaker: Gian Merlino
Gian is an Apache Druid (incubating) PMC member and a co-founder of Imply. Previously, Gian led the data ingestion team at Metamarkets and held senior engineering positions at Yahoo. He holds a BS in Computer Science from Caltech.


*** Schedule ***

6:30 - 7:00 -- People shuffle in, get food and beverages, and talk
7:00 - 7:05 -- "Hi, Welcome to Druid Meetup Group" talk and introduction
7:05 - 7:25 -- First speaker
7:25 - 7:30 -- Q/A
7:30 - 7:50 -- Second speaker
7:50 - 7:55 -- Q/A
7:55 - 8:15 -- Druid roadmap discussion
8:15 - 8:20 -- Q/A