Inside Apache Druid's Storage and Query Engine

Hosted by Chester Chen

Details

Registration link: https://www.aicamp.ai/event/eventdetails/W2021042112
We are using our partner AICamp's Zoom meeting, so you also need to register with AICamp at the link above. Once you have registered, an email with the Zoom meeting link will be sent to you. Sorry for the inconvenience.

Agenda
12:00 pm -- 12:05 pm PDT -- Join Zoom room
12:05 pm -- 12:50 pm PDT -- Talk + Q&A
1:00 pm PDT -- Closing

Apache Druid is an open-source columnar database known for high performance at scale; its largest deployments comprise thousands of servers. But no matter the scale, high performance starts with good fundamentals. This talk will dive into those fundamentals by exploring the inner workings of a single data server. We’ll cover how Apache Druid stores data, what kinds of compression it uses, how it indexes data, how the storage engine is linked with the query processing engine, and how the system handles resource management and multithreading. Together, all these pieces enable Apache Druid to process billions of records per second on a single data server.

Speaker: Gian Merlino (Imply)

Gian Merlino is CTO and a co-founder of Imply, a San Francisco-based technology company, and the Apache Druid PMC chair. Previously, Gian led the data ingestion team at Metamarkets (now a part of Snapchat) and held senior engineering positions at Yahoo. He holds a BS in Computer Science from Caltech.

SF Big Analytics
Online event