Skip to content

Druid NYC Meetup @ Datadog

Photo of Imply Developer Relations
Hosted By
Imply Developer R.
Druid NYC Meetup @ Datadog

Details

Join us for the inaugural Druid NYC meetup! Please bring government issued ID as this meetup is in the NY Times building.

Apache Druid (incubating) is a high performance real-time analytics database.

Druid is primarily used to store, query, and analyze large event streams. Examples of event streams include user generated data such as clickstreams, application generated data such as performance metrics, and machine generated data such as network flows and server metrics. Druid is optimized for sub-second queries to slice-and-dice, drill down, search, filter, and aggregate this data. Druid is commonly used to power interactive applications where performance, concurrency, and uptime are important.

To learn more about Druid, please visit: http://druid.io/

*** Presentations ***

Talk 1: Swimming in the data river, or, when “streaming analytics” isn’t

Abstract:
The dirty secret of most “streaming analytics” technologies is that they are just stream processors: they sit on a stream and continuously compute the results of a particular query. They’re good for alerting, keeping a dashboard up-to-date in real time, and streaming ETL, but they’re not good at powering apps that give you true insight into what is happening: for this you need the ability to explore, slice/dice, drill down, and search into the data.This talk will cover the current state of the streaming analytics world, what Druid brings to the table, and some of the technical details behind its design and its integration with Kafka.

Speaker: Gian Merlino
Gian is a cofounder and CTO of Imply, a San Francisco based technology company. Gian is also one of the main committers of Druid. Previously, Gian led the data ingestion team at Metamarkets and held senior engineering positions at Yahoo. He holds a BS in Computer Science from Caltech.

Talk 2: KSQL: the power of Kafka with the simplicity of SQL

Abstract:
In the past few years, Apache Kakfa has become a popular and widely used streaming platform. With KSQL, the simplicity of the SQL language was brought to this platform to allow anyone to explore the data in realtime. In this talk, we will see how KSQL not only offers the ability to run streaming queries with a familiar language, but also how it allows you to run streaming ETL jobs.

Speaker: Alexis Seigneurin
Alexis Seigneurin is a Data Engineer at Ippon USA. He focuses on Big Data and Streaming technologies and enjoys working with open source frameworks (Apache Kafka, Apache Spark...). When possible, he contributes his work back to the open source community (he is the author of the Scala API for Kafka Streams) and he loves sharing his work through blog posts. Alexis also enjoys studying Machine Learning techniques so as to bridge the gap between Data Science and Data Engineering.

Agenda:

6:30 - 7:00 pm: Check in and settle, networking
7:00 - 7:05 pm: Intros
7:05 - 7:30 pm - Talk #1
7:30 - 7:55 pm - Talk #2
7:55 - 8:00 pm - Questions and wrap up

If you are in the NYC area and interested in presenting at one of the meetups, please contact the organizer.

Space is limited so please RSVP!

Photo of Apache Druid® USA Northeast group
Apache Druid® USA Northeast
See more events
Datadog
New York Times Bldg, 620 8th Avenue, 45th Floor · New York, NY