Skip to content

Building effective near-real-time analytics with Spark Streaming and Kudu

Photo of Andrew R Kursar
Hosted By
Andrew R K.
Building effective near-real-time analytics with Spark Streaming and Kudu

Details

Spark Streaming allows developers to create streaming pipelines that harness the rich Spark API, and Kudu is a new storage layer that is able to capture the output of a stream in the same place that analytics users can then query that same data. Together these Hadoop ecosystem components allow for some exciting near-real-time analytics use cases. In this talk Cloudera will explore what is possible with this combination, how to efficiently develop these pipelines, and how to identify and avoid the pitfalls. This talk will be led by Jeremy Beard, Senior Solutions Architect at Cloudera.

Food & Refreshments Provided.

For Building Security purposes, please provide your Full Name.

Photo of New York Hadoop User group group
New York Hadoop User group
See more events
MediaMath
4 World Trade Center, 150 Greenwich Street, 45th Floor · New York, NY