Skip to content

Spark Structured Streaming : Introduction and Internals

Photo of Revin
Hosted By
Revin and Abhishek G.
Spark Structured Streaming : Introduction and Internals

Details

Structured Streaming is a new stream processing engine built on Spark SQL, which enables developers to express queries using powerful high-level APIs including DataFrames, Dataset and SQL.

In this meetup, we'll walk through the basics of Structured Streaming, its programming model and processing the data in Kafka with Structured Streaming. We'll use the techniques including Window operations, Watermarking and Triggers. The concepts will be illustrated using code examples.

Presenter: Revin Chalil. Big Data Development Engineer. Expedia.

Photo of Seattle Data Science and Data Engineering group
Seattle Data Science and Data Engineering
See more events
extraSlice - The Place for Tech
3600 136th Pl SE #300 · Bellevue, WA