Skip to content

Building a real-time transformation engine on Apache Spark Streaming

Photo of Brock Noland
Hosted By
Brock N.
Building a real-time transformation engine on Apache Spark Streaming

Details

Abstract: In this talk Brock will will detail how he integrated the StreamSets data collector with Spark Streaming to build a GUI driven real-time transformation engine. Spark Streaming is a fantastic tool, but writing Spark jobs can be difficult. StreamSets Spark Streaming integration opens streaming to a whole new class of users. This talk will be both a technical discussion of Spark Streaming and of the StreamSets data collector including use cases, architecture, scaling, and more.

Speaker: Brock Noland is an Apache Member, co-founder of Apache Sentry, and PMC member on Apache, Flume, Crunch, and MRUnit.

Parking: There are two options to pay for parking in the adjacent Anderson ramp. You can either enter/exit with a credit card, or you can take a ticket and use the pay kiosk on the northeast corner of the ramp to get an exit ticket.

Food: Pizza and drinks, first come first serve, starting at 6:30PM. phData is sponsoring https://phdata.io/

Map: http://bit.ly/RCtaTI

Photo of Twin Cities Spark and Hadoop User Group group
Twin Cities Spark and Hadoop User Group
See more events