Skip to content

Data Club Meetup - Introduction to Streaming Distributed Processing using Storm

Photo of Fahad Shah
Hosted By
Fahad S. and Bhaumik C.
Data Club Meetup - Introduction to Streaming Distributed Processing using Storm

Details

Abstract
In the world of Big Data, analytics systems have benefitted greatly from the ability to scale horizontally. Systems like Hadoop have been widely used to perform distributed batch processing on massive data sets, but there is a growing need in the industry to do the same scale of processing except in a real-time streaming fashion. Apache Storm is one such framework that enables this kind of processing. In this session, Brandon will introduce the core concepts of streaming distributed processing using Storm, the architecture of a Storm cluster, and show you what it takes to build your first Storm topology.

About Storm
Apache Storm is an open-source distributed realtime computation system used in the industry by companies like Twitter, Spotify, Expedia and others. Storm makes it easy to reliably process unbounded streams of data, doing for
realtime processing what Hadoop did for batch processing.

About Brandon

http://photos3.meetupstatic.com/photos/event/6/d/b/5/600_438208085.jpeg

Brandon O’Brien is a Data Engineer working at Expedia who is leveraging Storm to build a real time travel market analytics platform called Expedia Insights. Contact: https://www.linkedin.com/in/brandonjobrien

Please bring your laptop, if you want to implement code.

Directions and Parking: http://www.ci.bellevue.wa.us/parking-directions.htm

Bellevue City Hall provides complimentary parking.

Photo of SIGKDD Seattle Chapter group
SIGKDD Seattle Chapter
See more events
Bellevue City Hall
450 110th Ave NE · Bellevue, WA