Data Club Meetup - Introduction to Streaming Distributed Processing using Storm


Details
Abstract
In the world of Big Data, analytics systems have benefitted greatly from the ability to scale horizontally. Systems like Hadoop have been widely used to perform distributed batch processing on massive data sets, but there is a growing need in the industry to do the same scale of processing except in a real-time streaming fashion. Apache Storm is one such framework that enables this kind of processing. In this session, Brandon will introduce the core concepts of streaming distributed processing using Storm, the architecture of a Storm cluster, and show you what it takes to build your first Storm topology.
About Storm
Apache Storm is an open-source distributed realtime computation system used in the industry by companies like Twitter, Spotify, Expedia and others. Storm makes it easy to reliably process unbounded streams of data, doing for
realtime processing what Hadoop did for batch processing.
About Brandon
http://photos3.meetupstatic.com/photos/event/6/d/b/5/600_438208085.jpeg
Brandon O’Brien is a Data Engineer working at Expedia who is leveraging Storm to build a real time travel market analytics platform called Expedia Insights. Contact: https://www.linkedin.com/in/brandonjobrien
Please bring your laptop, if you want to implement code.
Directions and Parking: http://www.ci.bellevue.wa.us/parking-directions.htm
Bellevue City Hall provides complimentary parking.

Data Club Meetup - Introduction to Streaming Distributed Processing using Storm