Hadoop for Newbies

Details
Abstract: We have a very diverse membership, with Hadoop experience ranging from none to expert. This meeting is aimed at those who are new to Hadoop. First, Brad will provide a brief overview of big data and Hadoop, including HDFS and the MapReduce programming model. Then, we will see how to solve the same simple stock data programming problem with a variety of Hadoop ecosystem tools, such as Java MapReduce, Hive, Pig, Impala, and Spark. After briefly showing Eclipse/Maven-based development and unit testing via MRUnit, Brad will deploy these examples on a live Hadoop cluster and also demonstrate a cluster administration, management, and monitoring tool. Join us for this “there are no dumb questions” session!
Speaker: Brad Rubin has been a professor in the Graduate Programs in Software department in the School of Engineering at the University of St. Thomas for the past 10 years. He is a founding faculty member of the Center of Excellence for Big Data, and teaches a course in Big Data Architecture, along with courses in Computer Security and Software Analysis & Design. He co-leads the Twin Cities Hadoop User Group. Previously, he spent most of his industry career at IBM in Rochester, MN. Brad has degrees in Computer and Electrical Engineering from the University of Illinois, Urbana and a doctorate in Computer Science from the University of Wisconsin, Madison.
Parking: Free, in the Anderson Ramp.
Food: Pizza and drinks, first come, first served, starting at 6:30 PM, provided by the University of St. Thomas Graduate Programs in Software.
Map: http://bit.ly/RCtaTI
