Programming Hadoop: MapReduce using Python and an Intro to Pig


Details
Hi everyone,
This is a continuation of the previous meetup, where we learned some basics of Hadoop, MapReduce, and Hive. This new meetup will cover programming Hadoop: programmatic data processing in Hadoop, including writing and running a simple MapReduce job in Python. It will also include an introduction to Pig, which allows you to create MapReduce programs without having to learn MapReduce. The MapReduce/Python part will be presented by Matthew Rathbone, and the Pig part will be presented by Ryan Bosshart, who gave the previous presentation. For continuity, we will be using the same dataset as the previous meetup. The instructions for setting up the VM and downloading the dataset are in the previous meetup announcement.
Please don't forget to bring your laptop. I hope to see everyone there!
Thanks, Pitt Fagan
