July 14, 2013 · 10:00 AM
Hadoop, MapReduce, and related tools are not difficult, but they require thinking about a problem differently. In our very first meetup, I would like to focus on introducing this new way of thinking. We will work with a simple problem: counting token frequency in search datasets. We will first write a simple solution and then expand it into our first MapReduce program. Finally, we will run that program using Hadoop Streaming.
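To give a flavor of that shift in thinking, here is a minimal sketch of token counting written both ways. It is not the code from the class repository: the function names (`count_tokens`, `mapper`, `reducer`, `run_local`) and the sample queries are made up for illustration, and tokens are assumed to be lowercase, whitespace-separated words.

```python
from collections import Counter
from itertools import groupby


def count_tokens(queries):
    """Simple solution: keep one in-memory counter for all tokens."""
    counts = Counter()
    for query in queries:
        counts.update(query.lower().split())
    return dict(counts)


def mapper(lines):
    """Map step: emit a (token, 1) pair for every token on every line."""
    for line in lines:
        for token in line.lower().split():
            yield token, 1


def reducer(sorted_pairs):
    """Reduce step: sum the counts for each run of identical tokens.

    Assumes the pairs arrive sorted by token, which is the job of the
    shuffle/sort phase between map and reduce.
    """
    for token, group in groupby(sorted_pairs, key=lambda pair: pair[0]):
        yield token, sum(count for _, count in group)


def run_local(lines):
    """Simulate map -> sort -> reduce inside a single process."""
    return dict(reducer(sorted(mapper(lines))))


# Hypothetical sample data, standing in for a search query log.
sample_queries = [
    "hadoop install mac",
    "hadoop streaming python",
    "install python",
]

if __name__ == "__main__":
    print(count_tokens(sample_queries))
    print(run_local(sample_queries))
```

Both functions return the same counts. The difference is that the mapper never needs to see the whole dataset and the reducer only needs the pairs for one token at a time, which is what lets the work be spread across machines. With Hadoop Streaming, the mapper and reducer would typically be separate scripts that read lines from standard input and write tab-separated key/value lines to standard output, with Hadoop doing the sorting in between.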
To keep pace with the class, I highly recommend installing Hadoop locally on your machine. You can find instructions on how to set up Hadoop here: http://ragrawal.wordpress.com/2012/04/28/installing-hadoop-on-mac-osx-lion/
Also, all the source code related to this class is available on GitHub. Please download the code and data beforehand: https://github.com/ragrawal/meetup