Zero to Facebook Analytics in 6 hours
A $100 worth of AWS credits for every attendee is included in the price.
This workshop is for Technologists looking to gain hand-on experience with big data technologies, especially
- Amazon's Elastic Map Reduce (EMR)
- Mining Facebook data to design a data-driven marketing campaign
- Using Map Reduce to crawl the web to create data mashups
The attendees will get a hands-on practitioner's approach to the techniques and tools required for analyzing Big Data. The goal is to enable attendees to
- Become an immediate contributor on data science/analytics teams
- Assist reframing a marketing challenge as an analytics challenge
- Gather data from the web to prepare it for analysis
- Apply appropriate analytic techniques and tools to analyze big data
- Tell a compelling story with the data to drive business action
- Use open source tools such as Python, Hadoop, and Elastic Map Reduce
Here is an outline of the day:
- Hello Hadoop World. It's to get everyone oriented, set up on Amazon EC2 and running a wordcount exercise. If you have not used Amazon EC2 before, we cover that here. We also cover installing Hadoop and using it. 90-minutes. We thank Amazon for their generous sponsorship of this workshop with a grant of up to $100 worth of AWS credits for every attendee.
- Introduction to Elastic Map Reduce. Builds on what we learned in the first exercise. 60 minutes.
- Discussion if the data collected and how the exercise is structured. 60 minutes.
- Web crawling using Map Reduce to acquire additional data. 60 minutes.
- More Map Reduce to analyze the original data plus the crawled data. 60 minutes.
- Discussion and wrap-up: 30 minutes.
Please bring your laptop (we'll use Windows but Mac or Linux are OK also).
Also be prepared to sign up for Amazon AWS. It requires a credit card although, with the $100 credit, the chances of you going over $100 are remote. Getting an Amazon AWS account also requires a cell phone — Amazon needs to call you with a code to set up your AWS account.
The instructions will be Linux-based but quite detailed so people with a general programming background can follow along.
To attend Where and When The Offices of Foley Hoag
155 Seaport Blvd
Boston, MA 02210
June 11, 2012,
9:00 AM - 4:30 PM.
Foley Hoag is not affiliated with Early Stage IT or its DataThinks service. About the Instructor
J Singh is Founder of Early Stage IT and helped create their DataThinks.org service focusing on Big Data Solutions. DataThinks was started after he taught a graduate-level Database Systems class at Worcester Polytechnic Institute. J holds a PhD. in Electrical Engineering.
J was one of the featured speakers at Boston Software Engineers' first meetup and his talk on the Hadoop Ecosystem was very well received. J has extensive background in Financial Services and CAD/CAM industries. He co-chairs the Software and Services SIG at TiE Boston.