add-memberalign-toparrow-leftarrow-rightbellblockcalendarcamerachatchevron-leftchevron-rightchevron-small-downchevron-upcircle-with-crosscomposecrossfacebookflagfolderglobegoogleimagesinstagramkeylocation-pinmedalmoremuplabelShape 3 + Rectangle 1pagepersonpluspollsImported LayersImported LayersImported LayersshieldstartwitterwinbackClosewinbackCompletewinbackDiscountyahoo

Beyond the Hadoop Hype

Zero to Facebook Analytics in 6 hours

A $100 worth of AWS credits for every attendee is included in the price.

This workshop is for Technologists looking to gain hand-on experience with big data technologies, especially

  • Hadoop
  • Amazon's Elastic Map Reduce (EMR)
  • Mining Facebook data to design a data-driven marketing campaign
  • Using Map Reduce to crawl the web to create data mashups

The attendees will get a hands-on practitioner's approach to the techniques and tools required for analyzing Big Data. The goal is to enable attendees to

  • Become an immediate contributor on data science/analytics teams
  • Assist reframing a marketing challenge as an analytics challenge
  • Gather data from the web to prepare it for analysis
  • Apply appropriate analytic techniques and tools to analyze big data
  • Tell a compelling story with the data to drive business action
  • Use open source tools such as Python, Hadoop, and Elastic Map Reduce

Here is an outline of the day:

  1. Hello Hadoop World. It's to get everyone oriented, set up on Amazon EC2 and running a wordcount exercise. If you have not used Amazon EC2 before, we cover that here. We also cover installing Hadoop and using it. 90-minutes. We thank Amazon for their generous sponsorship of this workshop with a grant of up to $100 worth of AWS credits for every attendee.
  2. Introduction to Elastic Map Reduce. Builds on what we learned in the first exercise. 60 minutes.
  3. Discussion if the data collected and how the exercise is structured. 60 minutes.
  4. Web crawling using Map Reduce to acquire additional data. 60 minutes.
  5. More Map Reduce to analyze the original data plus the crawled data. 60 minutes.
  6. Discussion and wrap-up: 30 minutes.

Please bring your laptop (we'll use Windows but Mac or Linux are OK also).

Also be prepared to sign up for Amazon AWS. It requires a credit card although, with the $100 credit, the chances of you going over $100 are remote. Getting an Amazon AWS account also requires a cell phone — Amazon needs to call you with a code to set up your AWS account.

The instructions will be Linux-based but quite detailed so people with a general programming background can follow along.

To attend Where and When The Offices of Foley Hoag
Seaport West
155 Seaport Blvd
Boston, MA 02210

June 11, 2012,
9:00 AM - 4:30 PM.

Foley Hoag is not affiliated with Early Stage IT or its DataThinks service. About the Instructor

J Singh is Founder of Early Stage IT and helped create their service focusing on Big Data Solutions. DataThinks was started after he taught a graduate-level Database Systems class at Worcester Polytechnic Institute. J holds a PhD. in Electrical Engineering.

J was one of the featured speakers at Boston Software Engineers' first meetup and his talk on the Hadoop Ecosystem was very well received. J has extensive background in Financial Services and CAD/CAM industries. He co-chairs the Software and Services SIG at TiE Boston.

Join or login to comment.

12 went

Your organizer's refund policy for Beyond the Hadoop Hype

Refunds offered if:

  • the Meetup is cancelled
  • you can cancel at least 1 day(s) before the Meetup

Additional notes: If a meetup is rescheduled we will refund the money if you can't make the new date.

Payments you make go to the organizer, not to Meetup. You must make refund requests to the organizer.

Our Sponsors

  • Microsoft NERD

    Research and Innovation Center in the heart of Cambridge.

  • General Assembly

    Event Space, Co-Working, & Classrooms

  • Foliage

    Complex software-intensive and electromechanical products and systems.

  • RBM Technologies

    Tear down the barriers between data to maximize efficiency.

  • Startup Institute

    Immersive software engineering programs.

  • TripAdvisor

    Travel Website using user generated content to make traveling better.

  • Litmus

    Email Testing and Analytics.

  • Climb

    We're Reinventing the business of IT Recruiting.

  • NuoDB

    Cloud Relational Database

  • Akiban Technologies

    Providing the Full Power of SQL

  • Couchbase

    Simple, fast, elastic NoSQL database technology.

  • Free Software Foundation

    Promoting the use of free software and documentation everywhere.

  • Microsoft

    Encouraging the development of multiple software technologies.

  • Kiva Systems

    Automation technology for distribution centers.

  • Hopper

    Machine Learning + Big Data Travel Discovery Application.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy