
Hadoop Workshop I: configure your first Hadoop cluster on Amazon EC2

  • Apr 7, 2014 · 7:00 PM
  • Conductor Inc

$5/event is collected for your dinner. 

$5/month is collected to offer you a premium learning experience in our workshops through video recording and screen-sharing services such as join.me.

Speaker: Vivian Zhang, CTO and co-founder of SupStat Inc, organizer of the NYC Open Data Meetup, and founder of NYC Data Science Academy (http://nycdatascience.com/). She teaches R and Hadoop.

Her data school hires the best working professionals to teach Python, D3.js and related data science skills. All the courses are designed to teach you employable skills. We teach the skills and toolkits in class and help students complete projects of their own choice. Students will showcase their projects in this meetup group at the end of their courses.

Outline:

In Hadoop Workshops I and II, I will walk you through the steps to configure a Hadoop cluster on Amazon EC2 and run two simple map-reduce jobs on the cluster.
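For readers who have not seen a map-reduce job before, here is a minimal sketch of the idea in plain Python. It is not the Hadoop API and it is not one of the workshop's actual jobs, which are not shown in this listing; the word-count task and the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group values by key, roughly what the framework does between map and reduce."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce step: collapse all counts for one word into a single total."""
    return word, sum(counts)


def word_count(lines):
    """Run map, shuffle, and reduce in sequence on a list of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(word, counts) for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    # Hypothetical sample input, standing in for files stored on the cluster.
    sample = ["hadoop runs map reduce", "map then reduce"]
    print(word_count(sample))
```

On a real cluster the map and reduce steps run in parallel on different machines and the grouping step is handled by the framework; this single-process version only shows how the data flows.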

Preparation:

1. Sign up for an Amazon AWS account at http://aws.amazon.com/account/

2. Get familiar with basic vi commands. (If you don't know them, I can show you quickly. You are welcome to read more before coming.)

3. You don't need to know Java at this moment. If you know Java, you can program in Hadoop quickly in later workshops.


  • Shangxuan Vivian Z.

    I added the PuTTY part to my tutorial, under "generate your server RSA key for three instances". Note: Windows users can refer to "Connect to Your Amazon EC2 Instance from Windows Using PuTTY".

    April 9, 2014

    • Esther M.

      thanks Vivian I checked that out!

      April 9, 2014

  • Devora R.

    Can I apply the fifteen dollars (three reservations) to another event? We had a production issue we needed to troubleshoot. Thanks.

    April 8, 2014

  • Mandy

    Yes thanks Vivian for hosting again!

    Here is a link with info that will help Windows users connect to EC2 instances using PuTTY for SSH.

    http://docs.aws.amazon.com/gettingstarted/latest/wah-linux/getting-started-deploy-app-connect.html

    April 9, 2014

  • Esther M.

    Vivian, many thanks for hosting that workshop on Tuesday. I will send an email to Lenin, who had some pretty detailed instructions on generating the server keys for the instances, and post it on here if he replies. Thanks again!

    April 9, 2014

  • David R.

    Hi Vivian,
    I got very ill yesterday and was disappointed I couldn't make the workshop. I hope you will repeat this one :)

    April 8, 2014

    • Shangxuan Vivian Z.

      David, you are already signed up for the full course. Hadoop I and II are just half of the week 1 class content.

      April 8, 2014

  • Shangxuan Vivian Z.

    Guys, you can't miss a great event just because you don't have a laptop. Work on the same laptop with someone who sits next to you. It is an easy way out!

    April 7, 2014

    • A former member

      All right, I thought bringing a computer was a must. Otherwise, sure, I'll come!

      April 7, 2014

    • Christopher J.

      Hey, I'd like to apply my $5 to another event. I'm too far away at this point and with no umbrella!

      April 7, 2014

  • Cheuk Kit L.

    What kind of AMI do we need? Can I just use Ubuntu on free tier Linux Micro instance?

    April 7, 2014

    • Cheuk Kit L.

      Thanks. Just want to let you know: there is no free tier for this instance. The minimum is the on-demand m3.medium instance, which is 7 cents/hour.

      April 7, 2014

    • Shangxuan Vivian Z.

      I can't run map-reduce on the free tier; that is why I go with this configuration.

      April 7, 2014

  • Shangxuan Vivian Z.

    Steps to apply for an AWS account at http://aws.amazon.com/account/:
    enter your name and email;
    enter your full name, email, billing info, and credit card;
    pass the phone number authorization;
    select an AWS support plan, using Basic (Free);
    you should see the message "Thank you for updating your Amazon Web Services Account!"

    April 7, 2014

    • Shangxuan Vivian Z.

      The Meetup posting adds an extra "/" to the link I shared. Please delete it when you go to the website.

      April 7, 2014

  • Shangxuan Vivian Z.

    Please get your account ASAP. It might take Amazon up to 2 hours to validate your billing information.

    April 7, 2014

  • A former member

    I have two questions. About the price: I will only be charged $5.00 for today's event, right? And when I try to sign up for an Amazon AWS account, I am asked for payment information. Is it required?

    April 7, 2014

    • Arpit

      Payment info for AWS is required to enable access. But for the most part I'm assuming we will run within the free allowance so hopefully shouldn't get charged.

      April 7, 2014

    • Shangxuan Vivian Z.

      We are not using the free tier, because we can't run map-reduce jobs on that configuration. The instances I will use only cost 2 cents per hour.

      April 7, 2014

  • Shangxuan Vivian Z.

    This class has online, weekday, and weekend offerings. Please see the details at http://nycdatascience.com/course/hadoop-data-analytic-platform/

    April 7, 2014

  • A former member

    I assume we bring our laptops, right? Any software requirement?

    April 7, 2014

  • Shangxuan Vivian Z.

    Sample slides for week 1, Hadoop Intermediate level:
    http://www.slideshare.net/ShangxuanZhang/hadoop-dev-01 . If you are interested, you can sign up for the class at http://www.meetup.com/NYC-Data-Science-Academy/events/172992002/

    March 29, 2014

Your organizer's refund policy for Hadoop Workshop I: configure your first Hadoop cluster on Amazon EC2

Refunds offered if:

  • the Meetup is cancelled
  • the Meetup is rescheduled
  • you can cancel at least 3 day(s) before the Meetup

Payments you make go to the organizer, not to Meetup. You must make refund requests to the organizer.

Regardless of the refund policy set by the organizer, Meetup may issue refunds on an organizer's behalf if we determine that Meetup's Payment Policies have been violated.

Our Sponsors

  • NYC Data Science Academy

    Use the "nycopen100ff" coupon to take classes at www.nycdatascience.com

  • Supstat

    Supstat shares its expertise in data mining and visualization.
