A guide to Python frameworks for Hadoop

Distributed computing frameworks like Hadoop have revolutionized our ability to process large amounts of data. Using these tools typically requires writing complex programs in lower-level languages like Java; however, data scientists and analysts prefer to spend time in higher-level languages, such as Python. In order to address this gap, multiple open-source Python frameworks have been built to enable simple, user-friendly access to Hadoop’s underlying systems. This talk will review the different available frameworks, including a comparison of performance, ease of use/installation, differences in implementation, and other features.

Bio
Uri Laserson is a data scientist at Cloudera. Previously, he received his PhD from MIT developing applications of high-throughput DNA sequencing to immunology. During that time, he co-founded Good Start Genetics, a next-generation diagnostics company focused on genetic carrier screening. In 2012 he was selected to Forbes's list of 30 under 30.

Join or login to comment.

  • Nitin k.

    Will there be a recording for people like me who cannot make it?

    June 12, 2013

    • Pete S.

      Yep, we'll have a recording up on g33ktalk.com. Will keep you posted.

      June 13, 2013

    • Salil N.

      Hi Peter!
      I am not able to find the above recording on your website. Can you post the link directly.

      October 5, 2013

  • venkatanathen

    It was good. Very good high level introduction to available frameworks

    June 14, 2013

  • A former member
    A former member

    Great talk! Thanx 4 the yummy 2 boots pizza! Always wanted to check-in at foursquare on foursquare :-)

    June 14, 2013

  • Volney S.

    It saved considerable trial and error and experimentation.

    June 13, 2013

  • Volney S.

    Uri gave a very good talk about using Python over map reduce both as native Python and then using Python frameworks designed for Map Reduce such as MRJob and others that he did not like as much. Uri also ended up talking a little about Cloudera's recent ML (Machine Learning) offering which is not a distribution of Mahout but rather a highly parallelized set of capabilities built for map reduce. I found it very informative. It said considerable trial and error.

    1 · June 13, 2013

  • Arron G.

    Who has the wifi password?

    June 13, 2013

  • Michel B.

    Can't make it tonight. Freeing up my spot!

    June 13, 2013

  • A former member
    A former member

    Pizza?

    June 13, 2013

    • Adam S.

      Plenty of pizza.

      June 13, 2013

  • A former member
    A former member

    Can't make it..stuck at school. Also hoping for a livestream..

    June 13, 2013

  • Debbie S.

    I am not able to attend unfortunately. Please give my spot away!

    June 13, 2013

  • A former member
    A former member

    Sorry I can't make it tonight.

    June 13, 2013

  • Brian M.

    Someone can have my spot, can't make it.

    June 13, 2013

  • Greg M.

    Neither of us will be able to come tonight. Revelytix is interested in hosting in September. Please let me know when you would like to agree on a discussion topic around data management, traceability and ease of use for Hadoop.

    Thanks

    June 13, 2013

  • Sudhakar

    Can't make it today. Freed up the space. Please post the link.

    June 13, 2013

  • Srini

    Can't go. Freeing up my spot now. Would still like to view the live stream, if there will be one.

    June 13, 2013

  • Duane L.

    Can't go. Freeing up my spot now. Would still like to view the live stream, if there will be one.

    June 13, 2013

  • Vishal G.

    Could you post your slide-deck in advance, I would like to see what's being covered.

    1 · June 13, 2013

  • Kaisar Nova K.

    Out of town.

    June 12, 2013

  • venkatanathen

    I hope, i can get the recordings...:)

    June 12, 2013

  • krishna m

    Add a commenthi

    June 12, 2013

  • Tzu-Yen W.

    Hi

    1 · June 7, 2013

  • ahsan f.

    Looking forward to it

    June 7, 2013

  • June 3, 2013

  • Vietnhi P.

    I hope they tape this :)

    June 2, 2013

  • A former member

    A former member changed the location to WFC Winter Garden

    June 2, 2013

  • Aijaz S.

    Hadoop for bio-medicine? What can it do?

    May 31, 2013

  • Jeren

    How to use Hadoop to distribute indexing work?

    May 30, 2013

  • Haobo L.

    What are the other major farmeworks with Hadoop?

    May 13, 2013

  • prabakar B.

    I believe in python tech stack...

    May 8, 2013

  • Priyank M.

    Ask attendees

    May 7, 2013

Our Sponsors

People in this
Meetup are also in:

Create a Meetup Group and meet new people

Get started Learn more
Allison

Meetup has allowed me to meet people I wouldn't have met naturally - they're totally different than me.

Allison, started Women's Adventure Travel

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy