Making R play well with Hadoop - David Champagne & Antonio Piccolboni-Revolutio­n

This is a combined meetup with the LA R meetup group. Hadoop is rapidly being adopted as a major platform for storing and managing massive amounts of data, and for computing descriptive and query types of analytics on that data. However, it has a reputation for not being a suitable environment for high performance complex iterative algorithms such as logistic regression, generalized linear models, and decision trees.

This presentation will explain and demonstrate how to use R with Hadoop for high-performance analytics that scale.  We will present three R packages designed to work with Hadoop:

•  RHadoop

•  plyrmr

•  Revolution R Enterprise ScaleR

Each of these R packages provides the Data Scientist the ability to work with data stored in Hadoop and leverage the full power of the MapReduce framework for model building, model estimating, data transformation and visualization.

Revolution Analytics presenter Bio's: 

David Champagne is an innovative technology leader with over 20 years of experience in enterprise and web application development for business customers across a wide range of industries. As Chief Architect at Revolution Analytics he has led  the development teams and has overall product responsibilities.  Prior to joining Revolution Analytics, he was Principal Architect/Engineer for SPSS .

Antonio Piccolboni is a data scientist with both industrial and academic experience. His recent work includes the design and implementation of a big data analysis package in R, social network analysis for a top 20 global web site and web analytics for a major web ratings company. He is currently an independent consultant with clients including Dataspora and Revolution Analytics


Join or login to comment.

  • - Szilard Pafka -

    Video recording here: http://www.ustream.tv/recorded/39729279 Thanks Antonio and David for coming and for the excellent talk.

    2 · October 14, 2013

  • Gary K.

    I missed the event. Is there a presentation that can be shared or view online ?

    October 11, 2013

  • Andrew D.

    Well, if there is an online recording for later, then I will attend the Oxbridge Big Data in Healthcare thing over at Caltech. I'm sure that technically, this will be so much better than the Caltech event, but if I can watch the Hadoop lecture later...

    October 10, 2013

    • Andrew D.

      An e-mail solicitation was sent to my boss at City of Hope in Duarte, CA:

      October 11, 2013

    • Andrew D.

      LA Chapter is hosting a panel on Big Data in Biotechnology and Healthcare on October 10th (5 PM-7PM) at the Caltech Cahill Center (1216 California Blvd, Pasadena). http://www.oxbridgebi...­
      Feel free to email Zach Shao at [masked] if you have any questions.Sincerely,Jane­t Chung

      October 11, 2013

  • Farhad

    A little short, Demo would be nice

    October 10, 2013

    • Jeff W.

      I think they had a demo planned, but the projector wouldn't hook up with their computers. I remembering them offering to do a demo after the presentation.
      Though to your point, yes, seeing it in action would've been nice.

      October 11, 2013

  • Daniel G.

    Great presentation!

    1 · October 10, 2013

  • Jeff W.

    Great presentation. I think it's still in the development stage, but hey--- so am I. So maybe it will be more accessible to me as I gain more skills.

    October 10, 2013

  • Carlos M.

    Will parking be provided?

    October 10, 2013

  • Jeff W.

    This is a great topic and worth seeing more than once. Is anybody videotaping? If not, I'd like to Volunteer to do that.

    1 · October 7, 2013

    • Subash D.

      Thanks but i think we have that covered. :-)

      1 · October 10, 2013

    • Jeff W.

      Most excellent!

      October 10, 2013

  • A former member
    A former member

    Looking forward to this.

    I'm coming from the 91 and 605. Kangsan, or anybody else wants to carpool.

    October 10, 2013

  • Joseph W.

    will attend remotely if we are steaming

    September 16, 2013

  • A former member
    A former member

    How long are we expecting this event to last? And if we can't make the full thing, can somebody save the video online somewhere?

    October 9, 2013

    • Subash D.

      The whole event is usually about 2 hrs but the presentation is usually about an hr. Yes we should have a recording later available.

      October 10, 2013

  • Siobhain T.

    If everything goes according to plan we’ll be streaming.
    http://www.ustream.tv/channel/l-a-hadoop-user-group

    4 · October 9, 2013

    • David R.

      I changed my reservation to no - I'll definitely watch the stream. :)

      October 9, 2013

  • Michelle

    Interested in learning more about hadoop, R and analytics

    September 30, 2013

  • A former member
    A former member

    Is there anyone who can carpool with me? Then I will really appriciate you. Thanks

    September 16, 2013

People in this
Meetup are also in:

Sometimes the best Meetup Group is the one you start

Get started Learn more
Rafaël

We just grab a coffee and speak French. Some people have been coming every week for months... it creates a kind of warmth to the group.

Rafaël, started French Conversation Group

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy