add-memberalign-toparrow-leftarrow-rightbellblockcalendarcamerachatchevron-leftchevron-rightchevron-small-downchevron-upcircle-with-crosscomposecrossfacebookflagfolderglobegoogleimagesinstagramkeylocation-pinmedalmoremuplabelShape 3 + Rectangle 1pagepersonpluspollsImported LayersImported LayersImported LayersshieldstartwitterwinbackClosewinbackCompletewinbackDiscountyahoo

Rhadoop Data Hacking and Lightning Talks

We will be meeting in room 201 at the Meeting House in Columbia, MD.

Rhadoop Data Hacking

Ed Kohlwey, Booz Allen Hamilton

Rhadoop is an effective platform for doing exploratory data analysis over big data sets. The convenience of an interactive command-line interpreter and the overwhelming number of statistical and machine learning routines implemented in R libraries make a highly effective environment to perform elementary data science.

We'll discuss the basics of RHadoop: what it is, how to install it, and the API fundamentals. Next we'll discuss common use cases that you might want to use RHadoop for. Last, we'll run through an interactive example.

Ed Kohlwey is a developer/hacker/data scientist at Booz Allen Hamilton. He is generally interested by parallel computation and data analytics, and has worked in many problem domains including cyber, genomics, and finance. Ed is also one of the Meetup group coordinators.

Lightning Talks!

Rather than having a second presenter, we're going to try something new: lightning talks. This is a free-for-all: give a presentation on any topic that you think will interest the group. If its bad, we'll probably just boo you off the stage ;).

To be fair, we DO ask that you sign up here to give your talk. That way people will know if they have a shot of giving their talk, and people can also see the general talk topics that will be given and decide weather or not they want to stick around. Heres the general rules:

  1. Talks will be given priority on a first-register-first-serve basis.
  2. Talks should be limited to 10 minutes or less.
  3. Talks can be on any topic you think would be interesting to the group.
  4. If you have materials for your talk, e-mail them to Ed Kohlwey in Powerpoint format no later than 5pm EST on the August 20th at kohlwey_edmundatbahdotcom. Ed will consolidate them into a single, in-order deck to facilitate speed.
  5. Approximately 90 minutes are allocated for talks and we will do as many of them as possible. That means probably 9-10 talks.
  6. Q+A should be reserved for the general meandering that always follows the meeting.


5:30-6:00 pm - Networking and snacks

6:00-6:10 pm - Announcements and kickoff

6:10-6:45 pm - Rhadoop presentation

6:45-8:15 pm - Lightning talks

Join or login to comment.

  • Anupom G.

    Great talk!

    September 13, 2012

  • Denish

    We should schedule more events near Columbia,MD.

    September 4, 2012

    • A former member
      A former member

      Agreed. We need a DC Hadoop Group to match the Columbia/Baltimore Group.

      September 5, 2012

    • jinlian w.


      September 5, 2012

  • Clint G.

    Here are my slides:

    Not sure how clear they would be without the diatribe...

    August 24, 2012

    • Brian F.

      Thanks Ross. Couldn't agree more. I won't miss the next one.

      August 24, 2012

    • Clint G.

      Ross, I am really glad you enjoyed it. I a was a bit concerned that it wasn't technical, so I am happy it was valuable. Thanks for catching the misspell on Stonebraker, that's embarrassing.

      August 24, 2012

  • A former member
    A former member

    Great little gathering, definitely good exchange of data, very high signal-to-noise ratio. (Personal note on venue and travel: As a VA-dweller and DC-worker, however, Columbia is really too far after a workday; hope to see these much closer to core DC. )

    August 23, 2012

  • Jason B.

    I hope you plan on having more of the meetings in Columbia.

    August 23, 2012

  • Brandyn W.

    My slides on and

    August 22, 2012

    • A former member
      A former member


      August 22, 2012

  • Bruce T.

    Speaker 1 wasn't very dynamic and the discussion of 'R' was not meaningful. Due to time constraints, I had to leave early and I did not hear the next speaker.

    August 22, 2012

  • Vlad K.


    Are you going to post the slides?

    August 22, 2012

  • kathy m.

    Very informational, and people were very willing to answer questions.

    August 22, 2012

  • A former member
    A former member

    The talks were very diverse and i learned a few new things. I'm excited about checking out Hadoopy and

    August 22, 2012

  • A former member
    A former member

    Decent overview - went a little over on time - wish I could have stayed for the lightening talks.

    August 22, 2012

  • Matthew E.

    I'll be running late but I'll be there...

    August 20, 2012

Our Sponsors

  • Tetra Concepts

    Thank you to Tetra Concepts for sponsoring this meetup.

  • BAE Systems

    Thank you to BAE Systems for sponsoring this meetup.

  • Booz Allen

    Thank you to Booz Allen for sponsoring this meetup.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy