This will be a 15 minute lightning talk covering a few ways to get started using R to analyze data stored in a Hadoop cluster. I'll cover solutions for running MapReduce jobs through R, then turn to a couple of more interesting packages that tie HBase and Spark into R.
A discussion of Data for Good - Ryan Elmore and Matt Pocernich
We have been discussing the concept of starting a group for statisticians and data people to help local non-profits or groups with limited resources productively use their data. Conceptually, this would be similar a Datakind Chapter. We would like to gauge interest and get suggestions on how such a group might function as well as possible topics.
If you are interested - but can't make this meetup, send us a note and we will keep you in the loop as things progress.