Skip to content

AN EXAMPLE OF A MAP/REDUCE ALGORITHM USING R AND HADOOP

Photo of Yodit Stanton
Hosted By
Yodit S.
AN EXAMPLE OF A MAP/REDUCE ALGORITHM USING R AND HADOOP

Details

In this session, Anette Bergo talk will give us a tutorial on conducting statistical analysis on large data sets.

R has been described as 'a DSL for statistical analysis'. Hadoop is for LARGE scale computing. Between them, they can take on a number of interesting problems - once you get them to play together. Which is actually both easier and more accessible than you might think. In this demo I will solve a simple map/reduce problem in R, and run it on an Amazon EMR cluster.

Anette is a consultant for ThoughtWorks where she builds people, teams, projects and occasionally a bit of code. She has worked in a number of different countries, industries and development stacks to solve all sorts of problems, but lately it has been R and EMR and big piles of data that has been taking up her time.

This is a hands on session so please bring a laptop with R stats analysis package installed.

Don't forget to sign-up on the Skills Matter site after you RSVP on meetup http://skillsmatter.com/podcast/home/an-example-of-a-mapreduce-algorithm-using-r-and-hadoop

Photo of Women Crunching Data group
Women Crunching Data
See more events
Skills Matter
The Skills Matter eXchange, 116-120 Goswell Road · London EC1V7DP