A Detailed Look at R on Hadoop


Details
R is a powerful Open Source programming language and software environment that enables deep statistical analysis of data, and graphical representation of the results. Key features include data exploration, statistical analysis, modeling, machine learning, simulations, and visualization of the results.
But, R's full potential is limited by process memory utilization and its inability to take advantage of parallel platforms, such as HADOOP, diminishing its ability to analyze very large data sets. Using R on Hadoop breaks all this limits. These capabilities enable data scientists to focus on the statistical analysis of large data sets, without being concerned with the underlying infrastructure.
This session will provide an overview of R, comparison of Hadoop distributives with R-ability and live demo of the capabilities of R on Hadoop.

Sponsors
A Detailed Look at R on Hadoop