
A Detailed Look at R on Hadoop

Photo of Andrey Vykhodtsev
Hosted By
Andrey V. and Andrey O.

Details

R is a powerful open-source programming language and software environment for deep statistical analysis of data and graphical representation of the results. Key features include data exploration, statistical analysis, modeling, machine learning, simulation, and visualization.

However, R's full potential is limited by the memory available to a single process and by its inability to take advantage of parallel platforms such as Hadoop, which diminishes its ability to analyze very large data sets. Running R on Hadoop removes these limits, so data scientists can focus on the statistical analysis of large data sets without being concerned with the underlying infrastructure.

This session will provide an overview of R, a comparison of Hadoop distributions and their support for R, and a live demo of the capabilities of R on Hadoop.

Data, Cloud and AI in Moscow