Introduction to Distributed R, an open-source high-performance platform for R

Name: Introduction to Distributed R, an open-source high-performance platform for R
Start: 2015-04-10T18:45:00-07:00
End: 2015-04-10T20:15:00-07:00
Location: Hacker Dojo Large Event Room

Hosted By Silicon Valley AI

public group

Introduction to Distributed R, an open-source high-performance platform for R

Details

Data scientists struggle to find the right tools when it comes to processing Big Data. Wouldn’t it be nice if one could continue to use the familiar laptop based tools, such as R, to analyze Big Data?

In this talk, Jorge will introduce Distributed R, an extension to R for Big Data processing. Distributed R enables large scale machine learning, statistical analysis, and graph processing by splitting tasks across multiple cores and machines in a cluster. As a result, Distributed R is much faster than regular R and can handle much larger workloads. Data scientists can continue using their familiar R environment, benefit from a number of out-of-the box parallel algorithms, and even write their custom parallel applications.

Jorge will use the Kaggle March Madness dataset as an example to show how Distributed R is used to solve real life machine learning problems.

Bio: Jorge Martinez is part of HP Vertica engineering team and works on the HP Distributed R product. His interests are distributed systems and machine learning

Events in Mountain View, CA

Silicon Valley AI

See more events

Silicon Valley AI

Friday, April 10, 2015
6:45 PM to 8:15 PM PDT

Hacker Dojo Large Event Room

599 Fairchild Drive · Mountain View, CA

Silicon Valley AI

public group

Introduction to Distributed R, an open-source high-performance platform for R