Dear members:

Welcome to our first meetup of 2017. We are glad to have Balaram Panda talk about R + Spark.

This talk may contain some commercial information, but it won’t be the majority or the focus of the presentation.

Unfortunately, file uploads are no longer supported, so here are the details of the talk:

Topic: Machine Learning @ Scale


Real information intelligence depends on mining large data sets and building models from them for better insights and predictions. However, technology limitations make the learning curve steep and create several challenges.

In this presentation I’ll try to simplify these challenges. The presentation aims to introduce an ecosystem, namely R + Spark, with a set of tested technologies and components that allow a data scientist to build machine learning models and analyse large-scale data sets efficiently, without too much technical learning effort.

I am going to address the following:

· What are the limitations of performing machine learning on large data sets?

· What are the solutions?

· End-to-end scalable data analysis using R + Spark with large data sets (using R code)

· Predictive modelling built on large data sets with R + Spark (using R code)

Hope to see you at the meetup.

Organizers’ Committee