Feature engineering refers to the process of visualizing and exploring data to find, and sometimes create, useful features from existing data. A large share of a modeling project's time should be spent on feature engineering: once good features are in place, most off-the-shelf predictive modeling algorithms can deliver decent performance.
In this talk, we will walk through the steps involved in building a predictive model for a classification problem. Here is an overview:
We will be using the Titanic data set for this tutorial. Details here: https://www.kaggle.com/c/titanic-gettingStarted
We will be using R and Azure Machine Learning Studio for this tutorial.
Exploration and Visualization:
• Getting familiar: Sampling and eyeballing data
• Understanding class distribution: Pie charts in R
• Understanding feature values and distributions: Histograms, density plots, box-and-whisker plots, violin plots, and scatter plots in R
• Feature processing: Missing values, creating more features, reducing dimensionality
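The exploration and feature-processing steps above can be sketched in R as follows. This is a minimal illustration, not the talk's actual code; it assumes the competition's train.csv has been downloaded to the working directory, and uses that file's column names (Survived, Age, Fare, Pclass, SibSp, Parch).

```r
# Load the Kaggle Titanic training data (assumes train.csv is in the
# working directory, downloaded from the competition page)
titanic <- read.csv("train.csv", stringsAsFactors = FALSE)

# Getting familiar: eyeball a few rows and the overall structure
head(titanic)
str(titanic)
summary(titanic)

# Class distribution: how many passengers survived?
pie(table(titanic$Survived), labels = c("Died", "Survived"),
    main = "Class distribution")

# Feature distributions
hist(titanic$Age, main = "Age distribution", xlab = "Age")
boxplot(Fare ~ Pclass, data = titanic,
        main = "Fare by passenger class")

# Missing values: impute missing Age with the median (one simple option)
titanic$Age[is.na(titanic$Age)] <- median(titanic$Age, na.rm = TRUE)

# Creating more features: family size aboard
titanic$FamilySize <- titanic$SibSp + titanic$Parch + 1
```

Median imputation and the FamilySize feature are just examples of the kinds of processing the talk will cover; other imputation strategies and derived features are possible.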
Building A Predictive Model:
We will build a predictive model using the randomForest R package. We will look at training error, variable importance, and various metrics for classifier evaluation.
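A minimal sketch of that model-building step, assuming a cleaned titanic data frame like the one produced in the exploration step (the chosen formula and ntree value are illustrative, not the talk's actual settings):

```r
library(randomForest)

# Load and lightly prepare the data so the block is self-contained
titanic <- read.csv("train.csv", stringsAsFactors = FALSE)
titanic$Age[is.na(titanic$Age)] <- median(titanic$Age, na.rm = TRUE)
titanic$FamilySize <- titanic$SibSp + titanic$Parch + 1
titanic$Survived <- factor(titanic$Survived)   # classification, not regression
titanic$Sex <- factor(titanic$Sex)

# Fit a random forest on a handful of features
fit <- randomForest(Survived ~ Pclass + Sex + Age + Fare + FamilySize,
                    data = titanic, ntree = 500, importance = TRUE)

# Out-of-bag error estimate and confusion matrix
print(fit)

# Which features mattered most?
importance(fit)
varImpPlot(fit)
```

The out-of-bag estimate printed by `print(fit)` is random forest's built-in stand-in for a held-out test error, which is one reason the package is convenient for a tutorial setting.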
Azure Machine Learning Studio Demo:
Finally, we will show how the whole workflow can be built in Azure Machine Learning Studio.
I am trying to get someone from the Microsoft product group that worked on Azure ML Studio to come and give the demo. I will update everyone either way.