addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwchatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgoogleimageimagesinstagramlinklocation-pinmagnifying-glassmailminusmoremuplabelShape 3 + Rectangle 1outlookpersonplusprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

Data Science by R programming(Beginner­ level, Five Sundays) 001

Update:

our Dec 1st(Sunday, thanksgiving weekend) will be held as planned.

If you can't make thanksgiving weekend class, you can choose from

--attend make-up session on Dec 7th(Sat)

--watch the video from home and take the make-up session in Feb, 2014.

----------------------------------------------------------------------------


Date: Nov 10th, Nov 17th, Nov 24th, Dec 1st, Dec 8th (Five Sundays)

Time: 12:00pm to 4pm

Instructors: 
Scott Kostyshak (Data Scientist @ Supstat Inc, 5th year Econ PhD at Princeton Univ.)

Vivian Zhang (CTO @Supstat Inc, Master degrees in Computer Science and Statistics)

Slides contributor:

Kai Xiao(Data Scientist @ SupStat Inc), Scott Kostyshak, Vivian Zhang

Special Thank:

We thank Ramnath Vaidyanathan(Advisory Data Scientist @ SupStat Inc, professor at McGill University), Joe Cheng(Software Engineer @ Rstudio), Josh Paulson(product manager @ Rstudio) for suggesting us a few Stunning R showcases.

Cost:

Individual: $110/class

For group(5 or more persons) and enterprise pricing, please email [masked]

Course Outline:

(Content may be adjusted based on the real teaching condition)

Basics 6 hours
Abstract: explain the basic operation of knowledge through this unit of study , students can learn the characteristics of R , resource acquisition mode , and mastery of basic programming
Case and Exercise: Using the R language completion of certain Euler Project (euler project)

* How to learn R
* How to get help
* R language resources and books
* RStudio
* Expansion Pack
* Workspace
* Custom Startup Items
* Batch Mode
* Data Objects
* Custom Functions
* Control statements
* Vectorized operations

Data for two hours

Abstract:  explain the various ways the R language read data , the participants through the basic WEB knowledge of web crawling , connect to the database via sql statement calling data from a variety of local read excel file data .
Case studies and exercises: crawl watercress data on the site , write a custom function .

* Web data capture
* API data source
* Connect to the database
* Local Documentation
* Other data sources
* Data Export

Data collation 3 hours

Abstract: how to manipulate the data use R for the all kinds of data conversion, especially for string operation processing .
Case studies and exercises : Find the QQ(the most used instant messager tool) group , then discuss research options with text features.

* Data sorting
* Merge Data
* Summary data
* Remodeling Data
* Take a subset of data
* String manipulation
* Date Actions

Data Visualization 3 hours

Abstract: cover two advanced drawing package , lattice and ggplot2, understand the various methods of visualization to explore.
Case and Exercise: Using graphics to right before the movie , text and other data to describe

* Histogram
* Point
* Column
* Line
* Pie
* Box Plot
* Scatter
* Matrix related
* Map

Elementary statistical methods 5 hours
Abstract: The primary explanation to use R for statistical analysis , regression analysis, students can master the basic statistical significance and role model.
Case and Exercise: Using regression to predict commodity prices ; simulated casino game winner.

* Descriptive Statistics
* Statistical Distributions
* Frequency and contingency tables
* Correlation
* T test
* Non-parametric statistics
* Linear Regression
* Regression Diagnostics
* Robust Regression
* Nonlinear regression
* Principal Component Analysis
* Logistic Regression
* Statistical Simulation

Preliminary data mining ( Selected Topics )

Abstract: explain the R language for data mining expansion pack and functions use , students can master the supervised learning and unsupervised learning two mining methods .
Case and Exercise: Use R to participate in Kaggle Data Mining Competition
* General Mining Process
* Rattle bag
* Hierarchical clustering
* K -means clustering
* Decision Trees
* BP neural network

Join or login to comment.

  • Shangxuan Vivian Z.

    Shiny meetup is tonight, check out the slide from
    http://nycdatascience.com/shiny_intro/index.html#1 Hope to see you around.

    March 24, 2014

  • Elise P.

    Hi would anyone be interested in having a Wednesday evening study session on week 2 material?

    November 18, 2013

    • Elise P.

      Hi Carlos, something came up for Thursday - can we do Saturday morning? I've finished all class exercises up until Week 2 Case problems (Euler) & Berkeley data set which i'm working on now. I think there will be more handouts tonight.

      November 20, 2013

    • Carlos F.

      Unfortunately, I have a different class from 9 - 4 on Saturdays.

      November 20, 2013

  • Shangxuan Vivian Z.

    piazza class page is announced, https://piazza.com/class/hnw42yyrw214ho?cid=4

    All the students and instructors should have received invitation to join the class.

    November 11, 2013

  • Krish S.

    I would love to take these sessions, considering I already missed the 1st session. It's just painful to travel to NYC from NJ.
    Is it possible to get recordings for all 5 sessions? $$?

    November 10, 2013

  • Aaron G.

    My preference would be to extend the class one week if you decide to skip Thanksgiving weekend rather than meet Saturday and Sunday.

    November 9, 2013

    • Aaron G.

      My first choice is to keep the schedule as originally planned. I agree with Carlos.

      November 9, 2013

    • Shangxuan Vivian Z.

      Aaron, we are going to keep the schedule as originally planned. See you tmr.

      November 9, 2013

  • Punj

    I'll be traveling the last weekend. Can I make it up another time?

    November 7, 2013

    • Shangxuan Vivian Z.

      We will record it. You can watch the class at home and take the rest 4 classes.

      November 7, 2013

  • David R.

    Do we receive some kind of certificate of completion?

    November 7, 2013

    • Shangxuan Vivian Z.

      I would be happy to make a very pretty one. much better than Coursera cert. :)

      November 7, 2013

    • Shangxuan Vivian Z.

      And this class is not one shot effort. I believe it is difficult to learning new things and applying to work. So we will have a few follow-up sessions to do more hands-on projects. We have planned San Francisco Restaurant rating project and citibike usage visualization project. examples can be found http://glimmer.rstudi...­

      November 7, 2013

  • Maria

    When do we pay?

    November 6, 2013

    • Shangxuan Vivian Z.

      You can RSVP $1 here through meetup page and paypal us at [masked](Sup­Stat Inc) the rest.

      November 6, 2013

    • Maria

      Thanks will do!!!

      November 6, 2013

  • John Van A.

    Looking forward to it!

    November 4, 2013

  • Yu Lin C.

    What are some preliminary knowledge required for this class?

    November 2, 2013

    • Shangxuan Vivian Z.

      If you have background in other languages, that will help. No programming background is required. We take interactive learning approach and make many hands-on projects for you to learn.

      November 2, 2013

20 went

Your organizer's refund policy for Data Science by R programming(Beginner level, Five Sundays) 001

Refunds are not offered for this Meetup.

Our Sponsors

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy