The unsexy part of data science: data munging

Name: The unsexy part of data science: data munging
Start: 2013-11-14T18:30:00-06:00
End: 2013-11-14T21:30:00-06:00
Location: General Assembly LA

Hosted By

Szilard P. and Gordon

Details

Real-world data is usually dirty, messy, and full of errors. To get reasonable results from your statistical modeling, you'll need to explore the data, clean it up, transform it, and prepare it for modeling. Data scientists commonly spend 80% of their time on this tedious task of data munging. But how do you do it?

In this meetup we'll have 5 short talks of 10-15 minutes each on data munging. We'll then open the floor for Q&A to address further issues.

Talks:

Szilard Pafka: Intro and overview
Yasmin Lucero: Munging date-times in R: tools, tricks, gotchas
Daniel Gutierrez: Data Munging: the Good, the Bad and the Ugly
Neal Fultz: Tidy Data, Facts & Rules for R
Eric Klusman: Plyr for split-apply-combine

Timeline:

6:00pm food/drinks and networking
7:00pm talks start promptly

Please arrive by 6:55pm at the latest.

Please RSVP as places are limited.

Venue: General Assembly ( http://generalassemb.ly ) will kindly host this meetup. No parking is provided. Q ( http://www.qconnects.com/ ) will kindly sponsor and provide the food and drinks.
