We'll get together to drill through interesting data sets. Work alone, in pairs, or in teams. Drop by anytime between 1 pm and 4 pm!
The goal: explore the data and do interesting analysis. Learn about the tools and techniques other people are using. This isn't a hackathon. No judging and prizes. Just fun.
You do need to bring a laptop unless you just want to be an adviser/spectator.
We'll have pizza, tea, and coffee in case people want to snack.
Challenge #1: Predicting tech startup success!
We'll use a dataset we've derived from CrunchBase to study how the characteristics of company founders relate to the success of their companies, include age, birthplace and education.
(For inspiration and some laughs, see this question on Quora: http://b.qr.ae/1htlvBB)
Challenge #2: Predicting airplane flight delays
We'll use a data set of all airplane flights in the U.S. in 2008 to predict flight delays.
The data sets are now available in the nycDataWranglers/Event2 folder at this link: