Twitter: @NycDataSci (https://twitter.com/NycDataSci)
Check out NYC Data Science Academy (http://nycdatascience.com/)'s latest offering!
• 6 week Hadoop and spark workday night class (http://nycdatascience.com/courses/big-data-with-hadoop-and-spark/) starts on Aug 11th.
• 6 week full time Data Engineering bootcamp (http://nycdatascience.com/data-engineering-bootcamp/) starts on Aug 24th.
• 12 week full time Data Science Bootcamp (http://nycdatascience.com/data-science-bootcamp/) starts on Sept 21st.
Topic: Pierre will speak about the Medicare open data and its potential applications. In particular, we will have a look at outliers and fraud as well as the impact of companies payments on doctor prescriptions. The whole data wrangling and modellig will be done using sql and python.
Speaker Bio: Pierre Gutierrez is a senior Data Scientist at Dataiku (www.dataiku.com (http://www.dataiku.com/)). He has experience in several topics such as fraud detection, predictive maintenance, recommender systems, smart cities or healthcare.
Schedule: Door opens at 6:00pm. We serve pizza and beer from 6:00-6:30pm. Event starts on 6:30pm.
Pre requites : You should have a very basic understanding of SQL and Python or R.
Preparation : Please have a running postgresql db running and a R/Python environment. Ipython notebook or jupyter ( https://jupyter.org/ ) would be perfect. It would be better to have a basic understanding of the medicare data available here : http://www.cms.gov/OpenPayments/Explore-the-Data/Dataset-Downloads.html Looking at this video might also help : https://www.youtube.com/watch?v=mtlYQIDLdc8