Saturday Tutorial: Data Analysis with Sarah and Python


Details
Sarah is presenting this tutorial at OSCON, and is first presenting it for you! http://www.oscon.com/oscon2014/public/schedule/detail/34254
Hands-On Data Analysis with Python
Python is quickly becoming the go-to language for data analysis. However, there are so many tools out there that it can be difficult to figure out which ones are useful. In this workshop, I’ll give you an in-depth look at some of the best tools for data wrangling, machine learning, and data visualization. You’ll learn strategies for working with data, how to structure a data analysis workflow, and which tools are appropriate for handling different kinds of data. You’ll leave with a good understanding of different data analysis techniques in Python.
Using Pandas, Scikit-Learn, and matplotlib, we’ll work through a data analysis workflow from start to finish, and we’ll cover the following data analysis problems:
• Data preprocessing and data wrangling with Pandas
• Using Scikit-Learn for machine learning
• Visualizing our results with matplotlib
Required materials
Pandas, Scikit-Learn, and matplotlib are the tools we’ll be working with in Python. They can easily be installed with a distribution (such as Anaconda). Please have them installed prior to the tutorial. I’ll post all of the tutorial materials to my Github (https://github.com/sarguido) account, so having a Github account would be helpful.

Canceled
Saturday Tutorial: Data Analysis with Sarah and Python