Data Scientist Workshop Using Python


This afternoon of talks will cover some basic Python libraries for data science. The session is part of the data science weekend, which also includes the R bootcamps and the theoretical, R-oriented machine learning session on Sunday morning. The Python topics covered here are stand-alone modules, chosen to both complement and extend the prior sessions.


Prerequisites: Basic Python knowledge is ideal, but if you're just interested in seeing what Python can do for you, it's a pretty easy language to read. If you want to install the tools and try to follow along, the simplest method is to install the free Enthought distribution, which will solve many of the installation problems you might otherwise have with the necessary libraries:

http://www.enthought.com/products/epd_free.php

You will then want to install pandas, scikit-learn, statsmodels, and patsy (links below). If you want to go it alone, you should at a minimum get Python 2.7 and then install numpy, IPython, matplotlib, and scipy, followed by the other libraries.
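If you want to confirm that your setup is complete before the session, a quick check along these lines may help. This is only a sketch: the `find_missing` helper and the `REQUIRED` list are made up for illustration, and the script assumes a Python 3 interpreter (it uses `importlib.util`, which is not available in Python 2.7).

```python
import importlib.util

# Import names of the libraries mentioned above.
# Note that scikit-learn is imported under the name "sklearn".
REQUIRED = [
    "numpy", "scipy", "matplotlib", "IPython",
    "pandas", "sklearn", "statsmodels", "patsy",
]


def find_missing(modules):
    """Return the module names that cannot be found on this system (hypothetical helper)."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


missing = find_missing(REQUIRED)
if missing:
    print("Still to install:", ", ".join(missing))
else:
    print("All workshop libraries were found.")
```

Anything reported as missing still needs to be installed, either through your distribution or separately.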


[masked]  Intro to IPython Notebook & Pandas (Lynn Cherny)

The IPython notebook is a browser-based workspace for exploring and recording your Python work. Because the notebook is an increasingly popular way to share Python code demos, Lynn will walk you through the basics (including how to view a notebook if someone sends you one), and then move on to an introduction and demo of the pandas library (http://pandas.pydata.org/). Pandas (built on numpy) provides a convenient data-frame environment for manipulating data in Python, which eases the transition for R users. The popular new book by Wes McKinney, Python for Data Analysis, uses pandas as its primary tool.
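To give a flavor of the data-frame style, here is a minimal sketch that is not taken from the talk itself. The column names and values are invented for illustration, and a reasonably recent pandas on Python 3 is assumed.

```python
import pandas as pd

# Toy data, invented purely for illustration.
df = pd.DataFrame({
    "group": ["a", "a", "b", "b"],
    "value": [1.0, 3.0, 2.0, 6.0],
})

# Select rows with a boolean mask, much like subsetting a data frame in R.
large = df[df["value"] > 1.5]

# Split by group and compute the mean of each group.
group_means = df.groupby("group")["value"].mean()

print(large)
print(group_means)
```

R users may recognize the pattern: the boolean mask plays the role of logical subsetting, and `groupby` followed by `mean` is roughly what `aggregate` does.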


[masked] Intro to Statsmodels and Patsy (Thomas Wiecki)

Statsmodels (http://statsmodels.sourceforge.net/) and Patsy (http://patsy.readthedocs.org/en/latest/overview.html) allow an R-style description of models in Python, and support a growing number of basic statistical models, from GLMs to time series and discrete choice methods. Thomas will illustrate the basics of these tools and their capabilities.
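As a rough illustration of what an R-style model description looks like, the sketch below fits an ordinary least squares model from a formula string. It is not material from the talk: the data is synthetic (a line with slope 3 and intercept 2 plus noise), the variable names are invented, and a recent statsmodels with its formula interface on Python 3 is assumed.

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Synthetic data, invented for illustration: y = 2 + 3*x + noise.
rng = np.random.RandomState(0)
x = rng.uniform(0, 10, size=50)
y = 2.0 + 3.0 * x + rng.normal(scale=0.5, size=50)
df = pd.DataFrame({"x": x, "y": y})

# The formula string is parsed by patsy, much like lm(y ~ x) in R.
model = smf.ols("y ~ x", data=df).fit()

# Estimated coefficients, indexed by term name ("Intercept" and "x").
print(model.params)
```

The fitted coefficients should land close to the values used to generate the data, and `model.summary()` prints a regression table similar to the output of `summary(lm(...))` in R.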


3.30 - 4.30 Intro to Scikit-Learn (Ryan Feather)

The workhorse tool for machine learning in Python is scikit-learn (http://scikit-learn.org/stable), an actively developed and rich collection of machine learning algorithms (with excellent documentation!). Ryan will illustrate the basics of the scikit-learn interface to these algorithms, and show you applications of supervised and unsupervised techniques, including Random Forests.
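The fit-and-predict pattern shared by scikit-learn estimators can be sketched as follows. This is an illustrative example rather than the presenter's material: it assumes a recent scikit-learn on Python 3, uses the bundled iris dataset, and the parameter values are arbitrary choices.

```python
from sklearn.cluster import KMeans
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier

# Bundled example dataset: 150 flowers, 4 features, 3 species.
X, y = load_iris(return_X_y=True)

# Supervised: a Random Forest classifier learns from the labels y.
clf = RandomForestClassifier(n_estimators=50, random_state=0)
clf.fit(X, y)
train_acc = clf.score(X, y)  # accuracy on the training data only

# Unsupervised: k-means groups the same rows without seeing the labels.
km = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = km.fit_predict(X)

print("Random Forest training accuracy:", train_acc)
print("Cluster sizes:", [int((labels == k).sum()) for k in range(3)])
```

Training accuracy is an optimistic figure; for an honest estimate you would hold out a test set or use cross-validation.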

 

 


  • Ming S.

    The presenters did a wonderful job! The examples were very good and easy to follow. A great idea to use the notebook for this tutorial... fun and interactive. Kudos to the organizer John for making this learning event possible!

    December 7, 2012

  • Philip P.

    Good material. A bit too much time was wasted on getting environments set up.

    December 4, 2012

  • Jim T.

    The second presenter wasn't as prepared as I would have expected.

    December 2, 2012

  • ju t.

    It is great for beginners! I got a lot of useful material. But the material is more suitable for self-learning. It was hard to see the screen and hear the speaker.

    December 2, 2012

  • Raymond K.

    great

    December 2, 2012

  • Lynn C.

    If anyone has any difficulties installing, here is another popular option for Mac OS X builds: http://fonnesbeck.github.com/Sc...

    1 · November 24, 2012

    • Jim T.

      Worked like a charm, thanks!

      December 2, 2012

  • Guru

    How do I get on the WL?

    November 26, 2012

  • Shyam S.

    How do I get on the waiting list?

    November 26, 2012

  • Sid

    I am attending the morning session but was too late for this one. Will the waiting list be enabled again?

    November 25, 2012

Your organizer's refund policy for Data Scientist Workshop Using Python

Refunds offered if:

  • the Meetup is cancelled
  • the Meetup is rescheduled
  • you can cancel at least 4 day(s) before the Meetup

Payments you make go to the organizer, not to Meetup. You must make refund requests to the organizer.
