Scikit-learn Sprint (to contribute to open source)


Details
AGENDA
9:30am - 10:00am: arrive early for technical support
10:00am - 1:00pm: Sprint
1:00pm - 2:00pm: Lunch will be provided
2:00pm - 4:00pm: Sprint
-------------------------
Code of Conduct
WiMLDS is dedicated to providing a harassment-free experience for everyone. We do not tolerate harassment of participants in any form. All communication should be appropriate for a professional audience including people of many different backgrounds. Sexual language and imagery is not appropriate.
All attendees should read the full Code of Conduct before participating: https://github.com/WiMLDS/starter-kit/wiki/Code-of-conduct
-------------------------
DESCRIPTION
We would like to increase the number of women in open source, particularly for the Python machine learning library scikit-learn. The scikit-learn repository (https://github.com/scikit-learn/scikit-learn) has over 1000 issues open. By organizing and offering this workshop, we hope to increase women’s participation in open source as well as advance the scikit-learn library.
The plan is to work in pairs. The goal is that each participant will be able to resolve one trivial fix and one actual fix.
-------------------------
GitHub Repo
https://github.com/WiMLDS/scikit-sprint-nyc-2018
-------------------------
PREPARATION
We will review the basics of Git at the beginning of workshop so attendees are able to submit pull requests.
- Hardware
- bring your laptop & charger
- Software
- have Python installed via Anaconda. (Anaconda includes Jupyter Notebook)
- Python / R
- be comfortable with Python
- familiarity with Jupyter Notebook
R users are also welcome and able to contribute:
- your math knowledge
- help with documentation
- good idea to look at sphinx and ReST
http://www.sphinx-doc.org/en/master/
http://docutils.sourceforge.net/docs/user/rst/quickstart.html
- Git
- Git should be installed
- should have a GitHub account (save your password where you can find it)
- some familiarity with Git / GitHub
- review some Git resources prior to event
- we'll go over pull requests at beginning of event
-------------------------
Scikit-learn
- Read thru "Contributing" documentation
- http://scikit-learn.org/stable/developers/contributing.html
- it is approximately 16 pages
- Review Open Issues
go through some Issues and become familiar with them
https://github.com/scikit-learn/scikit-learn/issues
-------------------------
Instructor Bio
This event will be led by Andreas Mueller. Andreas is a Lecturer in Data Science at Columbia University. Previously he worked as a Machine Learning Scientist at Amazon, working on computer vision and forecasting problems. He is one of the core developers of the scikit-learn machine learning library, and has maintained it for several years.
His mission is to create open tools to lower the barrier of entry for machine learning applications, promote reproducible science and democratize the access to high-quality machine learning algorithms.
Andreas’ twitter: @amuellerml
Andreas’ github: @amueller

Scikit-learn Sprint (to contribute to open source)