Gensim. ALL LEVELS

Details
About Gensim
Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Its target audience is the natural language processing (NLP) and information retrieval (IR) communities.
It is an impressive project that implements popular algorithms such as word2vec and Latent Dirichlet Allocation with great performance.
See the project page here: https://github.com/RaRe-Technologies/gensim
About the sprint
We'll work on a range of tickets, from easy docstring improvements to more complex issues for people already familiar with the project.
If you're not familiar with Gensim and don't already know which ticket you want to work on, this is what Ivan (Gensim maintainer) suggests: https://gist.github.com/menshikh-iv/075824fdc4906a3b1b74b8f94ce64b59
Gitter channel for the event: https://gitter.im/py-sprints/gensim
Our sponsor
Thanks to Touch Surgery (https://www.touchsurgery.com/jobs.html) for providing the venue, as well as the pizza and drinks for the night.
Set up instructions
They are available in Gensim's CONTRIBUTING file:
https://github.com/RaRe-Technologies/gensim/blob/develop/CONTRIBUTING.md
