Season 7 Episode 1 - Tools for Data Scientists

Netflix Open Source Platform
Netflix Open Source Platform
Public group

Netflix

131 Albright Way Los Gatos, CA 95032 · Los Gatos, CA

How to find us

Parking is around back including a parking garage.

Location image of event venue

Details

Join us for the next installment of Netflix Open Source meetups. We will be covering Tools for Data Scientists at the first meetup of 2020!

6:00-7:00 pm Registration/Food/Networking

7:00-8:00 pm Presentation in the Netflix Theater

Welcome (Faisal Siddiqi)

Metaflow (Ville Tuulos)
Data scientists at Netflix are expected to develop and operate large machine learning workflows autonomously. However, we do not expect that all our scientists are deeply experienced with distributed systems and data engineering. Metaflow was created to make it delightfully easy to build and operate ML workflows in the cloud using idiomatic Python and off-the-shelf ML libraries, covering the whole lifecycle of an ML project from prototype to production.

Polynote (Jeremy Smith)
Polynote is a new notebook tool we created from scratch to address some of the pain points we've run into while using Scala in machine-learning notebooks at Netflix. It provides essential code editing features other tools lack like interactive auto-completes, support for mixing multiple languages and sharing data between them within a single notebook, and encourages reproducible notebooks with its immutable data model.

Papermill (Matthew Seal)
Nteract is an open source organization under which there are several libraries and applications that Netflix and many other companies and individuals contribute to. One of these libraries is Papermill, a library used to programmatically parameterize and execute Jupyter Notebooks. Papermill provides a CLI and Python interface that we'll explore during the session to see how it can be used and what value it adds. Using this pattern we'll also briefly talk about how we've integrated papermill at Netflix and how it interfaces with other Jupyter and nteract services.

8:00-9:00 Demo Stations/Networking