[NYCML Virtual Meetup] Determined: open-source deep learning training platform

Hosted By:
Max Kesin

Details

Speaker Bio:
Neil Conway is co-founder and CTO of Determined AI, a startup that builds software to dramatically accelerate deep learning model development. Neil was previously a technical lead at Mesosphere and a major developer of both Apache Mesos and PostgreSQL. Neil holds a PhD in Computer Science from UC Berkeley, where he did research on large-scale data management, distributed systems, and programming languages.

Abstract:
While deep learning (DL) has enormous potential, building DL-powered applications remains difficult, expensive, and time-consuming for most companies. A major cause is that deep learning engineers are forced to spend most of their time on DevOps and writing boilerplate code for common tasks like multi-GPU training and fault tolerance, rather than building better models.

Determined is an open-source deep learning training platform that helps teams develop models more quickly, share GPUs easily, and collaborate more effectively. You can think of Determined as a platform that bridges the gap between tools like TensorFlow and PyTorch, which work well for a single researcher with a single GPU, and the challenges that arise when doing deep learning at scale.

This talk will include an overview of the problems that Determined aims to solve, the high-level architecture of the system, and a live demo. We’ll also dive deep into some key technical features, such as:

- Distributed training without changing your model code
- Intelligent hyperparameter search
- Flexible GPU scheduling, including management of cloud GPU instances
- Built-in experiment tracking and visualization
- Automatic fault tolerance and checkpoint management

Meeting Details:
Register in advance
https://zoom.us/webinar/register/WN_iA_6TPVFSaep-n63FLgx7A

Or join from an H.323/SIP room system:
H.323:
162.255.37.11 (US West)
162.255.36.11 (US East)
115.114.131.7 (India Mumbai)
115.114.115.7 (India Hyderabad)
213.19.144.110 (EMEA)
103.122.166.55 (Australia)
209.9.211.110 (Hong Kong SAR)
64.211.144.160 (Brazil)
69.174.57.160 (Canada)
207.226.132.110 (Japan)
Meeting ID: 961 5631 9061
SIP: 96156319061@zoomcrc.com

After registering, you will receive a confirmation email containing information about joining the webinar.

NYC Machine Learning