Skip to content

Details

Title: Data Science Workflows Using Docker Containers

Speaker: Aly Sivji

Containerization technologies such as Docker enable software to run across various computing environments. Data Science requires auditable workflows where we can easily share and reproduce results. Docker is a useful tool that we can use to package libraries, code, and data into a single image. This talk will cover the basics of Docker; discuss how containers fit into Data Science workflows; and provide a quick-start guide that can be used as a template to create a shareable Docker image! Learn how to leverage the power of Docker without having to worry about the underlying details of the technology.

We will alternate between slides and working through examples in the terminal. This talk is geared towards an intermediate audience, but there should be enough of a ramp up for beginners and some material for Docker experts. Although this session is geared towards data scientists, the underlying concepts have many use cases (come find me after to discuss).

About Aly:

Aly Sivji is a Mathematician / Software Engineer in the Health Tech startup space and a part-time grad student at Northwestern University studying Medical Informatics. He is passionate about Python, cycling, and improving healthcare delivery models using information technology.

Related topics

You may also like