PyData Cambridge - 37th Meetup


Details
We are happy to announce the 37th PyData Cambridge meetup!
Agenda
18:45 - Doors open (Please do not arrive earlier)
19:00 - Introduction
19:15 - Setting up a Repository Template for your Team with Consistency, Reproducibility, and Compliance in Mind (Gabriel Harris, NielsenIQ)
19:50 - Interval
20:15 - Finding Data Science in the Microbiome (Michael Bateman, Arthur D. Little)
20:50 - End (Pub - Old Ticket Office, Station Square)
Title: Setting up a Repository Template for your Team with Consistency, Reproducibility, and Compliance in Mind
Abstract: In this talk, Gabriel presents the repository template he implemented at his team this year. The template uses:
- “Poetry” to manage virtual environments and dependencies
- “Make” to automate processes
- “pre-commit-hooks” to check code compliance with a number of conventions
Gabriel also discusses some of the packages the template uses, like black, isort, flake8, pydocstyle and few more and how these can also be configured in vscode
Bio: Gabriel Harris, PhD is a Senior Data Scientist and a Technical Lead at NielsenIQ Brandbank, one of the world’s most trusted providers of digital product content to brands, retailers and wholesalers across the globe. Gabriel earned his PhD from King’s College London in Medical Imaging Sciences in 2015. As the story usually goes, he found himself doing “MlOps” out of necessity and found it very challenging, yet interesting. He is also focused on bringing rigour to the usually messy data science practices
Title: Finding Data Science in the Microbiome
Abstract:
Recent years have seen a remarkable growth in research and understanding of the human gut microbiome, and specifically its connection to health and disease. Recent years have also seen a remarkable growth in AI-powered precision medicine.
In this talk I will discuss my experience as a data scientist working at the intersection of precision medicine and the gut microbiome. I will start with a high-level discussion of a few medical applications of microbiome science and show how these can be tackled with a combination of sophisticated labwork, metagenomics, and data science. I will then highlight precisely how the precision medicine questions translate into a binary classification problem that data scientists are familiar with.
Bio: Michael holds a PhD in pure mathematics and worked in the mathematics departments at UCLA and Cambridge, specializing in Fourier analysis and combinatorics. He then worked on bacterial genomics for about five years both in academia and in the biotech industry. More recently he worked for the UK government as part of the Covid response. Michael currently works as a data scientist and consultant at Arthur D. Little, one of the oldest consulting firms.
Code of Conduct
PyData is dedicated to providing a harassment-free event experience for everyone, regardless of gender, sexual orientation, gender identity, and expression, disability, physical appearance, body size, race, or religion. We do not tolerate harassment of participants in any form.
The PyData Code of Conduct governs this meetup. ( http://pydata.org/code-of-conduct.html ) To discuss any issues or concerns relating to the code of conduct or the behaviour of anyone at a PyData meetup, please contact NumFOCUS Executive Director Leah Silen (leah@numfocus.org) or organizers.

Sponsors
PyData Cambridge - 37th Meetup