We are delighted to welcome you to our meetup in September! We have two speakers tonight: Myles Mitchell from Jumping Rivers and Professor Roy Ruddle from the Leeds Institute for Data Analytics.
If you would like to volunteer as a speaker, please reach out to us at lds@jumpingrivers.com.
Schedule
18:00--18:45h: Refreshments (food served on a first-come, first-served basis)
18:45--18:50h: Welcome
18:50--19:25h: Myles Mitchell @ Jumping Rivers
Demystifying MLOps
MLOps is an approach to data science where the use of trained models in production is a key focus. A model can change in many ways over the course of a project: incorporating additional training data, changing the model structure, or simply retraining an existing model will all cause its outputs to change. It is therefore critical to keep track of the different versions of a model, and to make it easy to deploy new versions and roll back to earlier ones. The performance of a deployed model may also start to wane over time, so it needs to be monitored. In this talk, Myles will discuss some open-source tools he has been investigating that simplify MLOps in practice.
19:25--20:00h: Professor Roy Ruddle @ Leeds Institute for Data Analytics (LIDA)
How should I investigate data quality?
Did you know that there are more than 100 ways in which data can be of “low” quality? That is one reason why data preparation often takes more than half of a data science project’s time. In this talk, Roy will show you how to investigate data quality in an efficient yet rigorous manner. He will demonstrate the approach with an openly available six-step workflow (https://doi.org/10.5518/1481) and associated Python package (https://pypi.org/project/vizdataquality/).
Bio 1: Myles Mitchell
Myles holds a PhD in Physics and works as a Principal Data Scientist at Jumping Rivers. With over a decade of experience in Python programming, he likes to apply himself to projects ranging from predictive analytics to software development. Keen to share his expertise, he enjoys teaching courses in beginner programming, database management, machine learning and more. When he’s not staring at computers, he enjoys running, hiking and anything else outdoors!
Bio 2: Professor Roy Ruddle
Roy is a Professor of Computing and LIDA’s Director of Research Technology. He has worked in both industry and academia, and is an expert in data visualisation and data quality. Major outputs from his work include open-source Python packages, the Leeds Virtual Microscope (commercialised by the healthcare company Roche) and Petriva (a spin-out company that provides specialist visual data analysis and data mining software).
News and Announcements
Have a news item or announcement you'd like to make about upcoming data events or job opportunities in Leeds? Comment below or contact us directly (lds@jumpingrivers.com) and we'll do our best to circulate this information at the end of the session.
Please contact the organisers if you would like to volunteer as a speaker for future events.