Skip to content

2024-11: Parallel Python for Data Processing

Photo of Graham Doerksen
Hosted By
Graham D.
2024-11: Parallel Python for Data Processing

Details

Is your Python data processing feeling sluggish? Learn how to turbocharge it with parallel programming techniques! Presented by Nathan Bryans, Senior Machine Learning Engineer at Coursera.

Where: Platform Calgary, East Annex
When: Wednesday, November 27th, at 5:30pm

Parallel Python for Data Processing: Embracing the Embarrassingly Parallel
This talk focuses on embarrassingly parallel problems — those that are ripe for massive speedups. We'll explore methods like shell scripting, Python's multiprocessing, PySpark, and Dask, examining their strengths and tradeoffs. Through real-world examples and code demonstrations, you'll gain the knowledge to choose the right approach and accelerate your data workflows, whether you're a seasoned data scientist or a Python newbie.

Schedule:
5:30 - Food and Networking
6:00 - Presentation and Discussion
7:30 - Wrap up

Bio:
Nathan is a Software and AI specialist with over a decade of experience bringing innovative solutions to life for companies like Coursera, Oracle, and ATB Financial. He excels at designing, building, and operating complex ML solutions in the cloud to address real-world business challenges. He has also had the privilege of mentoring and leading teams in this space, and is passionate to share his insights and practical learnings on the software engineering side of Data and AI.

Photo of PyData Calgary group
PyData Calgary
See more events