Skip to content

[Live Webinar] Data Science and Machine Learning At Scale

Photo of Reshama Shaikh
Hosted By
Reshama S.
[Live Webinar] Data Science and Machine Learning At Scale

Details

Details

## Register for webinar here:

https://www.bigmarker.com/neo4j/Data-Umbrella-Webinar3-2020-10-27-05-00-pm

## Time

3pm PDT / 6pm EDT

## Talk Level

Intermediate

## Pre-reqs

Not a lot. It would help if you knew

  • programming fundamentals and the basics of the Python programming language (e.g., variables, for loops);
  • a bit about `pandas`, `numpy`, and `scikit-learn` (although not strictly necessary);
  • a bit about Jupyter Notebooks;
  • your way around the terminal/shell.

However, I have always found that the most important and beneficial prerequisite is a will to learn new things so if you have this quality, you'll definitely get something out of this workshop.

## Prep Work

See GitHub repo: https://github.com/coiled/data-science-at-scale

## Event

In this tutorial, you'll learn everything you wanted to know about scaling your data science work to larger datasets and larger models, while staying in the comfort of the PyData ecosystem (numpy, pandas, scikit-learn, Jupyter notebooks).

You'll come away knowing

  • How to reason about when you need to scale your data and machine learning work and when to not;
  • How to leverage distribute computation on your local workstation (such as your laptop) to analyze larger datasets and build larger, more complex models;
  • How to harness the power of clusters to support larger-than-memory computation, all from the comfort of your own laptop;
  • How to do all of this while writing code similar to the numpy, pandas, and/or sckit-learn code you already write.

## Speakers

Hugo Bowne-Anderson is Head of Data Science Evangelism and Marketing at Coiled, a company that makes it simple for organizations to scale their data science seamlessly. He has extensive experience as a data scientist, educator, evangelist, content marketer, and data strategy consultant at DataCamp, the online education platform for all things data. He also has experience teaching basic to advanced data science topics at institutions such as Yale University and Cold Spring Harbor Laboratory, conferences such as SciPy, PyCon, and ODSC and with organizations such as Data Carpentry. He has developed over 30 courses on the DataCamp platform, impacting over 500,000 learners worldwide through his own courses. He also created the weekly data industry podcast DataFramed, which he hosted and produced for 2 years. He is committed to spreading data skills, access to data science tooling, and open source software, both for individuals and the enterprise. He has taught workshops & tutorials at SciPy, PyCon, ODSC, with the Carpentries, and has taught online many times over the years, including developing both product and content for online data science curriculum at DataCamp.

James Bourbeau is a Software Engineer at Coiled where he focuses on building tools for scalable computing. He is an active member of the PyData ecosystem where he maintains several open source projects including Dask and Distributed. Previously James worked at Quansight where he helped clients leverage open source scientific Python libraries for data science applications.

Twitter:https://twitter.com/hugobowne
GitHub:https://github.com/hugobowne
LinkedIn: https://www.linkedin.com/in/hugo-bowne-anderson-045939a5/

Photo of Data Umbrella group
Data Umbrella
See more events
Online event
This event has passed