
Workshop: Web Scraping with Python - updated for 2024 (Austin)

Hosted By
Lynn B.

Details

Registration and full details at Eventbrite:
https://web-scraping-with-python-2024.eventbrite.com

This workshop, held in conjunction with Data Day Texas and updated for 2024, will cover everything you need to build sophisticated web crawlers and scrapers.

Every time Ryan comes to Austin to offer this course, it sells out. There will be no second session. Get your ticket now.

Learn how to collect data for everything from small personal projects to large enterprise applications. All students will have access to a repository of interactive code samples, and we’ll do hands-on programming throughout the course to solidify learning. Along the way, Ryan will share case studies, scenarios, and humorous anecdotes from the trenches of data wrangling.
This course will cover:
• Web page analysis and HTML parsing with BeautifulSoup
• Using Scrapy to build crawlers
• Best practices and engineering patterns for working with multiple data sources, data types, distributed scraping, and more
• Selenium and browser automation
• Common scraper-blocking tactics and how to avoid them
• Integrating with third-party services such as ScraperAPI and Zyte

REQUIREMENTS
To get the most out of this course, students should have an intermediate working knowledge of the Python programming language. Some experience with databases and data architecture is helpful, but not required.

Come prepared with a laptop running Python 3.8+. Attendees who have not already installed pip and Jupyter notebooks will receive instructions for doing so.

About the Instructor

Ryan Mitchell is the author of Web Scraping with Python (O'Reilly), whose third edition comes out this year. She has six LinkedIn Learning courses, including Python Essential Training, currently the leading Python course on the platform. An expert in web scraping, application security, and data science, Ryan has hosted workshops and spoken at many events, including Data Day and DEF CON. Ryan holds a master’s degree in software engineering from Harvard University Extension School and a bachelor’s in engineering from Olin College. She is currently a senior software engineer at the Gerson Lehrman Group, where she works on the search team.

DFW Data Geeks (official)