
What we’re about
A group for All things Data. All are welcome.
Upcoming events (2)
See all- Workshop: Web Scraping with Python - updated for 2024Needs location
Registration and full details at Eventbrite:
https://web-scraping-with-python-2024.eventbrite.comThis workshop - updated for 2024 - will cover everything you need to build sophisticated web crawlers and scrapers.
Every time Ryan comes to Austin to offer this course, it sells out. There will be no second session. Get your ticket now.
Learn how to collect data for everything from small personal projects to large enterprise applications. All students will have access to a repository of interactive code samples, and we’ll do hands-on programming throughout the course to solidify learning. Along the way, Ryan will share case studies, scenarios, and humorous anecdotes from the trenches of data wrangling.
This course will cover:
• Web page analysis and HTML parsing with BeautifulSoup
• Using Scrapy to build crawlers
• Best practices and engineering patterns for working with multiple data sources, data types, distributed scraping, and more
• Selenium and browser automation
• Common scraper-blocking tactics and how to avoid them
• Integrating with third-party services such as ScraperAPI and Zyte
REQUIREMENTS
To get the most out of this course, students should have an intermediate working knowledge of the Python programming language. Some experience with databases and data architecture is helpful, but not required.
Come prepared with a laptop running Python 3.8+. Attendees will receive instructions for installing pip and Jupyter notebooks if they have not done this already.About the Instructor
Ryan Mitchell is the author of Web Scraping with Python (O'Reilly), with its third edition coming out this year.. She has six LinkedIn Learning courses, including Web Scraping with Python and Python Essential Training — currently the leading Python course on the platform. An expert in web scraping, application security, and data science, Ryan has hosted workshops and spoken at many events, including Data Day and DEF CON. Ryan holds a master’s degree in software engineering from Harvard University Extension School and a bachelor’s in engineering from Olin College. She is currently a senior software engineer at the Gerson Lehrman Group where she works on the search team.Registration and full details at Eventbrite:
https://web-scraping-with-python-2024.eventbrite.com - Data Day Texas +AIAT&T Executive Education and Conference Center, Austin, TX
Details and early bird registration at datadaytexas.com.
Data Day Texas returns with an all-star lineup of speakers from around the world. Topics covered include: Data Science, Data Engineering, Platforms, Data Products, Realtime Data, Business Intelligence, Analytics, MLOps, LLMs, Generative AI and ....
Speakers already confirmed are
Joe Reis and Matthew Housley, co-authors of the O'Reilly best seller, Fundamentals of Data Engineering;
Susan Shu Chang, Principal Data Scientist at Elastic and author of the upcoming Machine Learning Interviews;
Mikiko Bazeley, head of MLOps at FeatureForm;
Veronika Durgin, VP of Data at Saks;
Lauren Balik and Mary MacCarthy, co-hosts of the Tech Bros on Linkedin and YouTube;
Jesse Anderson, author of the bestselling Data Teams.
Holden Karau, author of the upcoming Scaling Spark with Dask;
Jonathan Ellis, OG Cassandra PMC Chair and Co-Founder, DataStax.
Hala Nelson, Professor of Mathematics at James Madison and author of Essential Math for AI;
Adi Polak, VP of Developer Experience at Treeverse and author of Machine Learning with Apache Spark;
Santona Tuli, Head of Data at Upsolver;
Ole Olesen-Bagneux, author of The Enterprise Data Catalog;
Biill Inmon, father of the Data Warehouse concept.
Chris Tabb, co-founder of LEIT Data;
Andy Petrella, author of What is Data Observability;
and 60+ more to be announced.....Buy your early bird ticket now and save money!
Details and registration at datadaytexas.com