Web scraping: Why, How, and If?


Details
Web scraping is the skill of aggregating data from publicly available websites and storing them in a structured format. This is not only a fun skill to have for collecting information you like, but it is also a valuable skill for companies that run marketing research, and aggregators of travel data, prices, etc.
First, I will review the core concepts of how data are organized on a website using HTML. Then I will walk you through a live-coding scraping example using BeautifulSoup (a Python library) in a Jupyter notebook. Finally, I will briefly cover the ethics of web scraping. Some basic experience with Python would be useful.
About the speaker: Mahdi Sadjadi (he/him/his) is currently a data scientist at VideoAmp and earned his PhD in Physics at Arizona State University studying disordered systems such glass to design and understand their mechanical properties. He writes specialized software to study such systems as mathematical graphs.
LinkedIn profile: https://www.linkedin.com/in/mahdisadjadi/

Web scraping: Why, How, and If?