Web Scraping: Extracting Data from a Website's HTML Code
Details
Taylor Guthrie will join us to present his work on web scraping. During this session, the following skills will be demonstrated:
- Use the rvest package in R together with CSS selector tools such as SelectorGadget to identify and pull data from static web pages. SelectorGadget is available at http://selectorgadget.com/
- Explore how to iteratively pull data from multiple web pages that all share the same overall structure.
- Wrangle and analyze the data.
Please download the materials and bring your laptop to follow along. There will be time to ask questions, mingle, and socialize after the session.
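
As a rough preview of the skills listed above, here is a minimal sketch of the select, iterate, and wrangle workflow with rvest. It is not taken from the session materials: the inline HTML pages, the `.item`, `.name`, and `.price` selectors, and the `scrape_page()` helper are all hypothetical, chosen so the example runs without a network connection. On a real site you would pass URLs to `read_html()` and find the selectors with a tool such as SelectorGadget.

```r
library(rvest)

# Hypothetical stand-ins for two web pages that share the same structure.
# In practice these would be URLs rather than literal HTML strings.
pages <- c(
  paste0(
    '<html><body>',
    '<div class="item"><span class="name">Alpha</span><span class="price">10</span></div>',
    '<div class="item"><span class="name">Beta</span><span class="price">20</span></div>',
    '</body></html>'
  ),
  paste0(
    '<html><body>',
    '<div class="item"><span class="name">Gamma</span><span class="price">30</span></div>',
    '</body></html>'
  )
)

# Hypothetical helper: parse one page and pull the fields by CSS selector.
scrape_page <- function(html_source) {
  doc <- read_html(html_source)
  data.frame(
    name  = html_text2(html_elements(doc, ".item .name")),
    price = as.numeric(html_text2(html_elements(doc, ".item .price"))),
    stringsAsFactors = FALSE
  )
}

# Iterate over every page and stack the per-page results into one data frame.
results <- do.call(rbind, lapply(pages, scrape_page))

# A small wrangling step: summarise the scraped values.
mean_price <- mean(results$price)
print(results)
```

Because every page shares the same layout, one helper function can be reused across all of them, and the combined data frame is ready for ordinary analysis in R.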
