Next Meetup

PDF & Web Scraping
We'll have two talks on this theme: The web scraping portion of the evening will focus on using the rvest and XML packages to pull data from a variety of web sources. A basic introduction to the topic will be followed by discussion of the few most central commands and examples of scraping various website types into clean data sets. Please feel free to bring sites or sources you are interested in scraping. For the pdf scraping portion will focus on extracting information from machine readable PDFs and cleaning the raw document to isolate specific data. The talk will cover an overview of the topic, use cases, and cleaning the raw data. Additionally, we will work through examples, scraping actual PDFs.

Location TBD

TBD · Houston, TX

    Past Meetups (64)

    What we're about

    This is a group of R enthusiasts living in the Greater Houston area.

    R is an open source statistical platform used in a wide range of academic, hobbyist, and professional applications. This group exists to promote R's use in the Houston area, as well as to provide a forum to exchange tips, tricks, ideas, and code.

    People of all experience levels with R, from novice to expert are welcome. We have members who are just starting out with R as well as several that have contributed packages and code to the R project.

    Members (1,267)

    Photos (94)

    Find us also at