Web Scraping using Beautiful Soup


Beautiful Soup is a Python library used for pulling data out of websites . It provides ways of navigating, searching, and modifying the parse tree while saving programmers hours or even days of work.
It creates a parse tree for parsed pages that can be used to extract data from HTML, which is useful for web scraping. It is available for Python 2.7 and Python 3.

Monica Puerto, data scientist and senior strategist at the American Federation of State, County and Municipal Employees (AFSCME) will present to us information on how to use beautifulSoup and provide us with a demonstration and lecture about its capabilities.


Monica Puerto is a data activist, a feminist, a Latina, a Women Who Code Director, and a Data Scientist for AFSCME. She has experience in Python’s Pandas, Matplotlib, Scikit Learn, and Numpy libraries. She is a Women Who Code Director for the DC local chapter. She handles the logistics of their Python meetups. Outside of python she has experience in R : ggplot,dplyr, tidyverse, caret, stringr,lubridate, acs, Rsocrata ; as well as SQL: Postgresql, MySql, Snowflake, Bigquery.

Food and drinks will be provided. We will also have some books on python to raffle off for FREE.

Difficulty: Beginner /Undergrad

You should bring: Pen/paper , laptop

Software: Python 3 , Jupyter notebook


Cleveland park library
CPK First Floor Meeting Room 1

3310 Connecticut Ave NW,
Washington, DC 20008