July 28, 2014 · 7:00 PM
Data themed office hours held Monday evenings, 7-9pm.
Bring your computer!
We're going to work on recreating a chart published in Vanity Fair that shows the most mentioned brands in Jay-Z songs.
Code: Here's a starter repo, with a scraper script using BeautifulSoup to download Jay-Z's lyrics and another that does basic parsing to find most frequent words: http://git.io/CsZ62w
Exercises: Try to further clean the parsing to find brand names mentioned and draw charts!
Don't forget to bring your computer =)
Our host for the evening is NewsCred. Bring an ID to sign in at the front desk and take the elevator to the 6th floor. Please RSVP early.
NewsCred offers a content marketing and syndication platform built with Python that allows clients to quickly discover meaningful new articles and images with the help of data science, manage their content, plan their editorial calendars, and measure their ROIs. (read more) @newscred