Greetings fellow dataists!
For our next meetup, Jeff Patti and Patrick O'Brien from Monetate (http://monetate.com/) will be talking about some tools and techniques of the trade.
6:00 - 6:30: Networking, food
6:30 - 7:00: Map Reduce: Beyond Word Count by Jeff Patti (http://www.linkedin.com/pub/jeff-patti/31/b44/977)
7:00 - 7:30: Collecting data with Scrapy by Patrick O'Brien (http://www.linkedin.com/pub/patrick-o-brien/18/862/abb)
7:30 - 8:00: Lightning Talks
8:00 - Leave for bar
Map Reduce: Beyond Word Count by Jeff Patti
Have you ever wondered what map reduce can be used for beyond the word count example you see in all the introductory articles? Using Python and mrjob, this talk will cover a few simple map reduce algorithms that, in part, power Monetate's information pipeline.
Bio: Jeff Patti is a backend engineer at Monetate with a passion for algorithms, big data, and long walks on the beach. Prior to working at Monetate he performed software R&D for Lockheed Martin, where he worked on projects ranging from social network analysis to robotics.
Collecting data with Scrapy by Patrick O'Brien
Scrapy is a simple, fast web scraping library that turns unstructured web content into clean data. Together we will dive into the architecture of this library and create our own crawler.
Bio: Patrick O'Brien is a software engineer at Monetate.