Data Lessons Learned at Scale
Details
For our October Meetup, Data Science DC and Big Data DC (https://www.meetup.com/bigdatadc/) are thrilled to jointly bring you two great speakers on the topic of lessons learned when working with really, really big data sets. First, Charlie Reverte (http://www.linkedin.com/in/charliereverte/) from AddThis will talk about his experiences leading an engineering team as it dealt with managing and analyzing web traffic data from over a billion worldwide users. Then, Prof. Jimmy Lin (http://www.umiacs.umd.edu/~jimmylin/index.html) from the University of Maryland at College Park will related what he learned at a two-year stint at Twitter, helping to build data mining tools that could scale to one of the larger sites on the web.
This event is listed on both the DSDC and BDDC (https://www.meetup.com/bigdatadc/events/139611972/) Meetups! Please RSVP to whichever group you choose, but please do not RSVP "Yes" to both!
Agenda:
6:30pm -- Networking, Empenadas, and Refreshments
7:00pm -- Introduction
7:10pm -- Presentations and discussion
8:30pm -- Adjourn for Data Drinks (Tonic, 22nd & G St.)
Abstracts:
pending
Bios:
Jimmy Lin is an associate professor in the iSchool at the University of Maryland, affiliated with the Department of Computer Science and the Institute for Advanced Computer Studies. He graduated with a Ph.D. in computer science from MIT in 2004. Lin's research lies at the intersection of information retrieval and natural language processing, and he has done work in a variety of areas, including question answering, medical informatics, and bioinformatics. Lin's current research focuses on massively-distributed data analytics in
cluster-based environments.
Charlie Reverte grew up on robotics and microprocessors. At Carnegie Mellon, he studied distributed systems while sending robots into caves and coal mines. He also developed one of the first augmented reality systems for robotic surgery and ACL reconstruction. Charlie has been working at AddThis (formerly Clearspring) since initial funding in 2006, and has helped it reach 1.4 billion unique users across the web. He co-authored the OExchange spec for open sharing, which was implemented by Twitter and Google, among others. He believes mobile apps are a fad and that the mobile web will win because of addressability and the URL. After hours, Charlie is hooked on 24-hour endurance car racing and drives for the Clearspring Motor Club (csmotorclub.com (http://csmotorclub.com/)).
Sponsors:
This event is sponsored by O'Reilly Strata (http://oreil.ly/1500l9u), Intridea (http://www.intridea.com/), Statistics.com (http://bit.ly/12YljkP), and Elder Research (http://datamininglab.com/).
Strata + Hadoop World (http://oreil.ly/1500l9u) is October 28-30 in New York. Strata is the industry's leading event in big data and data science, with a dedicated track for data visualization and design. Save 20% with the code DATADC.
Parking:
For those driving, we encourage you to find parking for this event via our sponsor, ParkMe (http://www.parkme.com/). ParkMe will help you find the closest, cheapest parking, and has iPhone (https://itunes.apple.com/us/app/parkme-parking-find-cheapest/id417605484?mt=8) and Android (https://play.google.com/store/apps/details?id=com.parkme.consumer) apps.