Big Data DC Meetup #4


Details
It is time for another Big Data DC meetup. We had a fantastic turnout and two great talks at the last meetup so hopefully this one will be even better. The format will be primarily the same -- two focused sessions with time for questions, and room for general discussion.
[7-7:15] Mingle/welcome/intro [7:15-7:45] Chris Burroughs, a Kafka committer, will present on Kafka - Previous meetups have focused on cool things you can do with data once it's in your database. This talk will instead focus on how to get large amounts of data into your analytics system to begin with. Kafka is a distributed publish-subscribe messaging system originally developed at LinkedIn for activity stream data and currently in the Apache Incubator. I'll introduce Kafka and explain how it contrasts with good old access logs and traditional (JMS, AMQP) messaging systems.
[7:45-8:00] Break and discussion [8:00-8:30] Palantir - Rob Giardina and Brendan Weickert - Big Data at Palantir [8:30-??] Informal discussion, drink-having Clearspring is hosting, additional food sponsorship opps available. There will once again be a fine selection of beer and other beverages.
As many of you are aware Stanford is hosting a few online AI (http://www.ai-class.com/) and ML (http://ml-class.org/) courses for free this fall. A few of us at Clearspring are signing up for the courses and looking to make a study group. If you are interested in joining the study group please let us know.
We look forward to seeing you all there for another great session.
- Will (@willmeyer) and Matt (@abramsm)

Big Data DC Meetup #4