Mar 5, 2013 · 6:00 PM
This location is shown only to members
Open Data Bay Area's first meeting in Mountain View! Please join us to hear Richard Park of LinkedIn talk about Azkaban, an open source batch scheduler for Hadoop and to hear Greg Lindahl of blekko talk about the open data movement and how you can make a difference.
Speaker: Richard Park
Richard Park is a Senior Software Engineer in LinkedIn's Hadoop Infrastructure. He is the lead developer on Azkaban
Title: Introducing Azkaban 2.0
Azkaban has been used by LinkedIn since 2009 to schedule and run massive Hadoop workflows. It was designed to be simple and Recently, Azkaban 2.0 has been released. Completely rewritten from the ground up, it was designed to be more scalable, extensible, secure, and to work with future versions of Hadoop. This talk will cover the features of Azkaban 2.0, how LinkedIn uses it, and the future plans for this workflow manager.
Speaker: Greg Lindahl
Title: Open Data Needs You!
It's hard to imagine now, but the Free & Open Software movement was once small and needed help. Linux and *BSD users used to have to pick from a very restrictive list of devices with drivers. Between the demise of Netscape and the release of Firefox, there was no maintained web browser for X Windows.
The Open Data movement today is at an awkward place. Many government organizations can't decide if data like bus timetables or court decisions should be free or monopolized by the highest bidder. The dominant search engine in South Korea is the only search engine with access to a walled garden with the majority of Korean-language web content. Organizations allegedly in favor of freedom make bogus copyright claims on public domain content in order to "monetize."
Open Data needs your help. We data nerds need to create, use, and widely distribute open data. And we need to convince organizations to participate in the Open Data community when it makes sense.