Feb 19, 2013 · 6:30 PM
Bronto Software, Inc.
Large-scale data poses challenges at every phase of its lifecycle: capture, storage, processing, and result serving. Depending on the nature of the data and the analysis goals of the data team, imposing the right schema on a NoSQL storage system such as Apache HBase can enable more efficient storage, retrieval, and analysis of relevant information, and can make the system easier to maintain.
In this talk, we present the Kiji framework for building real-time, scalable data applications on Apache HBase. Kiji is a collection of Apache 2.0-licensed open-source components that extend the Hadoop ecosystem and help developers with schema management, MapReduce processing, and data integration tasks.
To download or learn more about Kiji, visit www.kiji.org.
About the speaker:
Aaron Kimball founded WibiData in 2010. He has worked with Hadoop since 2007 and is a committer on the Apache Hadoop project. In addition, Aaron founded the Apache Sqoop data import tool and the Apache MRUnit Hadoop testing library projects. Previously, he was the first engineer hired by Cloudera, the leading provider of Apache Hadoop-based software and services.
Aaron holds a B.S. in Computer Science from Cornell University and an M.S. in Computer Science from the University of Washington. When not thinking about Hadoop, Aaron is an avid sailor, Burning Man devotee, and player of board and video games.