March 9, 2011
I've designed and continue to work on a system for large scale collection of data for production analytics platforms such as hadoop.
These days: Distributed systems, programming languages, the hadoop and friends, java.
Sure. I'm willing to present about the flume project and its community, but would really like to have folks from the user community to present their scenarios and use cases.
architecture of real-world production collection architectures, multi-data center collection practices,
I'm a software engineer at Cloudera and the techincal and community lead on the Flume project, a distributed reliable streaming collection system for Hadoop.