addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwchatcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgoogleimageimagesinstagramlinklocation-pinmagnifying-glassmailminusmoremuplabelShape 3 + Rectangle 1outlookpersonplusprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

New Meetup: Collaborative Meetup with DC Hadoop User Group

From: Chida
Sent on: Tuesday, August 24, 2010 10:54 PM
Announcing a new Meetup for National Capital Area Google Technology Users!!

What: Collaborative Meetup with DC Hadoop User Group

When: Tuesday, September 7,[masked]:00 PM

Where: Kaiser Family Foundation
1330 G Street Northwest
Washington, DC 20005

NCAGTUG is set to hold our next meetup on September 7, 2010 with Washington DC Area - Hadoop User Group (HUG).

"Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license.[1] It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google's MapReduce and Google File System (GFS) papers" - Wikipedia

We are excited to say that we have scheduled Tom White, the author of the book "Hadoop The Definitive Guide" to give a presentation. This would be a great opportunity for all the members to work together and pull in both of our communities. Cloudera will be sponsoring the meetup so food and drinks will be provided by them.

More Info from Hadoop meetup group: .

I'd like to thank our sponsors:

Booz Allen Hamilton -
Cloudera -
This meetup is in coordination with several other meetup groups from the area. I'd personally like to thank them for working with us and helping to organize this event:

DC useR Group -
DC Python Meetup -
The Washington Semantic Web Meetup -
DC VA MD New Media Tech - Video - Social Web -
National Capital Area Google Technology Users -
I'm excited to say we have two very special guests presenting at this meetup, Tom White and Aaron Cordova. Tom has been instrumental in shaping the direction and development of Hadoop and many projects in Hadoop's ecosystem. He is also the author of the book "Hadoop: The Definitive Guide" and a part of the team at Cloudera. Aaron Cordova is a UMD graduate working at Booz Allen Hamilton. Before joining Booz Allen Aaron played a key role in defining the critical large scale data analytics infrastructure and applications at the NSA. Currently Aaron focuses on helping government organizations manage and analyze large amounts of data using technologies such as Hadoop, Hive, and HBase. Thus enabling them to make better decisions and answer key questions important to business and operations.

Kaiser Family Foundation - Public Affairs Center
1330 G St. NW
Washington, DC 20005

1 block from the Metro Center Station metro stop.


6:30 - 7:00 Food and Refreshments, Socialize
7:00 - 7:30 Tom White - Hadoop's powerful parallel processing paradigm provides a great generalized framework for storing and analyzing data. For all its raw power however the practical use of Hadoop requires more. What's needed is an integrated stack of components which makes it easier to develop and use real-world applications in a production environment. Tom will discuss the evolving Hadoop platform, its components and how each fills a critical role in making Hadoop more useful in the enterprise.
7:30 - 7:40 Short Break
7:40 - 8:10 Aaron Cordova - One of the barriers to scaling Hadoop to 10,000 machines is the single HDFS NameNode. Recent benchmarks [1] show that the HDFS needs to be able to do an order of magnitude more writes per second to reach 10,000. The most promising way to do this is to create a distributed NameNode. Aaron will discuss the issues surrounding distributing the NameNode functionality to multiple machines, including automatically and organically partitioning the namespace, how to keep operations serialized and durable, and how recovery from failure changes.
1. Shvachko, Konstantin. HDFS Scalability the Limits of Growth. USENIX Login Magazine. April 2010.

Don't Forget:
Registration is open for Cloudera's Hadoop World in NYC More information here: http://www.cloudera.c...

Look forward to see you there!

RSVP to this Meetup:

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy