Dear HUG members,
It's time again for a great meetup!
On April 22nd, Lars George will give us an interesting talk on NoSQL in the Hadoop World.
I am very excited about this talk and am looking forward to seeing all of you out there.
Stay tuned for more information regarding the location.
If you have any questions, please drop a line in the comment section below.
Times are changing fast in the rather young Hadoop world. From the batch oriented beginnings using MapReduce, complemented with a random read and write store called HBase, the use-cases seemed rather easy to lay out. But that has changed dramatically with the advent of MPP style query engines like Impala, or automated as well as interactive ad-hoc querying using Spark or Search. These days it is much more involved to find the right tool for the job at hand. Should you rely on existing "state-of-the-art" advise - or is there more to be said when preparing data for further processing in the Hadoop ecosystem. Is there such thing as "single source of truth" and if not, why? What are the strength of Impala, Spark, Search vs MapReduce and HBase? All of these questions are addressed while going deeper into the architectural bowels of how each of these tools plays its tricks. In the end there is only physics and you cannot cheat it: which is the ultimate tool for your use-case?
Lars George has been involved with Hadoop and HBase since 2007, and became a full HBase committer in 2009. He has spoken at many Hadoop User Group meetings, and conferences such as ApacheCon, FOSDEM, QCon, or Hadoop World and Hadoop Summit. He also started the Munich OpenHUG meetings. Lars now works for Cloudera, as the EMEA Chief Architect, acting as a liaison between the Cloudera professional services team and customers as well as partners in and around Europe, building the next data driven solutions. He is also the author or O'Reilly's "HBase - The Definitive Guide".