Data Masters and Big Data Folk!
Proudly supported by Cloudera APJ
February 6th we're back for another meetup and the first for 2014.
Topic of the Night - 'The Enterprise Data Hub' by Cloudera APJ
Sponsor of the night is Cloudera, Hadoop masters and all round champions!
This first talk starts out with an overview of the Enterprise Data Hub, what core components make up the Hub and how Enterprises can leverage Cloudera technology to Ask Bigger Questions
"An Enterprise Data Hub is one place to store all data, for as long as desired or required, in its original fidelity; integrated with existing infrastructure and tools; with the flexibility to run a variety of enterprise workloads — including batch processing, interactive SQL, enterprise search and advanced analytics — together with the robust security, governance, data protection, and management that enterprises require. With an Enterprise Data Hub, leading organizations are changing the way they think about data, transforming it from a cost to an asset."
The next talk starts out with an overview of Impala from the user's perspective, followed by a presentation of Impala's architecture and implementation. It concludes with a summary of Impala's benefits when compared with Apache Hive, commercial MapReduce alternatives, and traditional data warehouse infrastructure.
Impala: A Modern, Open Source SQL Engine for Hadoop, Wilfred Spiegelenburg ([masked])
"The Cloudera Impala project is pioneering the next generation of Hadoop capabilities: the convergence of fast SQL queries with the capacity, scalability, and flexibility of a Hadoop cluster. With Impala, the Hadoop community now has an open-sourced codebase that helps users query data stored in HDFS and Apache HBase in real time, using familiar SQL syntax. In contrast with other SQL-on-Hadoop initiatives, Impala's operations are fast enough to do interactively on native Hadoop data rather than in long-running batch jobs. Now you have the freedom to discover relationships and explore what-if scenarios on Big Data datasets. By taking advantage of Hadoop's infrastructure, Impala lets you avoid traditional data warehouse obstacles like rigid schema design and the cost of expensive ETL jobs. "
6.20pm - 6.30pm Registration
6.30pm - 6.40pm Welcome - Fernando Paul
6.40pm - 7.10pm Speaker #1 : The Enterprise Data Hub
7.10pm - 7.30pm - Networking/Food and Drink (20 mins)
7.30pm - 8.00pm - Speaker #2 : Cloudera Impala
8.00pm - 9.00pm - Networking/Food and Drink cont…
9.00pm - close
We look forward to seeing you again on February 6th, 2014,so please invite a friend or two (or three) to join the meetup group, RSVP this event, particpate, network, eat some pizza with a side of beer and most importantly wear your big data hat (regardless if your hat reads BDNewbie or BDGuru).
then, RSVP away and see you soon!