May Hadoop Meetup: Smarter Big Data Integration for Hadoop


Details
Dear HUG UK members,
I am pleased to announce our May meetup, sponsored by Syncsort.
This will be an exceptional evening at the London Transport Museum.
Details below.
Sebastian
TIME:
Tuesday May 14th 2013, Doors Open 6:30pm.
Presentations 7:00pm – 8:30.
Bar Closes 9:00pm, Doors Close 9:30pm
LOCATION: London Transport Museum, Covent Garden
AGENDA:
Intro:Hadoop in Europe 2013 Update
Session 1: Syncsort
Title: Smarter Big Data Integration for Hadoop
Speaker: Ruediger Schickhaus, Senior Big Data Solution Consultant, Syncsort
Abstract: Hadoop has become a de facto standard in supporting Big Data analytics. A very common use case for Hadoop is data transformation and a new way to deliver ETL and SQL migration. With this in mind, Syncsort has made a contribution to Apache Hadoop that not only makes sort pluggable, but also facilitates new and difficult real world ETL use cases and database off-load, working natively within the MapReduce framework. This session will show (including a short demo) how the Syncsort contribution optimises ETL processes, which enable vertical scalability and a smarter integration tool set for Hadoop.
Session 2: MapR
Title: Apache Drill - interactive, ad-hoc query at scale
Speaker: Michael Hausenblas, Chief Data Engineer EMEA, MapR Technologies
Abstract: Apache Drill is a distributed system for interactive analysis of large-scale datasets, inspired by Google's Dremel technology. It is designed to scale to thousands of servers and to be able to process Petabytes of data. Since its inception in mid 2012, Apache Drill has gained widespread interest in the community. In the talk we focus on how Apache Drill enables interactive, ad-hoc query at scale: first we walk through use cases and then delve into Apache Drill's architecture and discuss its key features such as Full SQL support and extensibility points, incl. plug-able data sources.
Session 3: Hortonworks
Title: Ambari and Falcon
Speaker: Olivier Renault, EMEA Solution Engineer, Hortonworks
Abstract: Ambari is the Open Source management interface enabling you to provision, manage and monitor your Hadoop environment. It's a product, which has been initially designed based upon the experience of Yahoo team. During this talk, we will present the current solution as well as presenting the future direction for Ambari. Project Falcon is a data flow management solution available as part of the community, it has been developed / used by InMobi for more than a year into production before being brought to Hortonworks to Open Source.
Session 4: Cloudera
Title: Cloudera Impala
Speaker: Paolo Castagna, Cloudera Systems Engineer
Abstract: Delve into the Cloudera Impala architecture to discover just how it manages to bring the only true low latency SQL capability to Hadoop and see the results live!

May Hadoop Meetup: Smarter Big Data Integration for Hadoop