Data Processing in Hadoop - Analytics & Data Pipelines in Practice


Details
Target audience
Big Data engineers, Architects, BI / Data Analysts, Data Scientists, Developers, Product Managers
Abstract
Hadoop is the de facto standard in Big Data platforms, providing cost-efficient, raw storage and processing power.
Businesses have embraced Hadoop for its low-cost storage and ability to scale. However, many of these businesses have difficulty extracting actionable insight from the data collected in Hadoop systems: different data formats need different computation tools, there is no unified environment for product deployment, and business analysts struggle with highly technical big data tools.
While the stack side is well defined and there are solutions addressing the machine learning side of data analytics (which is often interactive and experimental), what is lacking is the more automated part in the middle: data processing and pipeline automation.
This meetup focuses on the next generation of data processing and the automation of data pipelines. It aims to explain why the adoption of Hadoop is hitting the "Trough of Disillusionment" (as per the Gartner hype cycle) and how to proceed to the "Plateau of Productivity" by shifting the focus from the platform to the vertical solutions of the business.
Agenda
17:30 - 18:00 Gathering (Food and Beverage)
18:00 - 19:00 Data Processing in Hadoop - Analytics and Data Pipelines in Practice – Mr. Lars George
19:00 - 19:45 In-Memory Data Processing of Big Data – Mr. Shmulik Sitton
19:45 - 20:00 Demo - SAP Vora and Spark
20:00 - 20:30 Q&A
Session 1
Data Processing in Hadoop - Analytics and Data Pipelines in Practice
Presenter
Mr. Lars George, Co-Founder, OpenCore, Germany (http://www.opencore.com/)
A leading, respected member of the global open source community
Author of the book "HBase: The Definitive Guide"
Previously served as EMEA Chief Architect at Cloudera
A senior Hadoop and Big Data architect, open-source developer, public speaker, trainer, teacher, mentor, and leader
Session 2
In-Memory Data Processing of Big Data and Demo
Presenter
Mr. Shmulik Sitton, Senior Architect at SAP
