Data Governance in Hadoop Environments


Details
Agenda:
6:30 PM: Welcome and Sandwiches
7:00 PM: Effective Data Wrangling with Trifacta
Trifacta is a data preparation application that enables users to transform complex data into structured formats for analysis.
With this tool users are able to interactively explore the content of their data and trough a process called predictive transformation, define a recipe for how the dataset should be transformed. This logic is used to define how the data is processed either on your desktop, server, cloud environment or Hadoop
Speaker: Bert Oosterhof, EMEA Field CTO at Trifacta
7:45 PM: Cloudera Navigator: Data Governance solution for Hadoop
Cloudera Navigator is a complete data governance solution for Hadoop, offering critical capabilities such as data discovery, continuous optimization, audit, lineage, metadata management, and policy enforcement. As part of Cloudera Enterprise, Cloudera Navigator is critical to enabling high-performance agile analytics, supporting continuous data architecture optimization, and meeting regulatory compliance requirements.
Speaker: Emre Sevinç: Big Data Architect, Big Industries
8:30 PM: Cloudera Optimizer Demo
Cloudera Navigator Optimizer helps optimize inefficient query workloads for best results on Apache Hadoop.
This tool profiles and analyzes the SQL text in large, complex SQL workloads so users can gain an in-depth understanding of their workloads, identify queries best-suited for Hadoop and modify them as needed for optimal efficiency on Hadoop—all via an easy-to-use web UI.
Speaker: Wim Villano, Sales Engineer at Cloudera
9:00 PM: End of the session

Data Governance in Hadoop Environments