Skip to content

From 0 to Dashboard in 20 mins with Cloudera Data Platform

Z
Hosted By
Zac K. and Sourabh
From 0 to Dashboard in 20 mins with Cloudera Data Platform

Details

DATE CHANGE:
The workshop moved to Thursday 23rd June at 4pm-7pm at the same location. We look forward to seeing you on the 23rd!

BACK IN PERSON!!

Join us for pizza and meet like minded data people!

This workshop is intended to showcase how to build a modern lakehouse architecture with Cloudera's open source technologies like Apache Nifi, Apache Spark, Apache Hive, Apache Iceberg and how to use them in your data architecture for large scale distributed data processing. The workshop will focus on real time data ingestion at scale, Spark SQL and introduce ACID transactions and time travel (data versioning) for ETL and streaming workloads. Slides, demos and QnA will help you understand the concepts required to build a modern multi analytic lakehouse architecture.

Motivation:
Whether you are new to the field of data analytics and data science, this workshop will help you understand tools required to build petabyte scale data pipelines easily and efficiently.

Who this is for?
Solution Architects
Data engineers
Data scientists

Agenda
1. Fundamentals of a modern hybrid multi function data platform

  • Overview of Cloudera data platform
    2. Building data ingestion pipelines at scale
  • Introduction to Apache Nifi
  • Serverless Nifi flows
  • Monitoring flows
    3. Fundamentals of Apache Spark
  • Introduction to Spark on Kubernetes
  • Monitoring Spark workloads
  • Workload orchestration with Apache Airflow
    4. Introduction to Data warehousing with Apache Hive
  • Introduction to Apache Iceberg as a table format
  • ACID transactions
  • Time travel
  • Data versioning
  • Dashboarding

Requirements
- Good to have some python and SQL knowledge
- Some knowledge of distributed data processing
- Basic cloud knowledge

COVID-19 safety measures

Event will be indoors
The event host is instituting the above safety measures for this event. Meetup is not responsible for ensuring, and will not independently verify, that these precautions are followed.
Photo of Future of Data: Melbourne group
Future of Data: Melbourne
See more events