Technical workshop on Apache Spark and Apache Drill


Details
Learn the basics of 2 key Big Data Technologies: Apache Spark and Apache Drill. Covered in the training are:
The essentials of Apache Drill, including writing SQL queries on a range of data types, using Drill Explorer on semi-structure data, and how Drill interacts with data and does schema discovery
The essentials of Apache Spark, an overview on loading and inspecting data in Spark, and a demo lab of RDDs
This event will be in a classroom set up for up to 60 people. Attendees can follow along but this is not required. All materials of the courses will be available on https://community.mapr.com/docs/DOC-1540 for attendees to follow up as well as additional content they can visit later to continue their training.
9:00-9:15 Arrival & Registration
9:15-9:30 Intro and Housekeeping
9:30-11:45 Drill Session
11:45-12:00 HR on MapR Culture + Hiring Opportunities
12:00-1:00 Meet & Greet - Lunch provided by MapR
1:00-3:30 Spark Session
3:30-4:00 Closure
Detailed Agenda
9:00-9:15 Arrival & Registration
9:15-9:30 Intro and Housekeeping
9:30-11:45 Drill Session
SQL on Hadoop landscape & Where does Drill fit in
Introduction to Apache Drill
How Drill achieves flexibility & performance - Architecture overview
Interactive demo
Using Drill to query Files, Hive tables & HBase/MapR-DB
ANSI SQL functionality (including queries on JSON, Parquet)
Working with Nested data using Drill
Using Tableau (BI tools) with Drill
11:45-12:00 HR on MapR Culture + Hiring Opportunities
12:00-1:00 Meet & Greet - Lunch provided by MapR
1:00-3:30 Spark Session ( James Casaletto)
Introduction to Apache Spark
Describe the features of Apache Spark
Advantages of Spark
How Spark fits in with the Big Data application stack
How Spark fits in with Hadoop
Define Apache Spark components
Load and Inspect Data in Apache Spark
Describe different ways of getting data into Spark
Create and use Resilient Distributed Datasets (RDDs)
Apply transformation to RDDs
Use actions on RDDs
Interactive demo
Write a simple Spark application in Java
Build the Spark application
Deploy the Spark application on the MapR sandbox
MATERIALS
Please visit the materials suggested prior the training available here: https://community.mapr.com/docs/DOC-1540

Technical workshop on Apache Spark and Apache Drill