Skip to content

Technical workshop on Apache Spark and Apache Drill

Photo of Elizabeth Land
Hosted By
Elizabeth L.
Technical workshop on Apache Spark and Apache Drill

Details

Learn the basics of 2 key Big Data Technologies: Apache Spark and Apache Drill. Covered in the training are:

The essentials of Apache Drill, including writing SQL queries on a range of data types, using Drill Explorer on semi-structure data, and how Drill interacts with data and does schema discovery

The essentials of Apache Spark, an overview on loading and inspecting data in Spark, and a demo lab of RDDs

This event will be in a classroom set up for up to 60 people. Attendees can follow along but this is not required. All materials of the courses will be available on https://community.mapr.com/docs/DOC-1540 for attendees to follow up as well as additional content they can visit later to continue their training.

9:00-9:15 Arrival & Registration

9:15-9:30 Intro and Housekeeping

9:30-11:45 Drill Session

11:45-12:00 HR on MapR Culture + Hiring Opportunities

12:00-1:00 Meet & Greet - Lunch provided by MapR

1:00-3:30 Spark Session

3:30-4:00 Closure

Detailed Agenda

9:00-9:15 Arrival & Registration

9:15-9:30 Intro and Housekeeping

9:30-11:45 Drill Session

SQL on Hadoop landscape & Where does Drill fit in

Introduction to Apache Drill

How Drill achieves flexibility & performance - Architecture overview

Interactive demo

Using Drill to query Files, Hive tables & HBase/MapR-DB

ANSI SQL functionality (including queries on JSON, Parquet)

Working with Nested data using Drill

Using Tableau (BI tools) with Drill

11:45-12:00 HR on MapR Culture + Hiring Opportunities

12:00-1:00 Meet & Greet - Lunch provided by MapR

1:00-3:30 Spark Session ( James Casaletto)

Introduction to Apache Spark

Describe the features of Apache Spark

Advantages of Spark

How Spark fits in with the Big Data application stack

How Spark fits in with Hadoop

Define Apache Spark components

Load and Inspect Data in Apache Spark

Describe different ways of getting data into Spark

Create and use Resilient Distributed Datasets (RDDs)

Apply transformation to RDDs

Use actions on RDDs

Interactive demo

Write a simple Spark application in Java

Build the Spark application

Deploy the Spark application on the MapR sandbox

MATERIALS

Please visit the materials suggested prior the training available here: https://community.mapr.com/docs/DOC-1540

Photo of Women in Big Data Meetup group
Women in Big Data Meetup
See more events
MapR
350 Holger Way · San Jose, CA