Unlocking your Hadoop data with Apache Spark and CDH5

Hosted by Denny Lee and 4 others

Details

This is an introductory session showcasing real-world implementations of Spark within the context of your big data infrastructure. The session will be demo-heavy and slide-light, focusing on getting your development environment up and running, configuration issues, SparkSQL vs. Hive, and more.

We also have swag from Cloudera and Databricks!

Agenda

6:00-6:30pm: Come on up to Facebook Seattle HQ

6:30-7:00pm: Configuring and Deploying Spark on YARN with Cloudera Manager (Special Guest: Kostas from Cloudera)

7:00-7:45pm: Introductory Spark scenarios at Concur

• Quick primer on our expense receipt scenario

• Connecting to Spark on your CDH5.1 cluster

• Quick demos

– Pig vs. Hive

– SparkSQL

• Tableau connecting to SparkSQL (Special Guest: Jeff from Tableau)

• Deep Dive demo

– MLlib: SVD

Seattle Spark+AI Meetup
1730 Minor Ave., 14th Floor, Seattle, WA