Past Meetup

Unlocking your Hadoop data with Apache Spark and CDH5

This Meetup is past

100 people went

Details

This is an Introductory session showcasing real world implementations of working with Spark within the context of your Big Data Infrastructure. The session will be demo heavy and slide light focusing on getting your development environments up and running including getting up and running, configuration issues, SparkSQL vs. Hive, etc.

We also have swag from Cloudera and DataBricks!

Agenda

6:00-6:30pm: Come on up to Facebook Seattle HQ

6:30-7:00pm: Configuring and Deploying Spark on YARN with Cloudera Manager (Special Guest: Kostas from Cloudera)

7:00pm-7:45pm: Introductory Spark scenarios at Concur

• Quick primer on our expense receipt scenario

• Connecting to Spark on your CDH5.1 cluster

• Quick demos

– Pig vs. Hive

– SparkSQL

• Tableau connecting to SparkSQL (Special Guest: Jeff from Tableau)

• Deep Dive demo

– MLLib: SVD