Hands-on Introduction to Spark & Zeppelin


Details
Join us for an intro and overview of Apache Spark (http://spark.apache.org/), SparkSQL and Spark Streaming using Apache Zeppelin notebooks.
We will briefly cover Spark RDDs and then focus on Spark SQL DataFrames. We will use an interactive Zeppelin notebook to analyze and visualize an airline dataset with Spark SQL. Finally, we will write and run a simple Spark Streaming application.
We will use basic Scala syntax in our Spark SQL applications. If you would like to learn more about Scala here's an excellent resource: http://www.dhgarrette.com/nlpclass/scala/basics.html
To participate in the Hands-on-Labs, you will need to bring your own laptop with Hortonworks HDP Sandbox pre-loaded.
Please Note
To participate in the Hands-on Labs: Download the VM image (single-node): http://hortonworks.com/sandbox
For Windows - VMware Player (free download) is required to run a VMware image: https://my.vmware.com/web/vmware/free#desktop_end_user_computing/vmware_player/6_0
For Mac - VMware Fusion (free trial) https://www.vmware.com/products/fusion/

Sponsors
Hands-on Introduction to Spark & Zeppelin