
Hands-on Introduction to Spark & Zeppelin

Hosted By
Future of D.

Details

Join us for an intro and overview of Apache Spark (http://spark.apache.org/), SparkSQL and Spark Streaming using Apache Zeppelin notebooks.

We will briefly cover Spark RDDs and then focus on Spark SQL DataFrames. We will use an interactive Zeppelin notebook to analyze and visualize an airline dataset with Spark SQL. Finally, we will write and run a simple Spark Streaming application.
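
As a rough preview of the Spark SQL portion, here is a minimal sketch of a DataFrame query over flight records. It is not the lab notebook itself. It assumes Spark 2.x or later is on the classpath, and the `Flight` case class, its column names, and the three sample rows are hypothetical stand-ins for the airline dataset. In a Zeppelin notebook the session is normally provided for you, so the builder lines would not be needed there.

```scala
import org.apache.spark.sql.SparkSession

object AirlineDelaySketch {
  // Hypothetical record shape; the real airline dataset's columns may differ.
  case class Flight(carrier: String, origin: String, depDelay: Int)

  def main(args: Array[String]): Unit = {
    // Local session for a standalone run; Zeppelin usually supplies one already.
    val spark = SparkSession.builder()
      .appName("AirlineDelaySketch")
      .master("local[*]")
      .getOrCreate()
    import spark.implicits._

    // Tiny in-memory sample standing in for the airline dataset.
    val flights = Seq(
      Flight("AA", "DUB", 10),
      Flight("EI", "DUB", 0),
      Flight("AA", "JFK", 30)
    ).toDF()

    // Register the DataFrame as a temporary view so it can be queried with SQL.
    flights.createOrReplaceTempView("flights")

    // Average departure delay per carrier, highest first.
    val byCarrier = spark.sql(
      """SELECT carrier, AVG(depDelay) AS avgDelay
        |FROM flights
        |GROUP BY carrier
        |ORDER BY avgDelay DESC""".stripMargin)

    byCarrier.show()
    spark.stop()
  }
}
```

Registering a temporary view lets the same data be queried from SQL or through the DataFrame API, whichever reads more naturally.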

We will use basic Scala syntax in our Spark SQL applications. If you would like to learn more about Scala beforehand, here is an excellent resource: http://www.dhgarrette.com/nlpclass/scala/basics.html
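
For a sense of what "basic Scala syntax" means here, the following plain-Scala sketch (no Spark required) shows immutable values, a case class, a small function, and collection operations. The `Flight` class and the sample values are made up for illustration and are not taken from the lab material.

```scala
object ScalaBasics {
  // A case class is a compact, immutable record type.
  case class Flight(carrier: String, delayMinutes: Int)

  // A function with an explicit return type; returns 0.0 for an empty input.
  def averageDelay(flights: Seq[Flight]): Double =
    if (flights.isEmpty) 0.0
    else flights.map(_.delayMinutes).sum.toDouble / flights.size

  def main(args: Array[String]): Unit = {
    // `val` declares an immutable value.
    val flights = Seq(Flight("AA", 10), Flight("EI", 0), Flight("AA", 30))

    // Keep only the flights that were actually delayed.
    val delayed = flights.filter(_.delayMinutes > 0)

    // Group by carrier and compute the average delay for each group.
    val byCarrier = flights.groupBy(_.carrier).map {
      case (carrier, group) => carrier -> averageDelay(group)
    }

    println(s"Delayed flights: ${delayed.size}")
    byCarrier.foreach { case (carrier, avg) => println(s"$carrier: $avg") }
  }
}
```

The same `map`, `filter`, and `groupBy` style carries over to Spark's RDD and DataFrame APIs.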

To participate in the Hands-on Labs, you will need to bring your own laptop with the Hortonworks HDP Sandbox pre-loaded.

Please Note
To participate in the Hands-on Labs, download the single-node VM image before the event: http://hortonworks.com/sandbox

For Windows - VMware Player (free download) is required to run a VMware image: https://my.vmware.com/web/vmware/free#desktop_end_user_computing/vmware_player/6_0

For Mac - VMware Fusion (free trial): https://www.vmware.com/products/fusion/

Future of Data: Dublin