Skip to content

Details

Agenda

a) Hands-on - Build a Data analytics application using SPARK, Hortonworks, and Zeppelin. The session explains RDD concepts, DataFrames, sqlContext, use SparkSQL for working with DataFrames and explore graphical abilities of Zeppelin.

b) Follow along - Build a Recommendation Engine - This will show how to build a predictive analytics (MLlib) recommendation engine with scoring This will give a better understanding of architecture and coding in Spark for ML.

Hands-on session (pre-reqs) - Please download and come to the session

  1. Laptop with at least 8G or RAM

  2. Download and install Hortonworks Sandbox on VM http://hortonworks.com/products/hortonworks-sandbox/#install

  3. Please follow pre-requisite steps to download datasets needed for the labs otherwise you will only be able to watch and not participate. The instructions are in the following github link

https://github.com/zeltovhorton/njmeetup_spark

You will need to do both section HDP Setup and run the commands as zeppelin user to download the datasets and load them into hdfs.

Speaker: Alex Zeltov (Sr. Solutions Engineer, Hortonworks)

Members are also interested in