Intro to Data Analysis with Scala and Spark


Details
To benefit from the gathering, please spend two hours beforehand to prepare:
- Download spark:
http://spark.apache.org/downloads.html
Choose a Spark release: 1.5.2 (Nov 09 2015)
Choose a package type: Pre-built for Hadoop 2.4 and later
Choose a download type: Direct Download
Then click on spark-1.5.2-bin-hadoop2.4.tgz
- Download Java SE Development Kit 8:
http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html
Under ' Java SE Development Kit 8u65', click 'accept license agreement' then click on an image corresponding to your platform. For example, for a Mac, you'll click 'jdk-8u65-macosx-x64.dmg (http://download.oracle.com/otn-pub/java/jdk/8u65-b17/jdk-8u65-macosx-x64.dmg)'
If you need more info on installing Java, you can refer to Chapter 1 of Java: A Beginner's Tutorial (https://www.safaribooksonline.com/library/view/java-a-beginners/9780992133047/Text/ch01.xhtml).
- Download Scala:
http://www.scala-lang.org/download/
click on Scala 2.11.7
- Download the dataset by clicking "Loan Data from Prosper" at this link:
https://docs.google.com/document/d/1qEcwltBMlRYZT-l699-71TzInWfk4W9q5rTCSvDVMpc/pub
- Read chapter 2 of Advanced Analytics with Spark:
WiFi at Hacker Dojo
-
Select "HD-Day Pass" as network
-
Click "Free Trial (120 minutes, 17.8 Mbps)" under "Available price plans"
-
Click "Continue"

Intro to Data Analysis with Scala and Spark