Skip to content

Details

To benefit from the gathering, please spend two hours beforehand to prepare:

  1. Download spark:

http://spark.apache.org/downloads.html

Choose a Spark release: 1.5.2 (Nov 09 2015)

Choose a package type: Pre-built for Hadoop 2.4 and later

Choose a download type: Direct Download

Then click on spark-1.5.2-bin-hadoop2.4.tgz

2. Download Java SE Development Kit 8:

http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html

Under ' Java SE Development Kit 8u65', click 'accept license agreement' then click on an image corresponding to your platform. For example, for a Mac, you'll click 'jdk-8u65-macosx-x64.dmg (http://download.oracle.com/otn-pub/java/jdk/8u65-b17/jdk-8u65-macosx-x64.dmg)'

If you need more info on installing Java, you can refer to Chapter 1 of Java: A Beginner's Tutorial (https://www.safaribooksonline.com/library/view/java-a-beginners/9780992133047/Text/ch01.xhtml).

3. Download Scala:

http://www.scala-lang.org/download/

click on Scala 2.11.7

4. Download the dataset by clicking "Loan Data from Prosper" at this link:

https://docs.google.com/document/d/1qEcwltBMlRYZT-l699-71TzInWfk4W9q5rTCSvDVMpc/pub

5. Read chapter 2 of Advanced Analytics with Spark:

https://www.safaribooksonline.com/library/view/advanced-analytics-with/9781491912751/ch02.html#DataCleansingAggregate

WiFi at Hacker Dojo

  1. Select "HD-Day Pass" as network
  2. Click "Free Trial (120 minutes, 17.8 Mbps)" under "Available price plans"
  3. Click "Continue"

Related topics

You may also like