Intro to Data Analysis with Scala and Spark

Hosted By
Jenn

Details

To get the most out of the meetup, please set aside about two hours beforehand to prepare:

  1. Download Spark:

http://spark.apache.org/downloads.html

Choose a Spark release: 1.5.2 (Nov 09 2015)

Choose a package type: Pre-built for Hadoop 2.4 and later

Choose a download type: Direct Download

Then click on spark-1.5.2-bin-hadoop2.4.tgz

  2. Download Java SE Development Kit 8:

http://www.oracle.com/technetwork/java/javase/downloads/jdk8-downloads-2133151.html

Under 'Java SE Development Kit 8u65', click 'Accept License Agreement', then click on the image corresponding to your platform. For example, on a Mac, you'll click 'jdk-8u65-macosx-x64.dmg (http://download.oracle.com/otn-pub/java/jdk/8u65-b17/jdk-8u65-macosx-x64.dmg)'.

If you need more info on installing Java, you can refer to Chapter 1 of Java: A Beginner's Tutorial (https://www.safaribooksonline.com/library/view/java-a-beginners/9780992133047/Text/ch01.xhtml).

  3. Download Scala:

http://www.scala-lang.org/download/

Click on Scala 2.11.7.

  4. Download the dataset by clicking "Loan Data from Prosper" at this link:

https://docs.google.com/document/d/1qEcwltBMlRYZT-l699-71TzInWfk4W9q5rTCSvDVMpc/pub

  5. Read Chapter 2 of Advanced Analytics with Spark:

https://www.safaribooksonline.com/library/view/advanced-analytics-with/9781491912751/ch02.html#DataCleansingAggregate
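
Once everything is downloaded, a quick way to check that the pieces work together is to load the dataset in the Spark shell. The snippet below is only a minimal sketch, not part of the official prep: the file path and the helper name isHeader are placeholders chosen for illustration, and it assumes the dataset is a CSV file whose first line is a header row.

```scala
// Paste into spark-shell (started with ./bin/spark-shell from the unpacked
// Spark directory), where `sc`, the SparkContext, is already defined.

// ASSUMPTION: hypothetical path; replace it with wherever you saved the dataset.
val dataPath = "path/to/prosperLoanData.csv"

// Hypothetical helper: true when a line is identical to the header row.
def isHeader(line: String, header: String): Boolean = line == header

val rawLines = sc.textFile(dataPath)   // RDD[String], one element per line of the file
val header = rawLines.first()          // first line, assumed to hold the column names
val records = rawLines.filter(line => !isHeader(line, header))

println(header)
println(s"Lines after the header: ${records.count()}")
records.take(5).foreach(println)       // peek at a few raw lines
```

If the header and a few rows print, Spark, Java, and the dataset are all in place. Note that this counts lines rather than parsed CSV records, so the number may differ from the true row count if any quoted field spans several lines.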

WiFi at Hacker Dojo

  1. Select "HD-Day Pass" as network

  2. Click "Free Trial (120 minutes, 17.8 Mbps)" under "Available price plans"

  3. Click "Continue"

Hands-on Machine Learning
Large Event Room at Hacker Dojo
599 Fairchild Dr · Mountain View, CA