Just Enough Scala for Spark

SF Spark and Friends
SF Spark and Friends
Public group
Location image of event venue


This free all-day tutorial is a part of the Cognitive Frameworks Festival (http://festival.framework.foundation) running 6/6-9 in partnership with IBM, Galvanize, and Framework Foundation.

This hands-on tutorial will be presented by Chaoran Yu from our partners at Lightbend, the company behind Scala, in which Spark is written. You can basically think of Spark as a Scala DSL, owing its success to the Scala Collections API.

Students only need to bring their laptops.

Before coming to the class, please clone the following Github repo:


and follow the instructions there to download JDK 8 and Spark Notebook. If your laptop runs Linux or Mac, you don't need Docker. For Windows users, Docker is required. During the class, we won't spend too much time to address environment setup issues, but if for any reason you run into trouble getting the notebook to run, bring your questions to get help!

This tutorial will be the same as the one Chaoran taught at the Spark Summit. The format will be a mix of lecture and hands-on sessions. Topics to be covered are the following:

• Declaring and using classes, methods, and functions

• Immutable values vs. mutable variables

• Type inference: type safety without boilerplate

• Pattern matching

• Scala collections and the common operations on them (the basics of the RDD API)

• Other Scala types like case classes and tuples

• Effective use of the Spark REPL, based on the Scala interpreter

• Common mistakes and how to avoid them, like serialization errors.

• The tutorial is free but our parters support it and space is limited. Please only RSVP if you will come and learn from 9am to 5pm on Thursday. No-shows will be disqualified from the future events.