Apache Spark Hands-on Workshop


Details
Hands-on introduction to Spark
Over the last three years Apache Spark has become one of the key frameworks for big data processing. It is designed for performance, scalability and ease-of-use. The release of Spark 2.0 in 2016 marked a major milestone, because DataFrames, a kind of relational tables, became the main API not only as basis for Spark SQL, but also for Machine Learning and Streaming. During the Meetup we will give a hands-on introduction to Spark's DataFrame API, Spark SQL and Spark Machine Learning.
The workshop will make use of Databrick's community cloud. You will need to register here (https://databricks.com/try-databricks) for a free community login.
The workshop will be held in English, if we have English-speaking guests.
Speakers
Jens Albrecht is a Professor at the Technische Hochschule Nürnberg. He is also working as a professional trainer and consultant for Big Data Technologies.
Marc Fiedler is a Data Architect at GfK SE. He has a strong background in Data Management and Business Intelligence.
Jens and Mark published together two articles about Apache Spark in the computer magazine iX at the beginning of 2017, which build the basis for this workshop.

Apache Spark Hands-on Workshop