Introduction to Apache Spark 2.x and running a Spark cluster in the cloud

Are you going?

35 people going



This talk briefly covers big data concepts, distributed data processing frameworks, and dives into Spark's Architecture and High-Level APIs for processing data with demos of using spark-shell and pyspark for developing Spark applications locally and setting up a Spark cluster in AWS using EMR (Elastic MapReduce) to process data in S3