Skip to content

Details

This talk briefly covers big data concepts, distributed data processing frameworks, and dives into Spark's Architecture and High-Level APIs for processing data with demos of using spark-shell and pyspark for developing Spark applications locally and setting up a Spark cluster in AWS using EMR (Elastic MapReduce) to process data in S3

Members are also interested in