Skip to content

Introduction to Apache Spark with Hands-on session - In Memory Map-Reduce

Introduction to Apache Spark with Hands-on session - In Memory Map-Reduce

Details

Apache Spark has made a buzz in the industry with its superior performance (100 times faster than Hadoop Map-Reduce).

In this meetup, we are planning to cover a basic overview/need, use-cases, why important for data-scientist/machine learning developers (iterative algorithms) with examples.

Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala and Python, and an optimized engine that supports general execution graphs. It also supports a rich set of higher-level tools including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming.

Agenda:

  1. Brief Introduction
  2. Installation
  3. Brief Introduction about MLib, GraphX, and Spark Streaming
  4. Hands-on sessions - Installation, Few examples, and a small case study.

-----------------------------------------------------------------

Please read the pre-requisites for hands-on here on the below link.

https://www.meetup.com/Hyderabad-Programming-Geeks-Group/messages/boards/thread/47020022

Contact No: Rahul - +91-9908599937

Photo of BigData-Blockchain-AI-ML-XR-3Dprinting-Automation-Robotics group
BigData-Blockchain-AI-ML-XR-3Dprinting-Automation-Robotics
See more events
IIIT-Hyderabad campus, Gachibowli, · Hyderabad