Scalable Big Graph Data Processing in Spark (Hands-on)


Details
(https://spark.apache.org/graphx/)Come to learn how to harness Spark's GraphX library in processing large-scale graph data such as social, biological or web networks.
GraphX (https://spark.apache.org/graphx/) is Spark's rich API for graphs and graph-parallel computation. It enables ETL, exploratory analysis, and iterative graph computation within a Spark application.
In this meetup, you will see GraphX in action through some illustrative code and examples. We will walk through the process of building, exploring and analyzing different networks and graphs. When working with messy datasets, you will learn also how to transform raw datasets into a usable form. In addition, we will cover the powerful and generic graph operations in Spark, which can be used to transform graphs or to implement graph-parallel iterative algorithms.
Schedule
6:00-6:30 Food & Networking
6:30-7:30 Presentation
7:30-8:00 Networkng
Aknowledgements
Big thanks for Simba Technologies and George Chow for hosting the meetup.
About Spark
Apache Spark is the next standard of open-source cluster-computing engine for processing big data.

Scalable Big Graph Data Processing in Spark (Hands-on)