Skip to content

Scalable Big Graph Data Processing in Spark (Hands-on)

Photo of Rindra
Hosted By
Rindra
Scalable Big Graph Data Processing in Spark (Hands-on)

Details

(https://spark.apache.org/graphx/)Come to learn how to harness Spark's GraphX library in processing large-scale graph data such as social, biological or web networks.

GraphX (https://spark.apache.org/graphx/) is Spark's rich API for graphs and graph-parallel computation. It enables ETL, exploratory analysis, and iterative graph computation within a Spark application.

In this meetup, you will see GraphX in action through some illustrative code and examples. We will walk through the process of building, exploring and analyzing different networks and graphs. When working with messy datasets, you will learn also how to transform raw datasets into a usable form. In addition, we will cover the powerful and generic graph operations in Spark, which can be used to transform graphs or to implement graph-parallel iterative algorithms.

Schedule

6:00-6:30 Food & Networking

6:30-7:30 Presentation

7:30-8:00 Networkng

Aknowledgements

Big thanks for Simba Technologies and George Chow for hosting the meetup.

About Spark

Apache Spark is the next standard of open-source cluster-computing engine for processing big data.

Photo of Vancouver Apache Spark Meetup group
Vancouver Apache Spark Meetup
See more events
Simba Technologies
938 West 8th Ave · Vancouver, BC