Claudio Martella presents "Massive-scale graph partitioning with Giraph"
Claudio Martella (Google Sciengineer, https://twitter.com/claudiomartella) presents the talk:
"Massive-scale graph partitioning with Giraph"
Based on his paper "Spinner: Scalable Graph Partitioning in the Cloud" (by Claudio Martella, Dionysios Logothetis, Andreas Loukas, Georgos Siganos) downloadable from https://arxiv.org/abs/1404.3861
We present Spinner, a scalable and adaptive graph partitioning algorithm based on label propagation designed on top of the Pregel model. Spinner scales to massive graphs, produces partitions with locality and balance comparable to the state-of-the-art and efficiently adapts the partitioning upon changes. We describe our algorithm and its implementation in the Pregel programming model that makes it possible to partition billion-vertex graphs. We evaluate Spinner with a variety of synthetic and real graphs and show that it can compute partitions with quality comparable to the state-of-the art. In fact, by using Spinner in conjunction with the Giraph graph processing engine, we speed up different applications by a factor of 2 relative to standard hash partitioning.
The paper was presented in 2017 at the 33rd IEEE International Conference on Data Engineering (ICDE)
Claudio Martella is a Sciengineer, currently working at Google. He is interested in understanding and modeling complex behaviour with data, by means of sensors, machine learning, and large-scale distributed systems. He holds a PhD in Computer Science from Vrije Universiteit Amsterdam. He is also a member of the Hadoop project Apache Giraph.
Wellcome Trust, Gibbs Building
215 Euston Road
London NW1 2BE
The nearest tube station is Euston/Euston Square. The schedule:
• 6.30pm: pizza and drinks
• 7:00pm: presentation starts
• 8:00pm: presentation end
Meetup are captured on video and uploaded on the PWL London Youtube channel https://www.youtube.com/channel/UCEYe-1uDIkjPtuH_qhoybnA