Spark Meetup in June: GraphX and DataFrames

Hosted By
Bogdan P.

Details

The speaker lineup for the next Spark Meetup is fixed, and this time we have only local speakers from Vienna!

First, Christoph Körner (https://www.meetup.com/Vienna-Kaggle/members/97133082/) will talk about Oozie and GraphX. After that, since nobody else replied to the Call for Talks, I decided to give a talk myself about something I've recently been working with: various libraries for DataFrames. These are the abstracts for the talks:

Using Oozie to schedule GraphX Jobs in Spark (Christoph Körner)

After a quick intro to Apache Oozie (a workflow scheduler for Hadoop) and GraphX in Spark, I will show how Oozie can be used to automatically schedule graph computations using GraphX in a Hadoop environment.

Data Science with Python - a comparison of DataFrames in Pandas, Spark and GraphLab Create (Bogdan Pirvu)

After a brief introduction to Spark and GraphLab Create, I'll explain the DataFrame concept. Then, in a live demo, I'll showcase the common features and differences between DataFrames in Spark, Pandas and GraphLab Create. I'll finish with the pros and cons that I have experienced and conclude with my current personal preference.

Schedule:

19:00 - Start with some drinks and chatting

19:15 - "Using Oozie to schedule GraphX Jobs in Spark" (Christoph Körner)

20:00 - 5 min break

20:05 - "Data Science with Python - a comparison of DataFrames in Pandas, Spark and GraphLab Create" (Bogdan Pirvu)

20:50 - Food, more drinks & chatting.

Looking forward to seeing you at the Novomatic Forum!

----------------------------------------

This event is sponsored by

http://photos3.meetupstatic.com/photos/event/7/c/e/e/600_450151982.jpeg

Vienna AI Engineering
Novomatic Forum
Friedrichstraße 7, 1010 · Vienna