The speaker lineup for the next Spark Meetup is fixed, and this time we have only local speakers from Vienna!
First Christoph Körner ( http://www.meetup.com/Vienna-Kaggle/members/97133082/), will talk about Oozie and GraphX. After that, since nobody else replied to the Call for Talks, I decided to give a talk myself about something I've been recently working with: various libraries for DataFrames. These are the abstracts for the talks:
Using Oozie to schedule GraphX Jobs in Spark (Christoph Körner)
After a quick intro to Apache Oozie (a workflow scheduler for Hadoop) and GraphX in Spark, I will show how it can be used to automatically schedule Graph computations using GraphX in an Hadoop environment.
Data Science with Python - a comparison of DataFrames in Pandas, Spark and GraphLab Create (Bogdan Pirvu)
After a brief introduction to Spark and GraphLab Create I'll explain the DataFrame concept. Then I'll showcase the common features and differences between DataFrames in Spark, Pandas and GraphLab Create during a life demo. I'll finish with the pros and cons that I have experienced and conclude with my current personal preference.
19:00 - Start with some drinks and chatting
19:15 - "Using Oozie to schedule GraphX Jobs in Spark" (Christoph Körner)
20:00 - 5 min break
20:05 - "Data Science with Python - a comparison of DataFrames in Pandas, Spark and GraphLab Create" (Bogdan Pirvu)
20:50 - Food, more drinks & chatting.
Looking forward to see you at the Novomatic Forum!
This event is sponsored by