Spark DataFrames for Large-Scale Data Science With Spark Machine Learning


Details
Join us as we go through the recently announced DataFrames on Spark. I was lucky enough to attend a Databricks meetup in San Jose, so I will try to walk through what the team there has done.
Mark Moloney will also give us an intro to R on Spark, with some hands-on for Spark beginners.
Furthermore, I am so glad to have with us Dr. Zhen He, who will be going through his experience with machine learning on Spark, as his team has completed several projects.
More information about Zhen's talk below:
In the past year my research team have worked on various machine learning projects using Apache Spark. We have worked on both industry projects and research projects. We have some real success stories which really highlight the power and ease of use of Apache Spark for machine learning. At the same time we have also found some real limitations of Apache Spark for large-scale machine learning. In this talk I will try to identify which types of large-scale machine learning problems are particularly suitable for Apache Spark and which are not. I will also dispel some common myths about the Apache Spark framework and the Scala programming language.
