We're new here!Join us!
This group is for people interested in Apache Spark and large scale machine learning. Apache Spark is a powerful open source processing engine for Hadoop data built around speed, ease of use, and sophisticated analytics. MLlib is Apache Spark's scalable machine learning library. Tools like scikit-learn, R, etc doesn't scale, even though they provide very rich statistical analysis. In this group we will talk about Spark, machine learning algorithms and how to run them on scale with Spark and MLlib. We will also organize hands-on session to help folks get started with Apache Spark.