Building Real-world Machine Learning Apps with PredictionIO and Spark MLlib


Details
PredictionIO is an open source machine learning server, and its latest version is built on Apache Spark and MLlib. The project ranks top on Github with over 5000 engaging developers. PredictionIO is designed for data scientists and developers to build predictive engines for real-world applications in a fraction of the time normally required. In this talk, Simon will introduce the latest developments of PredictionIO, and show how to use it to build and deploy predictive engines in real production environments. Using PredictionIO’s DASE design pattern, Simon will illustrate how developers can develop machine learning applications with the separation of concerns (SoC) in mind. “D" stands for Data Source and the Data Preparator, which take care of the preparation of data for model training. “A" stands for Algorithm, which is where the code of one or more algorithms are implemented. MLlib, the machine learning library of Apache Spark, is natively supported here. “S” stands for Serving, which handles the application logic during the retrieval of predicted results. Finally, “E” stands for Evaluation. Simon will also cover upcoming development work, including new Engine Templates for various business scenarios.
Bio:
Simon Chan is a co-founder of PredictionIO, with years of experience in the tech industry in London, Hong Kong, Mainland China and Silicon Valley. His doctoral research work at University College London was on machine learning techniques for large-scale user preference prediction in noisy non-experimental environments. 6:30 pm - 7 pm ( social and networking)
7 pm - 7: 05 pm ( introduction and announcement)
7:05 pm - 8 :15 pm ( speaker talks )
8:15 - 8:30 pm Q&A
8:30 pm (meeetup ends)
9: 00 pm ( office close)

Building Real-world Machine Learning Apps with PredictionIO and Spark MLlib