Data Mining with Apache Spark (UHUG meeting)


Details
Data Mining with Apache Spark:
Big Data brings Big Challenges and opportunities. How to make it easier for data scientists and analysts to do their job is a big deal these days. This talk covers why we are using Apache Spark and how to use Spark to do data mining potentially machine learning. Spark overview will be covered just in case. A couple small code examples in Spark SQL, MLlib and machine learning will be given. A few small demos at the end if we have time.
BIO:
Yugang Hu is a principal scientist at Overstock.com (https://www.linkedin.com/redir/redirect?url=Overstock%2Ecom&urlhash=iuqK). He got his master degree in Computer Science from Peking University in China in 1996. He created ecommerce search engine on top of Solr/Lucene in 2011, and built recommendation engine using Mahout, Pig and Java in 2012 both for Overstock. He has been doing research in new Big Data technologies such as data mining, and machine learning for the last couple years.

Data Mining with Apache Spark (UHUG meeting)