Machine Learning with Big Data using Spark


Details
We have a very exciting presentation by Aaron Richter. Aaron began as a programmer before moving into data pipelining and analytics where he found a love for data science. He is currently working on a Computer Science PhD focusing on Data Mining & Machine Learning as well as being a practicing data scientist for Modernizing Medicine (http://modmed.com (http://modmed.com/)), an innovative EHR system for several surgical specialties.
In this meetup Aaron will give us a quick background on the Hadoop ecosystem, and how Hadoop MapReduce and Apache Spark work. Then we will build a tweet sentiment classifier using Logistic Regression in Spark and evaluate new instances by streaming live tweets from Twitter.

Sponsors
Machine Learning with Big Data using Spark