Intro to Apache Spark for Java and Scala Developers


Details
6:00-7:00: Socializing (food, beverages - Thanks New Relic! (http://newrelic.com/))
7:00-7:10: Announcements
7:10-8:30: Intro to Apache Spark for Java and Scala Developers
8:30-8:45 Q & A
Intro to Apache Spark for Java and Scala Developers
With the rapid adoption of Apache Spark—one of the most active Apache projects today—and the need for programs to span many machines to solve the world’s greatest problems, distributed computing has resurfaced as a hot commodity that can take your career to the next level and—more important—that can open the door to some really cool and impactful apps. The goal of this presentation is to introduce Java and Scala developers to basic Spark concepts such as DAGs, RDDs, transformations, actions, and executors. Attendees will also learn how their mindset must evolve beyond Java or Scala code that runs in a single JVM. We will also do a deeper dive into three sub-projects of Apache Spark: SparkSql, Spark Streaming and MLlib.
About the Speaker
Anand Iyer is a senior product manager at Cloudera, the leading vendor of open source Apache Hadoop. His primary areas of focus are platforms for real-time streaming, Apache Spark, and tools for data ingestion into the Hadoop platform. Before joining Cloudera, he worked as an engineer at LinkedIn, where he applied machine learning techniques to improve the relevance and personalization of LinkedIn’s Feed. Anand has extensive experience in leveraging big data platforms to deliver products that delight customers. He has a master’s in computer science from Stanford and a bachelor’s from the University of Arizona.

Intro to Apache Spark for Java and Scala Developers