How Apache Spark Fits Into The Big Data Landscape

Name: How Apache Spark Fits Into The Big Data Landscape
Start: 2014-10-02T18:00:00-06:00
End: 2014-10-02T21:00:00-06:00
Location: Datalogix

Hosted by Boulder/Denver Data + AI Meetup Group

Boulder/Denver Data + AI Meetup Group

Details

In conjunction with the Boulder Denver Big Data Meetup (https://www.meetup.com/Boulder-Denver-Big-Data/) we are happy to announce this great event with Paco Nathan!

Apache Spark is intended as a general purpose engine that supports combinations of Batch, Streaming, SQL, ML, Graph, etc., for apps written in Scala, Java, Python, Clojure, R, etc.

This talk provides an introduction to Spark — how it provides so much better performance, and why — and then explores how Spark fits into the Big Data landscape — e.g., other systems with which Spark pairs nicely — and why Spark is needed for the work ahead.

We'll review some of the new features in the 1.1 release, have a demo of notebooks in Databricks Cloud, and also discuss about the new Spark Developer Certificate program.

Paco Nathan, is a "player/coach" who's led innovative Data teams building large-scale apps for several years. Expertise in distributed systems, machine learning, cloud computing, functional programming. Paco is an O'Reilly author -- with a focus on Enterprise data workflows and math literacy among execs, plus a keen interest in Ag+Data -- Apache Spark open source evangelist with Databricks, and an advisor for Amplify Partners. He received his BS Math Sci and MS Comp Sci degrees from Stanford University, and has 30+ years technology industry experience ranging from Bell Labs to early-stage start-ups.

http://cdn.liber118.com/img/paco_oscon.jpg

Boulder/Denver Data + AI Meetup Group

How Apache Spark Fits Into The Big Data Landscape

Boulder/Denver Data + AI Meetup Group

Details

Related topics

You may also like