Skip to content

How Apache Spark Fits Into The Big Data Landscape

How Apache Spark Fits Into The Big Data Landscape

Details

In conjunction with the Boulder Denver Big Data Meetup (https://www.meetup.com/Boulder-Denver-Big-Data/) we are happy to announce this great event with Paco Nathan!

Apache Spark is intended as a general purpose engine that supports combinations of Batch, Streaming, SQL, ML, Graph, etc., for apps written in Scala, Java, Python, Clojure, R, etc.

This talk provides an introduction to Spark — how it provides so much better performance, and why — and then explores how Spark fits into the Big Data landscape — e.g., other systems with which Spark pairs nicely — and why Spark is needed for the work ahead.

We'll review some of the new features in the 1.1 release, have a demo of notebooks in Databricks Cloud, and also discuss about the new Spark Developer Certificate program.

Paco Nathan, is a "player/coach" who's led innovative Data teams building large-scale apps for several years. Expertise in distributed systems, machine learning, cloud computing, functional programming. Paco is an O'Reilly author -- with a focus on Enterprise data workflows and math literacy among execs, plus a keen interest in Ag+Data -- Apache Spark open source evangelist with Databricks, and an advisor for Amplify Partners. He received his BS Math Sci and MS Comp Sci degrees from Stanford University, and has 30+ years technology industry experience ranging from Bell Labs to early-stage start-ups.

http://cdn.liber118.com/img/paco_oscon.jpg

Photo of Boulder/Denver Data + AI Meetup Group group
Boulder/Denver Data + AI Meetup Group
See more events
Datalogix
10075 Westmoor Drive · Westminster, CO