Skip to content

Meet Apache Spark: a faster and more flexible compute engine

Photo of Kip Sigman
Hosted By
Kip S. and Mauricio A.
Meet Apache Spark: a faster and more flexible compute engine

Details

Please join us as we take a first look at Spark, a fast and general engine for large-scale data processing that can run programs up to 100x faster than MapReduce in memory, or 10x faster on disk.

Spark has emerged as a new darling of the open-source world, with widespread take-up by data teams and developers, backed by a highly active community. It provides a unified framework to create sophisticated applications involving workloads that until now might have required several systems: Everything from ETL to machine learning, stream processing and interactive queries is possible.

Spark over HDFS is a common deployment option, but it can also run on different storage engines, so in Amazon it runs over S3, and some run it on top of Cassandra, in a Mesos cluster or standalone.

While Spark has Java and Python APIs too, Scala makes it a breeze to work with its distributed collections.

We'll have pizza and drinks at 6:15 and present at 7-8pm, with a Q&A afterwards.

The event will be hosted by ImpactRadius - http://www.impactradius.com and is being presented by Cloudera http://www.cloudera.com

More info:

https://spark.apache.org

http://www.zdnet.com/databricks-ceo-why-so-many-firms-are-fired-up-over-apache-spark-7000035773

Photo of Scala SB group
Scala SB
See more events
ImpactRadius
10 E Figueroa - 2nd floor · Santa Barbara, CA