Meet Apache Spark: a faster and more flexible compute engine


Details
Please join us as we take a first look at Spark, a fast and general engine for large-scale data processing that can run programs up to 100x faster than MapReduce in memory, or 10x faster on disk.
Spark has emerged as a new darling of the open-source world, with widespread take-up by data teams and developers, backed by a highly active community. It provides a unified framework to create sophisticated applications involving workloads that until now might have required several systems: Everything from ETL to machine learning, stream processing and interactive queries is possible.
Spark over HDFS is a common deployment option, but it can also run on different storage engines, so in Amazon it runs over S3, and some run it on top of Cassandra, in a Mesos cluster or standalone.
While Spark has Java and Python APIs too, Scala makes it a breeze to work with its distributed collections.
We'll have pizza and drinks at 6:15 and present at 7-8pm, with a Q&A afterwards.
The event will be hosted by ImpactRadius - http://www.impactradius.com and is being presented by Cloudera http://www.cloudera.com
More info:
http://www.zdnet.com/databricks-ceo-why-so-many-firms-are-fired-up-over-apache-spark-7000035773

Meet Apache Spark: a faster and more flexible compute engine