Skip to content

Details

The data analysis landscape has changed dramatically in a short amount of time. Hadoop and the MapReduce paradigm make sense for some use cases, but I'll introduce you to high-speed in-memory analysis using Spark. Spark's developer-friendly collection-based API is reason enough to take a look, not to mention its support for streaming analysis. Shark allows you to run HiveQL queries against your Spark cluster. I'll show you how to get everything up and running--and working with open source Cassandra.

Members are also interested in