Beyond MapReduce - In-memory Analysis with Spark and Shark

Hosted By
Robbie S.

Details
The data analysis landscape has changed dramatically in a short time. Hadoop and the MapReduce paradigm make sense for some use cases, but in this talk I'll introduce you to high-speed in-memory analysis using Spark. Spark's developer-friendly, collection-based API is reason enough to take a look, not to mention its support for streaming analysis. Shark lets you run HiveQL queries against your Spark cluster. I'll show you how to get everything up and running, and how to make it work with open source Cassandra.

Atlanta Cassandra Users Group
The Weather Channel Inc.
300 Interstate North Pkwy SE · Atlanta, GA