Spark + Cassandra: Working Together for Good


Apache Spark is the hottest open-source project in the Big Data ecosystem. It provides a single framework for streaming analytics, machine learning, recommendation systems, and massively parallel computation. It is significantly easier to use than the tools it replaces: MapReduce, Pig, Hive, Sqoop, Mahout, and Storm. The latest version includes better support for DataFrames, pandas, and R integration. Spark also just won the award for sorting 100 TB of data, smashing the old record set by Yahoo.
Cassandra has earned a spot at the top of the NoSQL rankings by combining an obsession with scalability, reliability, and availability with a simple SQL-like interface.
It is used on a massive scale by companies such as Apple, Macy's, Sony, and Netflix.
DataStax, the leading commercial backer of Apache Cassandra, has created a Spark Cassandra connector that allows two best-of-breed products to work together seamlessly.
Agenda:
6:30 networking, pizza, and beer
7:00-8:00 Cassandra + Spark Working Together for Good
Presenter: Russell Spitzer has a Ph.D. in bioinformatics from UCSF and is the resident Spark analytics expert at DataStax. He has given wildly popular presentations about how to combine these two powerful tools.
Please come on out to learn a lot, ask questions, meet your peers, share your experiences, get a Cassandra T-shirt, and enjoy a cold beer with the pizza.
Location: Exit Certified
Address: 8950 Cal Center Drive, Suite 110, Building 1, Sacramento, CA 95826
NOTE: The doors will be locked after 5:30. We will have somebody at the doors to let you in, but please try to be on time. If you arrive after we start, just text or call (916) 538-2708 and I'll come out to rescue you.
