Skip to content

Spark + Cassandra: Working Together for Good

Photo of mark quinsland
Hosted By
mark q.
Spark + Cassandra:  Working Together for Good

Details

Apache Spark is the hottest open-source software product in the Big Data ecosystem. It provides a single framework that can be used for streaming analytics, machine learning, recommendation systems, and massively parallel computations. It is significantly easier to use than the tools that it replaces: Map/Reduce, Pig, Hive, Sqoop, Mahout, and Storm. And the latest version includes better support for dataframes, pandas, and R integration. Spark just won the award for sorting 100TB of data - smashing the old record set by Yahoo.

Cassandra has earned a spot at the top of the NoSQL rankings by combining an obsession with scalability, reliability, and availability with a simple SQL-like interface.
It is used on a massive scale by companies such as Apple, Macy's, Sony, and Netflix.

DataStax, the creator of Cassandra, has created a Cassandra/Spark connector to allow 2 best-of-breed products to work together seamlessly.

Agenda:

6:30 networking, pizza, and beer

7:00-8:00 Cassandra + Spark Working Together for Good

Presenter: - Russell Spitzer has a BioInformatics Ph.D. from UCSF and is the resident Spark Analytics expert at DataStax. He has given wildly popular presentations about how to combine these two powerful tools. Please come on out to learn a lot, ask questions, meet your peers, share your experiences, get a Cassandra T-Shirt, and enjoy a cold beer with the pizza.

Location: Exit Certified Address: 8950 Cal Center Drive, Suite 110, Building 1, Sacramento, CA 95826

NOTE: The doors will be locked after 5:30 We will have somebody at the doors to let you in, but please try to be on time. If you arrive after we start, just text/call (916) 538-2708 and I'll come out to rescue you.

Photo of NorCal BigData User Group group
NorCal BigData User Group
See more events
ExitCertified
8950 Cal Center Drive Suite 110, Bldg. 1 · Sacramento, CA