Getting started with Spark & Cassandra by Jon Haddad of Datastax

Hosted By
Subash D.

Details

Abstract:
Massively scalable, always on, and ridiculously fast. Apache Cassandra is the database chosen by Apple, Netflix, and 30 of the Fortune 100 to power their critical infrastructure. How do we analyze petabytes of data, whether in massive batch jobs or as it’s ingested via streaming with Apache Kafka? Enter Apache Spark. Challenging MapReduce head-on, Apache Spark offers powerful constructs that make it possible to slice and dice your data, whether through machine learning, graph queries, or transformations familiar to people with functional programming backgrounds, such as map, filter, and reduce. Step away ready to rock with the most powerful distributed database, scalable messaging, and analytics platform on the planet.

Bio: Jon has 15 years of experience in both development and operations. For the last 10, he’s worked at various startups in Southern California. For the last 2 years, he’s been a committer to cqlengine, the Python object mapper for Cassandra. He’s now a Technical Evangelist at Datastax, continuing to focus on advancing Cassandra in both the Python and operations communities.

Los Angeles Apache Spark Users Group
Symantec Corporation
900 Corporate Pointe · Culver City, CA