Interactive OLAP Queries using Cassandra and Spark

  • July 16, 2014 · 6:15 PM
  • White Pages

Session Details:
How do you rapidly derive complex insights on top of really big data sets in Cassandra? This session draws upon Evan's experience building a distributed, interactive, columnar query engine on top of Cassandra and Spark. We will start by surveying the existing query landscape of Cassandra and discuss ways to integrate Cassandra and Spark. We will dive into the design and architecture of a fast, column-oriented query architecture for Spark, and why columnar stores are so advantageous for OLAP workloads. I will present a schema for Parquet-like storage of analytical datasets onCassandra. Find out why Cassandra and Spark are the perfect match for enabling fast, scalable, complex querying and storage of big analytical data.

About the Speaker:
Evan loves to design, build, and improve bleeding edge distributed data and backend systems using the latest in open source technologies. He has led the design and implementation of multiple big data platforms based on Storm, Spark, Kafka, Cassandra, and Scala/Akka, including a columnar real-time distributed query engine. He is an active contributor to the Apache Spark project and co-creator of the open-source Spark Job Server. He is a big believer in GitHub, open source, and meetups, and have given talks at various conferences including the Spark Summit and Cassandra Summit. He has Bachelor's and Master's degrees in Electrical Engineering from Stanford University.

Location Information:
Use the 4th ave entrance, up to 3rd floor and take elevator to 16

6:15 to 6:45 : Networking / food & drinks
6:45 to 7:30 : Main session
7:30 to 8:00 : Open Q&A & Wrap up

Join or login to comment.

  • Joseph

    Claudiu Barbura(Director Engineering, Atigeo) is addressing attendees on "Spark, Shark, Mesos, Tachyon, Cassandra at scale" at 15th Big Data Bootcamp Seattle August 8-10, Seattle Attend 3 day Big Data Bootcamp starting Friday August 8,2014 @ Washington State Convention Center,Seattle

    To attend any One Day: Price $799 ( $100 discount, Use Discount code MEETUP)
    To attend any Two Days: Price $1099 ( $100 discount, Use Discount code MEETUP)
    To attend all Three Days: Price $1499 ( $100 discount, Use Discount code MEETUP)
    Discount expires on July 24 Register:­ August 08th-10th

    Global Big Data Conference is offering 3 days extensive bootcamp(August 8th - 10th) on Big Data. This is a fast paced,vendor agnostic. No prior knowledge of databases or programming is assumed. Big Data Bootcamp is targeted towards both technical and non-technical people who want to understand the emerging world of Big Data, with a specific focus on Hadoop, NoSQL & Machine learning

    July 22

  • Denny L.

    Here are Evan's slides - enjoy!­

    1 · July 17

  • Jeff H.

    I attended early session at Expedia today. It was great. Thanks Evan!

    July 16

  • Ben S.

    I really would have liked to attend this one but couldn't make it. Will a recording and/or slides be posted afterward?

    July 16

    • Denny L.

      Yes we will! After the session we'll prop them up.

      July 16

People in this
Meetup are also in:

Sometimes the best Meetup Group is the one you start

Get started Learn more

I'm surprised by the level of growth I've seen since becoming an organizer, it's given me more confidence in my abilities.

Katie, started NYC ICO

Start your Meetup today

Act now and get 50% off.
Until February 1.

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy