Skip to content

Faster than Parquet! A deep dive into Kudu

Photo of Michael
Hosted By
Michael
Faster than Parquet! A deep dive into Kudu

Details

Bring your questions (and laptops) for a Q&A with a member of the core team behind Kudu, a low-latency column store for your Spark clusters. There will be discussion of the architecture and future integrations. Food and drink will be provided. We will be filming the talk and posting it to the SF Spark Hackers YouTube page.

Agenda:

6:30: Mingling

7-7:05: Intro's

7:05-8:15: Technical Talk along with Q&A

8:15: Mingling and Breakout Groups

Jean-Daniel, from the Cloudera Kudu team

Jean-Daniel Cryans is a Software Engineer at Cloudera currently working on the Kudu team, and an Apache HBase PMC member. Previous to Cloudera, he worked at StumbleUpon where he worked on HBase while maintaining its production deployment there.

Jean-Daniel will speak about the history of Kudu and the low-level operations that it performs. He will explain the choice to build the product in C++ instead of Java, the decisions that led to winning benchmarks and integrations with Spark.

Interested in learning more?

Clone the repo

git clone https://github.com/cloudera/kudu.git

Join the community

https://getkudu-slack.herokuapp.com/

Submit a PR

http://gerrit.cloudera.org:8080/#/q/status:open+project:kudu

See you at the event!

Photo of Advanced Spark and TensorFlow Meetup (North Bay) group
Advanced Spark and TensorFlow Meetup (North Bay)
See more events
Folsom Reactor
680 Folsom St · San Francisco, CA