addressalign-toparrow-leftarrow-rightbackbellblockcalendarcameraccwcheckchevron-downchevron-leftchevron-rightchevron-small-downchevron-small-leftchevron-small-rightchevron-small-upchevron-upcircle-with-checkcircle-with-crosscircle-with-pluscrossdots-three-verticaleditemptyheartexporteye-with-lineeyefacebookfolderfullheartglobegmailgooglegroupsimageimagesinstagramlinklocation-pinm-swarmSearchmailmessagesminusmoremuplabelShape 3 + Rectangle 1outlookpersonStartprice-ribbonImported LayersImported LayersImported Layersshieldstartickettrashtriangle-downtriangle-uptwitteruseryahoo

Adding Native SQL Support to Spark with Catalyst

  • Apr 8, 2014 · 6:30 PM

Live Stream:


Tagged, in San Francisco, will be hosting us for a Meetup on Spark SQL. 

This event will be live-streamed and recorded.   

Arrive at 6:30pm, Talks start at 7pm. 

Adding Native SQL Support to Spark with Catalyst

By Michael Armburst

Twitter:  @michaelarmbrust

In this talk I'll describe Spark SQL, a new Alpha component that is part of the Spark 1.0 release.  Spark SQL lets developers natively query data stored in both existing RDDs and external sources such as Apache Hive.  A key feature of Spark SQL is the ability to blur the lines between relational tables and RDDs, making it easy for developers to intermix SQL commands that query external data with complex analytics.  In addition to Spark SQL, I'll also talk about the Catalyst optimizer framework, which allows Spark SQL to automatically rewrite query plans to execute more efficiently.

Join or login to comment.

People in this
Meetup are also in:

Sign up

Meetup members, Log in

By clicking "Sign up" or "Sign up using Facebook", you confirm that you accept our Terms of Service & Privacy Policy