
Rethinking SQL for Big Data with Apache Drill

Hosted by Alex Rovner

Details

We are going to start off with a presentation by Jacques Nadeau, who is the CTO and co-founder of Dremio. He is also the founding PMC chair of the open source Apache Drill project, spearheading the project’s technology and community.

Jacques will cover recent innovations in Drill and discuss upcoming items. He'll talk about a number of existing use cases where customers are using Drill. He'll also cover developments around support for more types of data storage systems, greater flexibility, and continued performance improvements. Lastly, he'll touch on some new things the Drill community will be releasing in the coming months.

Jim Scott will then lead us through the hands-on demo:

Learn the basics of Apache Drill, from installing the tool to running your first query, within minutes. The demo will show users how to install and set up Drill quickly and start getting value out of the data on their computers. Drill can query data in many formats and from many data sources, including JSON, CSV, HBase, and even MongoDB. The talk will also cover interesting use cases and code snippets for querying different data formats and data sources in a self-service fashion, without the pain of creating centralized schemas or metadata stores.
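
To give a feel for what querying a file "in place" looks like, here is a minimal sketch, not material from the demo itself. It only builds the SQL text and the JSON body that Drill's REST endpoint (POST /query.json) accepts; it does not contact a server. The helper name `build_drill_query`, the file path `/tmp/orders.json`, and the use of the default `dfs` storage plugin are assumptions made for illustration.

```python
import json


def build_drill_query(path, limit=5, storage_plugin="dfs"):
    """Build a REST payload that asks Drill to query a file directly.

    Hypothetical helper for illustration; not part of Drill itself.
    """
    # Drill addresses a file as <plugin>.`<path>`; the backticks quote the path.
    # No table definition or schema is declared beforehand.
    sql = f"SELECT * FROM {storage_plugin}.`{path}` LIMIT {int(limit)}"
    # Drill's REST API expects a JSON body with these two keys.
    return {"queryType": "SQL", "query": sql}


# Assumed example file; replace with a JSON or CSV file on your machine.
payload = build_drill_query("/tmp/orders.json")
print(json.dumps(payload, indent=2))
```

With a Drill instance running locally, this payload could be posted to its web port (8047 by default) at /query.json, or the same SQL could be typed into the Drill shell.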

This is a hands-on demo, so if you want to follow along, please download Drill from the following page: Download Drill (http://events.mapr.com/HadoopNYC) (see the "Additional Resources" section).

Jim Scott has held positions running Operations, Engineering, Architecture, and QA teams and is the co-founder of the Chicago Hadoop Users Group (CHUG), where he has coordinated the Chicago Hadoop community for the past five years. Jim has worked in the Consumer Packaged Goods, Digital Advertising, Digital Mapping, Chemical, and Pharmaceutical industries, where he has built systems that handle more than 50 billion transactions per day. Jim's work with high-throughput computing at Dow Chemical was a precursor to more standardized big data concepts like Hadoop.

LinkedIn: https://www.linkedin.com/in/kingmesal

New York Hadoop User group
Magnetic
360 Park Ave South, Floor 19 · New York, NY